I have an undergrad degree in computational mathematics (though that was about 10 years ago), and spent my professional career in software development.
If I wanted to understand what's happening behind the scenes in ML, and not just blindly apply equations, what disciplines would I need to study? Stochastic calculus? Something else?
A list of courses and recommended reading would be the most useful answer, with the goal being to learn over the course of a few years, not a quick fix.
Books: Hastie et al., Elements of Statistical Learning is a grad-level text, 745 pages, pdfs online.
See also stats.stackexchange.com/questions/tagged/machine-learning references .
Learning by doing, in Python: scikit-learn has many algorithms, examples, solid code -- highly recommended.