I've started a course on financial mathematics and I'm currently being introduced to stochastical analysis, spesifically Itô's formula. From the book:
It is sometimes useful to use the following shorthand version of [Itô's formula]: $$ \mathrm{d}f(t,X(t)) = \dfrac{\partial f(t,X(t))}{\partial t}\mathrm{d}t + \dfrac{\partial f(t,X(t))}{\partial x}\mathrm{d}X(t) +\dfrac{1}{2}\dfrac{\partial^2 f(t,X(t))}{\partial x^2}(\mathrm{d}X(t))^2 $$ together with the calculation rules $ (\mathrm{d} t)^2 =0,\ \mathrm{d}t\mathrm{d}B(t) = \mathrm{d}B(t)\mathrm{d}t = 0 \text{ and } (\mathrm{d}B(t))^2 = \mathrm{d}t$
$f(t,x)$ is a function twice differentiable in $t$ and twice in $x$. $X(t)$ is a semimartingale. The definition of a semimartingale used in the book is (in case there is several definitions)
The stochastic process $X(t)$ is a semimartingale if there exists two Itô integrable stochastic processes $Y(t)$ and $Z(t)$ such that $$ X(t) = x + \int_0^t Y(s) \mathrm{d}B(s) + \int_0^t Z(s)\mathrm{d}s $$
I'm having a hard time with understanding this short version of Itô's formula. The reason for this is that I'm not familiar with these differential-terms (name?) of type $\mathrm{d}t$ other than in the classical integral. These "calculation rules" that the author speaks of are entirely Greek to me. How should I think of these terms?
Any written, preferably online, resources explaining these terms for a newcomer is also appreciated.
The initial formula can be thought of as Taylor expansion out to second order. The rules can be understood in the framework of Riemann sums. An integral $dB$ in sum form looks like:
$$\sum f_i (B_{i+1}-B_i).$$
An integral $dt$ in sum form looks like
$$\sum f_i (t_{i+1}-t_i).$$
An integral $(dB)^2$ in sum form looks like
$$\sum f_i (B_{i+1}-B_i)^2.$$
An integral $dt dB$ in sum form looks like
$$\sum f_i (t_{i+1}-t_i)(B_{i+1}-B_i).$$
Finally an integral $(dt)^2$ in sum form looks like
$$\sum f_i (t_{i+1}-t_i)^2.$$
When you derive Ito's formula, you prove two things. One is fairly trivial: the fourth and fifth types of sums converge to zero as you refine the mesh. This is because, if the time step size is $h$, then there are $O(1/h)$ terms in the sum and the summands are $o(h)$. The other is quite nontrivial: the third type of sum converges, not to zero, but to an integral of the second type.
We rephrase these two observations heuristically by saying that $(dB)^2=dt$ and anything higher order than $dt$ is zero.
Incidentally, in Radically Elementary Probability Theory (this links to a pdf of the book from the author's web site), Nelson shows that one can actually say things like $dB=\pm \sqrt{dt}$ rigorously within the framework of hyperreal-type nonstandard analysis. In standard analysis language, this amounts to saying that if $D_k$ are iid variables equally likely to be $+1$ or $-1$, then
$$B(t) = \lim_{h \to 0^+} \sum_{k=1}^{\left \lfloor \frac{t}{h} \right \rfloor} \sqrt{h} D_k.$$