We had the above mentioned theorem in class recently.
Assume that $(X_n)$ is a nonnegative submartingale. Then for all $n \in \mathrm{N}$ and all $p \ge 2$ $$ \| \max_{k \le n}X_k \|_p \, \le \, \frac{p}{p-1} \|X_n\|_p $$
In the middle of the proof there is one step I can't follow. We introduced a stopping time $\tau$ given by
$$ \tau = \inf \{n : X_n \ge K \} $$
for $K \le \infty$ and then deduced
$$ \mathrm{E} \left[ \max_{k \le n}\left( X_{\tau \land k } \right)^p \right] \le \mathrm E \left[ \max_{k \le n-1} \left( X_{\tau \land k } \right) ^p \right] + \mathrm E \left[ \left( X_n \right)^p \right] \le K^p + \mathrm E \left[ \left( X_n \right)^p \right] .$$
I could see that this is true for example for a Brownian motion but why does this also hold in discrete time. Any help is appreciated very much.
Assume without loss of generality that $p=1$ and consider $M_n=\max\limits_{1\leqslant k\leqslant n}X_{\tau\wedge k}$, then $$M_n=M_n\mathbf 1_{\tau\geqslant n+1}+\sum_{k=1}^nX_k\mathbf 1_{\tau =k}$$
On the event $\{\tau\geqslant n+1\}$, $X_{\tau\wedge k}=X_k<K$ for every $1\leqslant k\leqslant n$ hence $$M_n<K$$
For each $1\leqslant k\leqslant n$, the event $\{\tau=k\}$ belongs to the sigma-algebra $\sigma(X_\ell; 1\leqslant \ell\leqslant k)$ and the process $X$ is a submartingale hence $$E(X_k\mathbf 1_{\tau =k})\leqslant E(X_n\mathbf 1_{\tau =k})$$
Summing these yields $$E(M_n)\leqslant KP(\tau\geqslant n+1)+E(X_n\mathbf 1_{\tau\leqslant n})\leqslant K+E(X_n)$$