I don't understand how to find a logarithm of Jordan block. Wikipedia says, that we can use Mercator series, but I don't know why it works for matrices - I know only the proof for real numbers. In some other books I've only found explanation on the level "by contruction we have that $ e^B=A $" and the elements of matrix $ B $. Can anybody explain formally how to find this logarithm?
Edit. By finding the logarithm I mean not just deriving the formula for it, but rigorous proof that this formula is correct. If $ A $ is a Jordan block of the form $ \lambda I + N $, where $ N $ is nilpotent and $ I $ is identity matrix, I know that
$$ B=\log{A}=I\log{\lambda}+\sum_{j=1}^{n-1}\frac{(-1)^{j+1}}{j\lambda^j}N^j. $$
Unfortunately, I don't understand how to prove that $ e^B=A. $ I tried to do it by the definition of exponential, but I wasn't successful.
A Jordan block is in the form $A=\lambda I_n+J$ where $J$ is the nilpotent Jordan block of dimension $n$ and $\lambda\in\mathbb{C}\setminus \{0\}$.
When $\lambda\notin (-\infty,0]$, the principal logarithm of $A=\lambda(I+u J)$ (where $u=\dfrac{1}{\lambda}$) is
$\log(A)=\log(\lambda)I+uJ-\dfrac{1}{2}(uJ)^2+\dfrac{1}{3}(uJ)^3+\cdots$
with the definition: $\log(\lambda)=\log(r)+i\theta$ when $\lambda=r\exp(i\theta), \theta\in(-\pi,\pi)$.
Note that the above sum is finite because $J^n=0$ and that we use the standard series of $\log(1+v)$ when $|v|<1$.
EDIT.
Assume that, on the complex numbers, $f(y)=\sum_i a_iy^i,|y|<u$ and $y=g(x)=\sum_{j\geq 1} b_jx^j,|x|<v$ is s.t. $b_1\not= 0$ and for every $|x|<v$, one has $|g(x)|<u$. Let $\phi_k(x)=\tau_k(\sum_{i=0}^k a_i(\sum_{j=1}^k b_jx^j)^i)$ where $\tau_k$ keeps only the monomials of degree $\leq k$..
Then $(f\circ g)(x)=\lim_{k\rightarrow\infty}\phi_k(x)$ (uniformly locally convergent series with radius $\geq v$).
One has a similar result for the matrices (same proof. -that is your business- the key being $||A^p||\leq ||A||^p$): Let $f(B)=\sum_i a_iB^i$ and $B=g(A)=\sum_{j\geq 1} b_jA^j$;
$(f\circ g)(A)=\lim_{k\rightarrow\infty}\phi_k(A)$ for $||A||<u$ and even for $\rho(A)=\max((|\lambda_i|)_i<u$.
Now, if $(f\circ g)(x)=x$, then $A=\lim_{k\rightarrow\infty}\phi_k(A)$.
In particular, if $A=uJ$, then $uJ=\phi_{n-1}(uJ)$.