My university lecturer has given the following definition of the composition of two formal power series:
Let $F(X)=\sum_{n=0}^\infty a_nX^n$ with $a_0=0$ and $G(X)=\sum_{n=0}^\infty b_nX^n$.
We define $$G\circ F(X)=G(F(X))=\sum_{n=0}^\infty c_nX^n,$$
where the coefficients $c_n$ are as follows:
- There is a sequence $({d_k}^{(n)})_{k\geq0}$ such that $$\sum_{k=0}^\infty {d_k}^{(n)}X^k=\sum_{k=0}^n b_k(F(X))^k.$$
- Set $c_n={d_n}^{(n)}$.
My questions are:
- Firstly, does the notation ${d_k}^{(n)}$ refer to the $n^{th}$ derivative of $d_k$, or the $n^{th}$ power of $d_k$? (I think it refers to the latter; however, the use of notation seems ambiguous to me.)
- Secondly, how do we know that the sequence $({d_k}^{(n)})_{k\geq0}$ exists?
- Finally, why must we have $a_0=0$?
The composition is just the composition; that is, it's
$$G(F(x)) = \sum_{n=0}^{\infty} b_n F(x)^n.$$
In order for this to be well-defined as a formal power series we need $a_0 = 0$ so that $x^n | b_n F(x)^n$, which ensures that each term $b_n F(x)^n$ only contributes finitely many terms to the coefficient of $x^k$ in the composition. Otherwise, for example, $G(F(0))$ may be a sum of infinitely many terms. Said another way, the condition that $a_0 = 0$ guarantees that the above sum converges in the $x$-adic topology.