I'm reading a book that prove the first part of Möbius Inversion Formula exactly like this:
($\Rightarrow$) Suppose that
$$f(n)=\sum_{d|n}g(d).$$
Then,
$$\begin{align}\sum_{d|n}\mu(d)f\left(\frac{n}{d}\right)&=\sum_{dd'=n}\mu(d)f\left(d'\right)\\ &=\sum_{dd'=n}\mu(d)\sum_{m|d'}g(m)\\ &=\sum_{dmh=n}\mu(d)g(m)\tag{1.1}\\ &=\sum_{mh'=n}g(m)\sum_{d|h'}\mu(d).\tag{1.2}\end{align}$$
But $\sum_{d|h'}\mu(d)=0$ for $h'>1$. Hence $$\sum_{d|n}\mu(d)f\left(\frac{n}{d}\right)=g(n).$$
I understood everything except why we can go from 1.1 to 1.2. I found similar questions on the internet, but I couldn't understand the answers either.
I'm realizing that this is not the first time that I have not understood how to interchange summation indexes. It might be interesting for me to do a more careful study of summations, so in addition to the proof explanation, I'm looking for books and videos that better explain how summation works and, in particular, how index interchange works.
Feel free to point out mistakes in my writing as well (I'm not used to writing in English).
Thanks in advance.
I just want to begin by remarking that your English appears immaculate! I would not have guessed that you're not used to writing in English if you had not said it...
The general idea with sum index-interchanges is just some careful thought about precisely what terms a sum iterates over and why are the same in the new index-order.
If a term is summed in (1.1), then it is $\mu(d)g(m)$ for some triple $(d,m,h)\in\mathbb{N}^3$ such that $dmh = n$. If we were to group $dmh = m(dh) = mh'$ then notice $d|h'$. Therefore, we get a triple $(m,h',d)\in\mathbb{N}^3$ such that $mh' = n$ and $d|h'$ and thus this term appears in the sum (1.2).
Similarly, suppose a term is summed in (1.2) then it's $g(m)\mu(d)$ for some triple $(m,h',d)\in\mathbb{N}^3$ such that $mh' = n$ and $d|h'$. But then from this triple we can form the new triple $(d,m,h'/d)\in\mathbb{N}^3$ such that $d\cdot m\cdot (h'/d) = n$. Thus this term appears in the sum (1.1).
That's all there really is to sum index-interchange: there are finitely many terms in both cases and we sum over the same set in both cases but the underlying set is represented differently in some way. We just need to see that the set is the same in both sums; in the above I showed that by showing that the set of summed terms in (1.1) and (1.2) are subsets of each other and so the same. After a while, you'll get used to this and you won't need to write out something like the above and instead just "see" it but for now perhaps try to write out the argument when you find a tricky index change.