I wanted to read a proof for the Binomial theorem, so I googled "proof of the binomial theorem".
My question is about the proof from the top link of that search. In the sixth line of the induction step, $$\sum_{k=0}^{n-1}{n-1 \choose k}x^{n-k}y^{k}$$ becomes $$\sum_{k=0}^{n}{n-1 \choose k}x^{n-k}y^{k}-{n-1 \choose n}x^{0}y^{n}$$ This is in order to get the indexes of the sums to match up, so the two sums can be combined.
So, I don't know what to make of ${n-1}\choose{n}$. WolframAlpha says it's zero. When using the formula ${{n}\choose{k}}=\frac{n!}{k!(n-k)!}$ with $n-1$ and $k=n$ I get $\frac{1}{n(-1)!}$
WolframAlpha says $(-1)!$ is complex infinity.
Also, ${n-1 \choose -1}$ is used, which I don't understand.
I'd be happy if someone could explain what is going on here.
Thanks.
Consider the following definitions/formulas for binomial coefficients $\binom{n}{m}$:
All four definitions/formulas above would prescribe for us $\binom{n}{m}=0$ if $n<m$. The coefficient of $x^m$ in the expansion of $(1+x)^n$ would be $0$; there will be no subsets of $\{1,\cdots,n\}$ of size $m$, the gamma function has poles at nonpositive integers so its reciprocal should have zeros; and evaluating the polynomial form will result in zero. Thus the convention $\binom{n}{m}=0$ when $n<m$ harmonizes all four standard definitions/formulas we have for the binomial coefficient.
Moreoever, if one adopts this convention once and for all, then all future instances of sums that would involve such terms will not require breaking apart individual terms as David Wheeler mentions in the comments. Bookkeeping is made much more efficient in this way. A similar thing happens with $0^0$ - not in terms of limits and indeterminate forms, but in terms of how to evaluate the expression that appears in summation notation. Nobody frets about writing down $\sum_{n=0}^\infty a_nx^n$ instead of $a_0+\sum_{n=1}^\infty a_nx^n$ for instance.
The combinatorial interpretation is most aligned with the summation bookkeeping purpose, as one can often interpret addition and individual summands themselves combinatorially, as in the case of the binomial theorem. Indeed the same occurs with $0^0$: in general $Y^X$ denotes the set of functions $X\to Y$ so that $|Y^X|=|Y|^{|X|}$, and $\varnothing^\varnothing$ has one element, the empty function, so $0^0=1$.
All of these reasons imply this convention is the "morally correct" choice of definition.