If $\varphi$ is a formula in a first order language $\mathcal{L}$ and $x$ is a variable that is not free in $\varphi$, then the following is a pure axiom
$$\varphi \to \forall x\varphi$$
The pure axioms are our building blocks for constructing the logic theorems. There are other pure axioms, e.g. the distribution axiom:
$$\forall x(\varphi \to \psi)\to (\forall x \varphi \to \forall x \psi)$$
My question is:
Why is the generalization axiom necessary? I'm sure (I believe it) it can't be proved from the other pure axioms (but I don't see why isn't this axiom a valid formula), however, it's doesn't seem to be so important in the sense that "it doesn't add any information at all", an example of such an axiom is (in informal language):
If $x=2$ then $x=2$ for all $z$.
Why is it so important? I mean, in my opinion, the other axioms (as the distribution axiom, for example) seem to be more necessary since those "talk" about variables that may be free in the considered formulas. I've seen a couple of lemmas that need the generalization axiom to be proved, but I can't get the intuition behind.
This axiom is "useful" in proving the Generalization Theorem :
See :
There are other axiomatizations of first-order logic that avoid this "unnatural" axiom; see :
or