Questions about the Perceptron convergence proof

256 Views Asked by Bumbble Comm At 08 Apr 2026 - 10:37

I have some amateur questions about the perceptron convergence proof:

I don't understand where we got $k\gamma$ from, specifically how did we go from $\frac{\theta^{k-1}\theta^*}{\|\theta^*\|}+\gamma$ to $k\gamma$?
How did we go from $\|\theta^{k-1}+y^{(i)}x^{(i)}\|^2$ to $\|\theta^{k-1}\|^2+2y^{(i)}\theta^{(k-1)} x^{(i)}+\|x^{(i)}\|^2$?

Since $\| \theta^{k-1}+y^{(i)}x^{(i)} \|$ is just an expression of this vector's length, I would have thought that the $\| \|$ and the square would have cancelled each other out, thus leaving us with only $(\theta^{k-1})^2+(y^{(i)}x^{(i)})^2$?

(As far as I understand, the length of a vector $(x,y)$ is just $\sqrt{x^2+y^2}$)

There are 1 best solutions below

Bumbble Comm On 17 Mar 2021 - 5:56 BEST ANSWER

You can view it from the perspective of mathematical induction.

We have $$\frac{\theta^{(1)}\cdot \theta^*}{\|\theta^*\|}\ge \frac{\theta^{(0)}\cdot \theta^*}{\|\theta^*\|} + \gamma = \frac{0\cdot \theta^*}{\|\theta^*\|} + \gamma=\gamma$$ and hence by induction hypothesis $$\frac{\theta^{(k)}\cdot \theta^*}{\|\theta^*\|}\ge \frac{\theta^{(k-1)}\cdot \theta^*}{\|\theta^*\|} + \gamma \ge (k-1)\gamma + \gamma=k\gamma$$

For real number $(a+b)^2=a^2+b^2+2ab$. For real vector, $\|a+b\|^2=\langle a+b, a+b\rangle=\|a\|^2 + \|b\|^2 + 2\langle a, b \rangle.$

Questions about the Perceptron convergence proof

There are 1 best solutions below

Related Questions in ALGORITHMS

Related Questions in PROOF-EXPLANATION

Related Questions in MACHINE-LEARNING

Trending Questions

Popular # Hahtags

Popular Questions