Any help is appreciated. I would like to know whether I missed something, or whether this is correct.
- $m$ is the number of training examples
- $L$ is the Loss-Function
- $\hat{\mathbf{y}}^{(i)}$ is the output vector for training example $i$
- $\mathbf{y}^{(i)}$ is the target vector for training example $i$
- I used the factor $\frac{1}{2}$ to simplify the derivative
$$ E = \frac{1}{2m} \sum_{i=1}^{m} L^{(i)} = \frac{1}{2m} \sum_{i=1}^{m} \frac{1}{2} \| \hat{\mathbf{y}}^{(i)} - \mathbf{y}^{(i)} \|^2 $$
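To make the "simplify the derivative" point explicit (this step is my own addition, using the notation above): differentiating a single term with respect to $\hat{\mathbf{y}}^{(i)}$, the $2$ from the power rule cancels the inner $\frac{1}{2}$,

$$ \frac{\partial L^{(i)}}{\partial \hat{\mathbf{y}}^{(i)}} = \frac{\partial}{\partial \hat{\mathbf{y}}^{(i)}} \left( \frac{1}{2} \| \hat{\mathbf{y}}^{(i)} - \mathbf{y}^{(i)} \|^2 \right) = \hat{\mathbf{y}}^{(i)} - \mathbf{y}^{(i)} $$

which is the usual motivation for including the factor.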
For the purpose of simplifying the derivative, it suffices to let
$$E = \frac{1}{2m}\sum_{i=1}^m \|\hat{\mathbf{y}}^{(i)} - \mathbf{y}^{(i)}\|^2$$
though for the purpose of minimization the two definitions are equivalent, since they differ only by a positive scalar factor and therefore have the same minimizer.
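As a quick numerical sanity check of the scalar-factor claim, the sketch below (my own illustration, using NumPy and arbitrary random data) evaluates both definitions and their gradients on the same vectors; the two losses and the two gradients differ by exactly the factor $2$, so both gradients vanish at the same point.

```python
import numpy as np

rng = np.random.default_rng(0)
m, d = 5, 3  # m training examples, d-dimensional output vectors
y_hat = rng.normal(size=(m, d))  # predictions, one row per example
y = rng.normal(size=(m, d))      # targets

# E1: the question's definition, with the extra 1/2 inside the sum
E1 = (1.0 / (2 * m)) * np.sum(0.5 * np.sum((y_hat - y) ** 2, axis=1))
# E2: the simplified definition, without the inner 1/2
E2 = (1.0 / (2 * m)) * np.sum(np.sum((y_hat - y) ** 2, axis=1))

# The losses differ by exactly the positive scalar 2 ...
assert np.isclose(2 * E1, E2)

# ... and so do their gradients with respect to y_hat,
# so both are zero precisely when y_hat == y.
g1 = (1.0 / (2 * m)) * (y_hat - y)  # gradient of E1
g2 = (1.0 / m) * (y_hat - y)        # gradient of E2
assert np.allclose(2 * g1, g2)
```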