hinge loss vs. square of hinge loss components

1.5k Views Asked by At

We can define the hinge loss to be

$$ L(y,t) = \max\{0,1-yt\} $$

We can also have a variation such that the loss now becomes: $$ L(y,t) = \max\{0,(1-yt)^2 \} $$

When would you want to use one over the other?