How are they different? and why would would the second regression be more appropriate?
$ Y = A_1(x_1)^2 + e $
$ {\sqrt Y} = A_1x_1 + e $
My thoughts: Square root of a function makes the function more linear, so the second regression would be preferred due to this reason.
A model is said linear when it is linear with respect to its parameters; this does not have anything to do with the variables.
For example, $$y=a+ b x^\pi+c e^{\sqrt{x^3+9}}+d \sin(8x)$$ is a multilinear model.