Possible Duplicate:
Why do we use a Least Squares fit?
To find the normal equations for derivation of the regression line we use the method of least squares, We want to make the error smallest for each individual,so we may take summation of mod of the error term for each individual instead of taking the summation of the square of the errors of the individuals. Why is the former method used for regression and not the second one?
It is not true that min sum of absolute errors is never used. Least squares is used because it is equivalent to maximum likelihood when the model residuals are normally distributed with mean 0. But when the distribution of the error term is non-normal particularly if it has heavy tails the least squares estimates are not best and robust procedures such as minimum sum of absolute errors are preferable.