I would like to know how regression model is a statistical model. The formal definition of a statistical model according to wikipedia is as follows:
a statistical model is usually thought of as a pair ( S, P) where where S is the set of possible observations, i.e. the sample space, and P is a set of probability distributions on S.
How does Regression model normally written as below map to the above formal definition of a statistical model? $y_{i}=\beta _{1}x_{i1}+\beta _{2}x_{i2}+\cdots +\beta _{p}x_{ip}+\varepsilon _{i},\, $
Note that one can perform linear regression without any probabilistic assumptions. But the above formulation is the usual set of assumptions when discussing things like bias/variance of the model, inference of coefficients, etc.
Note also that the above conditions might vary in different contexts. For instance, the above only models the conditional distribution (effectively keeping the $x_{ij}$ fixed/known); some might model the $x_{ij}$ as random variables as well. Conditions on the variance may vary. Is it known or unknown? Is it constant for all $i$ (homoskedastic) or does it vary with $i$ (heteroskedastic)? What if you don't require the conditional distribution to be normal?