Econometrics Conditional Mean

1k Views Asked by At

I have a question regarding linear regression. Suppose we have the following regression model: $$ y_{it}=\alpha+x_{it}'\beta+u_{it} $$ where say $i$ represents individual $i$ at time period $t.$ The conditional mean function is in general defined as: $$ E[Y|X=x]=\int yf(y|x)dy $$ In the linear regression, assuming conditional exogeneity, this simplifies to: $$ E[y_{it}|x_{it}]=\alpha+x_{it}'\beta $$ My question is how is the defintion of the mean function related to the above expression? Do we assume that for ${each }$ value of $x$ , we have infinite possible values of $y$ and we can estimate the average? Thanks!

2

There are 2 best solutions below

0
On BEST ANSWER

You have posted the conditional expectation for continuous random variables. The conditional expectation for discrete random variables is

$\mathbb E(Y|X=x)=\sum\limits_{i=1}^n y_i\cdot f(y_i|x_i)$

In linear regression this conditional expectation is the estimated regression line:

$\mathbb E(Y|X=x)=\alpha+\beta x_i=\hat y_i$

$\alpha$ and $\beta$ are the estimated parameter of the regression line. Every value $x_i$ have a related $y_i$. They are given by the sample.

And the difference between the observed $y$-values and the $y$-values of the regression line is $y_i-\hat y_i=u_i$. The parameters $\alpha$ and $\beta$ are estimated by minimizing $\sum\limits_{i=1}^n u_i^2$.

3
On

Each $y_{it}$ is understood a random variable; so there is no problem in writing $\Bbb{E}(y_{it} \mid x_{it})$.

It seems an abuse of notation to write $y_{it}$ for both a random variable and a particular realization; but this is not uncommon in applied fields.