Instrumental variables, one explanatory variable
Summary
Setup
- The linear regression model with random sampling,
\[y_i=β_1+β_2x_i+ε_i, i=1,…,n\]
- \(x_i\) is endogenous
- OLS will not give us unbiased or consistent estimators, standard errors are inconsistent and inference is invalid.
Instrumental variable
A variable \(z\) is called an instrumental variable (IV) for the endogenous variable \(x\) in the LRM if the following are satisfied:
- \(z\) is exogeneous,
\[E\left( ε_i|z_i \right)=0, i=1,…,n\]
- \(z_i\) is correlated with \(x_i\)
\[Cor\left( x_i,z_i \right)≠0\]
- \(z_i\) will not have a direct effect on \(y_i\) , \(E\left( z_i,x_i \right)\) does not depend on \(z_i\) .
- We say that the instrument is weak/strong if \(Cor\left( x_i,z_i \right)\) is small/large.
The IV-estimator
- The IV-estimator of \(β_2\) is given by
\[b_2= \frac{\sum_{i=1}^{n}{ \left( z_i-\bar{z} \right)\left( y_i-\bar{y} \right) }}{\sum_{i=1}^{n}{ \left( z_i-\bar{z} \right)\left( x_i-\bar{x} \right) }}\]
- The IV-estimator of \(β_1\) is given by
\[b_1=\bar{y}-b_2\bar{x}\]