Hausman test for endogenous variables
Summary
Setup
- The linear regression model with random sampling,
\[y_i=β_1+β_2x_{i,2}+β_3x_{i,3}+…+β_kx_{i,k}+ε_i, i=1,…,n\]
- You suspect some explanatory variables are endogenous
- You have at least one instrument for each variable you suspect is endogenous, both IV and OLS is available.
Hausman test
- The null hypothesis is that all your explanatory variables are exogenous.
- Under the null hypothesis OLS and IV will both be consistent (only OLS will be efficient).
- If the null hypothesis is false, then OLS will be inconsistent while IV will be consistent.
- The Hausman test is based on the difference between the OLS and the IV estimates. If the difference is significantly different from zero, the null hypothesis is rejected.