Non-simple form of heteroscedasticity

Summary

Setup

Random sample \(\left( y_i,x_i \right)\) for \(i=1, \ldots ,n\)
Correctly specified linear regression model, \(y_i=x'_iβ+ε_i\) where \(E\left( ε_i \mid x_i \right)=0\) .

We say that we have a non-simple form of heteroscedasticity if

\[Var\left( ε_i \mid x_i \right)=σ_i^2\]

where \(σ_i^2\) is unknown .
In some cases, it is possible to estimate each \(σ_i^2\) for \(i=1,..,n\) . \({\hat{σ}}_i^2\) is then an estimator of \(σ_i^2\) .
Definition, \(\hat{Ψ}\)

\[\hat{Ψ}=diag\left( {\hat{σ}}_1^2, \ldots , {\hat{σ}}_n^2 \right)\]

Definition: the Feasible Weighted Least Squares (FGLS) estimator is defined as

\[b_{FWLS}={\left( X'{\hat{Ψ}}^{-1}X \right)}^{-1}X'{\hat{Ψ}}^{-1}y\]

This is also called the EGLS estimator, where E stands for estimated.
The properties of the FGLS estimator is more complicated and, in general, \(E\left( b_{FWLS} \right)\) and \(Var\left( b_{WLS}|X \right)\) cannot be found. The FGLS estimator is in general not unbiased and not BLUE.
However, the FWLS estimator is often consistent and as an estimator if the variance of \(b_{FWLS}\) we use

\[\widehat{Var }\left( b_{FWLS}|X^* \right)={\left( X'{\hat{Ψ}}^{-1}X \right)}^{-1}\]

Example.

Suppose that

\[Var\left( ε_i \mid x_i \right)=σ_i^2=exp \left( x'_iγ \right)\]

\(γ\) is a \(k×1\) vector of unknown parameters.
\(γ\) can be estimated by running the auxiliary regression

\[log e_i^2=x'_iγ+error term\]

We then use

\[{\hat{σ}}_i^2=exp \left( x'_i\hat{γ} \right)\]

Under mild conditions, \(\hat{γ}\) will be a consistent estimator of \(γ\) , \(b_{FWLS}\) a consistent estimator of \(β\) which is asymptotically more efficient than the OLS estimator and the formula for the estimated variance will be asymptotically correct.