Confidence intervals in the LRM
Summary
Setup:
- The LRM with random sampling and GM
\[y_i=β_1+β_2x_{i,2}+β_3x_{i,3}+…+β_kx_{i,k}+ε_i, i=1,…,n\]
- \(ε_i\) follows a normal distribution (conditionally on \(x_i\) ), \(ε_i|x_i∼N\left( 0,σ^2 \right) i=1,…,n\)
Confidence intervals
- We define a \(\left( 1-α \right)∙100\) % confidence interval for \(β_j\) where \(j\) is any value between 1 and \(k\) as
\[b_j±t_{α/2,n-k}SE(b_j)\]
- The probability that the \(β_j\) is inside the \(\left( 1-α \right)∙100\) % confidence interval is precisely \(\left( 1-α \right)∙100\) %.
- If the GM assumptions are satisfied but the errors \(ε_i\) do not follow a normal distribution then, if \(n\) is large, the confidence intervals will be approximately correct by the CLT.