The Chow test
Summary
The Chow test
- Given: a regression model with two distinct subsamples, A and B.
- Example 1: Time series data divided into two subsamples based on a date.
- Example 2: Cross sectional data divided into two subsamples based on a dummy variable.
- Two choices:
- Run two regressions, one for each subsample.
- Run one regression using the entire (pooled) sample.
- \(H_0:\) There is no significant improvement in fit from running two regressions.
- \(RSS_A\) is defined as the \(RSS\) using only subsample \(A\)
- \(RSS_B\) is defined as the \(RSS\) using only subsample \(B\)
- \(RSS_P\) is defined as the \(RSS\) using the entire (pooled) sample
- The Chow test is an \(F\) -test with the following \(F\) -statistic:
\[F= \frac{\left( RSS_P-RSS_A-RSS_B \right)/k}{\left( RSS_A+RSS_B \right)/(n-2k) }\]
- Under \(H_0\) , the \(F\) -statistic follows an \(F\) -distribution with \(k\) and \(n-2k\) degrees of freedom.