The Chow test

Summary

The Chow test

  • Given: a regression model with two distinct subsamples, A and B.
  • Example 1: Time series data divided into two subsamples based on a date.
  • Example 2: Cross sectional data divided into two subsamples based on a dummy variable.
  • Two choices:
    • Run two regressions, one for each subsample.
    • Run one regression using the entire (pooled) sample.
  • \(H_0:\) There is no significant improvement in fit from running two regressions.
  • \(RSS_A\) is defined as the \(RSS\) using only subsample \(A\)
  • \(RSS_B\) is defined as the \(RSS\) using only subsample \(B\)
  • \(RSS_P\) is defined as the \(RSS\) using the entire (pooled) sample
  • The Chow test is an \(F\) -test with the following \(F\) -statistic:

\[F= \frac{\left( RSS_P-RSS_A-RSS_B \right)/k}{\left( RSS_A+RSS_B \right)/(n-2k) }\]

  • Under \(H_0\) , the \(F\) -statistic follows an \(F\) -distribution with \(k\) and \(n-2k\) degrees of freedom.