The LRM with a dummy variable

Summary

Main points:

  • In a regression model, a dummy variable is a variable \(d_i, i=1,…,n\) that can take the value 0 or 1.
  • Example: We let \(d_i=0\) if individual \(i\) is female and \(d_i=1\) if individual \(i\) is male.
  • We can add a dummy variable to our regression model:

\[y_i=β_1+β_2x_{i,2}+β_3x_{i,3}+…+β_kx_{i,k}+γd_i+ε_i i=1,…,n\]

  • The intercept of individuals coded as \(d_i=0\) (group 0) will be \(β_1\) .
  • The intercept of individuals coded as \(d_i=1\) (group 1) will be \(β_1+γ\) .
  • \(γ\) is interpreted as the extra intercept that group 1 observations have compared to group 0 observations.
  • You can have many dummy variables in a LRM, for example

\[y_i=β_1+β_2x_{i,2}+β_3x_{i,3}+…+β_kx_{i,k}+γ_1d_{i,1}+γ_2d_{i,2}+ε_i i=1,…,n\]