Questions: Dummy Variables and Categorical Regressors

5 questions to test your understanding

Score: 0 / 5

Question 1 Multiple Choice

You are estimating a wage regression with a 4-category variable: season of birth (spring, summer, fall, winter). How many dummy variables should you include, and why?

A4 dummies — one for each season, to capture each season's full effect

B3 dummies — include all but one, which becomes the reference category

C2 dummies — one for each pair of seasons (spring/summer vs. fall/winter)

D1 dummy — a single variable can encode all 4 categories using values 0, 1, 2, 3

Question 2 Multiple Choice

A wage regression includes a female dummy D (1=female, 0=male) and years of education. The interaction term D × Education has a coefficient of +$800. What does this mean?

AWomen earn $800 more than men on average, regardless of education

BEach additional year of education is worth $800 for both men and women

CEach additional year of education is worth $800 more for women than for men

DThe gender wage gap closes by $800 for each year of education women complete

Question 3 True / False

Including most k dummy variables for a k-category variable alongside an intercept term is fine in OLS regression, as long as your software is modern enough to handle the multicollinearity.

TTrue

FFalse

Question 4 True / False

Changing the reference category in a dummy variable regression changes the fitted values and the predicted mean outcomes for each group.

TTrue

FFalse

Question 5 Short Answer

What is the 'dummy variable trap,' and why does it make OLS estimation mathematically impossible?

Think about your answer, then reveal below.