5 questions to test your understanding
A researcher runs an OLS regression and finds evidence of heteroskedasticity. She switches from classical OLS standard errors to robust (Huber-White) standard errors. What does this change fix?
A researcher studies the effect of a job training program assigned randomly at the county level, with outcome data measured at the individual worker level. At what level should she cluster her standard errors?
True or false: Robust (Huber-White) standard errors are always larger than the classical OLS standard errors they replace.
True or false: Using clustered standard errors makes OLS coefficient estimates less biased and more efficient, in addition to correcting the inference.
Why should you cluster standard errors at the level of policy assignment rather than at the individual level, even if your data is measured at the individual level?
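Once you have attempted the questions, a small simulation can help you check your reasoning. The sketch below is illustrative only: the data-generating process (50 counties, 40 workers each, a shared county-level shock, a true effect of 0.5) and the function names `simulate` and `slope_and_ses` are assumptions made for this example, not part of any particular study or library. It fits a one-regressor OLS by hand and computes classical, robust (HC0) and county-clustered (CR0, no small-sample correction) standard errors for the same slope estimate.

```python
import math
import random


def simulate(n_counties=50, workers_per_county=40, effect=0.5, seed=0):
    """Hypothetical data: treatment assigned by county, outcomes per worker."""
    rng = random.Random(seed)
    counties = list(range(n_counties))
    rng.shuffle(counties)
    treated = set(counties[: n_counties // 2])  # half the counties get the program
    rows = []  # each row is (county, treatment, outcome)
    for g in range(n_counties):
        county_shock = rng.gauss(0, 1)  # shared by every worker in county g
        d = 1.0 if g in treated else 0.0
        for _ in range(workers_per_county):
            y = 1.0 + effect * d + county_shock + rng.gauss(0, 1)
            rows.append((g, d, y))
    return rows


def slope_and_ses(rows):
    """OLS slope of outcome on treatment, with three standard errors for it."""
    n = len(rows)
    x_bar = sum(d for _, d, _ in rows) / n
    y_bar = sum(y for _, _, y in rows) / n
    sxx = sum((d - x_bar) ** 2 for _, d, _ in rows)
    sxy = sum((d - x_bar) * (y - y_bar) for _, d, y in rows)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar

    resid = [y - intercept - slope * d for _, d, y in rows]
    # Per-observation "score": demeaned regressor times residual.
    scores = [(d - x_bar) * e for (_, d, _), e in zip(rows, resid)]

    # Classical: one common error variance, independent observations.
    classical = math.sqrt(sum(e * e for e in resid) / (n - 2) / sxx)

    # Robust (HC0): allows each observation its own error variance.
    robust = math.sqrt(sum(s * s for s in scores)) / sxx

    # Clustered (CR0): sum scores within each county before squaring,
    # which allows arbitrary correlation among workers in the same county.
    by_county = {}
    for (g, _, _), s in zip(rows, scores):
        by_county[g] = by_county.get(g, 0.0) + s
    clustered = math.sqrt(sum(s * s for s in by_county.values())) / sxx

    return slope, classical, robust, clustered


if __name__ == "__main__":
    slope, classical, robust, clustered = slope_and_ses(simulate())
    print(f"slope      = {slope:.3f}")
    print(f"classical  = {classical:.3f}")
    print(f"robust     = {robust:.3f}")
    print(f"clustered  = {clustered:.3f}")
```

Note that the function returns a single slope alongside three different standard errors. Try changing `workers_per_county` or the standard deviation of `county_shock` and compare how each standard error responds.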