Questions — Two-Stage Least Squares (2SLS)

Question 1 Multiple Choice

A researcher manually runs two OLS regressions to implement 2SLS: first regressing x on instrument z to get x̂, then regressing y on x̂. She reports the standard errors from the second OLS regression as her 2SLS standard errors. What is wrong with this procedure?

AThe point estimate of the causal effect is also biased when 2SLS is run manually

BThe second-stage OLS standard errors are correct only if the first-stage R² exceeds 0.5

CThe standard errors from manual second-stage OLS are incorrect — they ignore the sampling variability introduced in the first stage and will typically be too small

DThe second stage should regress y on the original x, not on x̂

Question 2 Multiple Choice

A researcher reports a first-stage F-statistic of 4.2 when using a single instrument for an endogenous regressor. What is the key concern about the 2SLS estimates?

AThe instrument may violate the exclusion restriction, as indicated by the low F-statistic

BThe instrument is weak — it explains too little variation in x, so 2SLS estimates are severely biased toward OLS with inflated standard errors

CThe overidentification test will necessarily fail with a low first-stage F-statistic

DThe second stage cannot be run if the first-stage F-statistic falls below 10

Question 3 True / False

Having more instruments than endogenous variables (overidentification) allows the researcher to fully verify that most instruments satisfy the exclusion restriction via the Hansen-Sargan J-test.

TTrue

FFalse

Question 4 True / False

The first stage of 2SLS isolates the exogenous variation in the endogenous variable x by regressing x on the instrument z, producing fitted values x̂ that are uncorrelated with the error term.

TTrue

FFalse

Question 5 Short Answer

Why does 2SLS produce unbiased causal estimates when OLS does not, and what role does the first stage play?

Think about your answer, then reveal below.

Questions: Two-Stage Least Squares (2SLS)