Questions — Simple Linear Regression Estimation

Question 1 Multiple Choice

A researcher runs OLS regression of annual income on years of education and obtains R² = 0.82. She concludes that education strongly causes higher income. What is the fundamental error in this reasoning?

AR² above 0.8 is implausibly high, suggesting a coding error

BOLS minimizes absolute errors, not squared errors, so R² measures the wrong criterion

CR² measures goodness of fit — how well education predicts income in the sample — but causality requires E(u|X)=0, which cannot be established from the regression output alone

DThe intercept must be statistically significant for causal inference to be valid

Question 2 Multiple Choice

What is the correct interpretation of the OLS slope estimator β̂₁ = Cov(X,Y) / Var(X)?

AThe fraction of the variation in Y that is explained by X

BThe probability that a one-unit increase in X causes Y to increase

CThe average change in Y associated with a one-unit change in X, measuring how much Y co-moves with X scaled by X's own variability

DThe average value of X when Y equals zero

Question 3 True / False

OLS estimation of β̂₁ and β̂₀ requires that the residuals are normally distributed.

TTrue

FFalse

Question 4 True / False

A high R² value in a regression of Y on X means that X explains a large share of the variation in Y, but does not by itself establish that X causes Y.

TTrue

FFalse

Question 5 Short Answer

Why can a regression with high R² still fail to identify a causal effect of X on Y? What additional condition is required, and why is that condition not visible in the regression output?

Think about your answer, then reveal below.

Questions: Simple Linear Regression Estimation