Questions — Simple Linear Regression

Question 1 Multiple Choice

A regression of annual salary (y) on years of education (x) yields ŷ = 20,000 + 3,500x. A student concludes: 'Getting one more year of education causes your salary to increase by $3,500.' What is wrong with this interpretation?

AThe intercept $20,000 is implausibly low, which invalidates the slope interpretation

BThe slope represents a predictive association, not causation — lurking variables like field of study or ability could explain the relationship

CThe interpretation is only valid for people with exactly average education levels

DThe interpretation is correct as long as the R² value is sufficiently high

Question 2 Multiple Choice

A researcher fits a regression line to data on tree heights (y) and trunk diameter (x) for trees between 5 and 80 cm in diameter. She then uses the line to predict the height of a tree with a 200 cm diameter. Why is this prediction unreliable?

ARegression equations cannot be evaluated at values larger than the sample mean

BThe linear relationship may not hold beyond the observed range — the line has no obligation to track data where it hasn't been observed

CThe slope b₁ changes its value outside the observed data range

DPredictions are unreliable whenever the x-value is more than one standard deviation from the mean

Question 3 True / False

The regression line ŷ = b₀ + b₁x always passes through the point (x̄, ȳ) — the sample means of both variables.

TTrue

FFalse

Question 4 True / False

To predict x from y, you can simply rearrange the regression equation ŷ = b₀ + b₁x algebraically to solve for x.

TTrue

FFalse

Question 5 Short Answer

Why does the slope of a regression line represent a predictive rather than causal relationship, and what would be required to justify a causal interpretation?

Think about your answer, then reveal below.

Questions: Simple Linear Regression