Questions: Simple Linear Regression

5 questions to test your understanding

Score: 0 / 5
Question 1 Multiple Choice

A regression of annual salary (y) on years of education (x) yields ŷ = 20,000 + 3,500x. A student concludes: 'Getting one more year of education causes your salary to increase by $3,500.' What is wrong with this interpretation?

AThe intercept $20,000 is implausibly low, which invalidates the slope interpretation
BThe slope represents a predictive association, not causation — lurking variables like field of study or ability could explain the relationship
CThe interpretation is only valid for people with exactly average education levels
DThe interpretation is correct as long as the R² value is sufficiently high
Question 2 Multiple Choice

A researcher fits a regression line to data on tree heights (y) and trunk diameter (x) for trees between 5 and 80 cm in diameter. She then uses the line to predict the height of a tree with a 200 cm diameter. Why is this prediction unreliable?

ARegression equations cannot be evaluated at values larger than the sample mean
BThe linear relationship may not hold beyond the observed range — the line has no obligation to track data where it hasn't been observed
CThe slope b₁ changes its value outside the observed data range
DPredictions are unreliable whenever the x-value is more than one standard deviation from the mean
Question 3 True / False

The regression line ŷ = b₀ + b₁x always passes through the point (x̄, ȳ) — the sample means of both variables.

TTrue
FFalse
Question 4 True / False

To predict x from y, you can simply rearrange the regression equation ŷ = b₀ + b₁x algebraically to solve for x.

TTrue
FFalse
Question 5 Short Answer

Why does the slope of a regression line represent a predictive rather than causal relationship, and what would be required to justify a causal interpretation?

Think about your answer, then reveal below.