Questions — Ridge, Lasso, and Elastic Net Regression

Question 1 Multiple Choice

You have a dataset with 200 candidate predictors and believe only about 20 are genuinely related to the outcome. Which regularization method is most appropriate?

ARidge regression, because it handles large numbers of predictors by shrinking all coefficients

BOLS, because you need unbiased estimates to identify the true 20 predictors

CLasso regression, because it performs automatic variable selection by driving some coefficients to exactly zero

DElastic Net, because you always need both L1 and L2 penalties when predictors outnumber observations

Question 2 Multiple Choice

Why does Lasso drive some coefficients to exactly zero while Ridge only shrinks them toward (but never to) zero?

ALasso uses a larger default penalty parameter λ, forcing more shrinkage

BThe L1 constraint region has corners at the coordinate axes; the optimization solution often lands exactly on a corner where a coefficient is zero

CLasso uses an iterative algorithm that terminates early, leaving some coefficients unupdated

DRidge uses squared penalties which are stronger than absolute-value penalties and push coefficients further from zero

Question 3 True / False

Increasing the regularization parameter λ in Ridge regression always increases the model's bias while decreasing its variance.

TTrue

FFalse

Question 4 True / False

Ridge regression is the preferred regularization method when you believe primarily a sparse subset of predictors is truly relevant to the outcome.

TTrue

FFalse

Question 5 Short Answer

Explain the bias-variance tradeoff in regularization and describe how cross-validation is used to choose the optimal penalty parameter λ.

Think about your answer, then reveal below.

Questions: Ridge, Lasso, and Elastic Net Regression