Questions: Ridge, Lasso, and Elastic Net Regression

5 questions to test your understanding

Score: 0 / 5
Question 1 Multiple Choice

You have a dataset with 200 candidate predictors and believe only about 20 are genuinely related to the outcome. Which regularization method is most appropriate?

ARidge regression, because it handles large numbers of predictors by shrinking all coefficients
BOLS, because you need unbiased estimates to identify the true 20 predictors
CLasso regression, because it performs automatic variable selection by driving some coefficients to exactly zero
DElastic Net, because you always need both L1 and L2 penalties when predictors outnumber observations
Question 2 Multiple Choice

Why does Lasso drive some coefficients to exactly zero while Ridge only shrinks them toward (but never to) zero?

ALasso uses a larger default penalty parameter λ, forcing more shrinkage
BThe L1 constraint region has corners at the coordinate axes; the optimization solution often lands exactly on a corner where a coefficient is zero
CLasso uses an iterative algorithm that terminates early, leaving some coefficients unupdated
DRidge uses squared penalties which are stronger than absolute-value penalties and push coefficients further from zero
Question 3 True / False

Increasing the regularization parameter λ in Ridge regression always increases the model's bias while decreasing its variance.

TTrue
FFalse
Question 4 True / False

Ridge regression is the preferred regularization method when you believe primarily a sparse subset of predictors is truly relevant to the outcome.

TTrue
FFalse
Question 5 Short Answer

Explain the bias-variance tradeoff in regularization and describe how cross-validation is used to choose the optimal penalty parameter λ.

Think about your answer, then reveal below.