Questions: Regularization Techniques

5 questions to test your understanding

Score: 0 / 5
Question 1 Multiple Choice

A model achieves near-perfect training accuracy but performs poorly on unseen test data. You apply L1 regularization. Which best explains how L1 addresses the problem?

AIt increases model capacity so the model fits both training and test distributions better
BIt penalizes the absolute value of weights, driving some to exactly zero and reducing effective model complexity
CIt averages predictions across many sub-models trained on different random subsets
DIt adds noise to training labels to prevent the model from memorizing specific examples
Question 2 Multiple Choice

You are training a linear model on 1,000 features but suspect only 20 are truly informative. Which regularizer is most appropriate, and why?

AL2, because it shrinks all weights equally and makes the model more numerically stable
BL1, because it can drive irrelevant feature weights to exactly zero, performing automatic feature selection
CDropout, because it randomly deactivates neurons during training, implicitly ignoring irrelevant features
DEarly stopping, because halting before convergence prevents the model from learning irrelevant features
Question 3 True / False

L2 regularization shrinks weights toward zero but rarely sets them to exactly zero, while L1 regularization can produce exactly zero weights.

TTrue
FFalse
Question 4 True / False

Regularization improves a model's training accuracy by penalizing overly complex solutions.

TTrue
FFalse
Question 5 Short Answer

Why does regularization improve generalization even though it makes the model fit the training data less well?

Think about your answer, then reveal below.