Questions: Regularization Techniques

5 questions to test your understanding

Score: 0 / 5

Question 1 Multiple Choice

A model achieves near-perfect training accuracy but performs poorly on unseen test data. You apply L1 regularization. Which best explains how L1 addresses the problem?

AIt increases model capacity so the model fits both training and test distributions better

BIt penalizes the absolute value of weights, driving some to exactly zero and reducing effective model complexity

CIt averages predictions across many sub-models trained on different random subsets

DIt adds noise to training labels to prevent the model from memorizing specific examples

Question 2 Multiple Choice

You are training a linear model on 1,000 features but suspect only 20 are truly informative. Which regularizer is most appropriate, and why?

AL2, because it shrinks all weights equally and makes the model more numerically stable

BL1, because it can drive irrelevant feature weights to exactly zero, performing automatic feature selection

CDropout, because it randomly deactivates neurons during training, implicitly ignoring irrelevant features

DEarly stopping, because halting before convergence prevents the model from learning irrelevant features

Question 3 True / False

L2 regularization shrinks weights toward zero but rarely sets them to exactly zero, while L1 regularization can produce exactly zero weights.

TTrue

FFalse

Question 4 True / False

Regularization improves a model's training accuracy by penalizing overly complex solutions.

TTrue

FFalse

Question 5 Short Answer

Why does regularization improve generalization even though it makes the model fit the training data less well?

Think about your answer, then reveal below.