Questions: Agnostic PAC Learning

4 questions to test your understanding

Score: 0 / 4
Question 1 Multiple Choice

In realizable PAC learning, sample complexity scales as O(d/epsilon). In agnostic PAC learning, it scales as O(d/epsilon^2). Why does removing the realizability assumption make learning harder?

AWithout realizability, the learning algorithm must use gradient descent instead of ERM, which converges more slowly
BIn the realizable case, any hypothesis with nonzero training error can be eliminated, drastically pruning the search space; in the agnostic case, every hypothesis has some error, so the learner must estimate error rates precisely — and estimating rates to epsilon accuracy requires O(1/epsilon^2) samples
CThe VC dimension effectively doubles in the agnostic setting because the hypothesis class must represent both the target and the noise
DAgnostic learning requires a separate validation set, consuming half the samples
Question 2 True / False

Agnostic PAC learning guarantees that the learned hypothesis achieves error within epsilon of the Bayes-optimal classifier.

TTrue
FFalse
Question 3 True / False

In the agnostic setting, ERM is still a valid learning algorithm — but it requires more samples than in the realizable case.

TTrue
FFalse
Question 4 Short Answer

Explain why the agnostic setting is more practically relevant than the realizable setting, and what the 'price of agnosticism' is in terms of sample complexity.

Think about your answer, then reveal below.