Questions: Few-Shot Learning

5 questions to test your understanding

Score: 0 / 5
Question 1 Multiple Choice

A large model trained on 1000 classes is given 5 examples of a new class and fine-tuned for 100 epochs. Why does this approach typically fail in practice?

AThe model's architecture cannot be extended to new classes without retraining from scratch
BThe model catastrophically overfits to the 5 examples, memorizing them without learning a generalizable representation
CStandard fine-tuning requires at least 100 examples per class to update weights meaningfully
DThe learning rate must be specially tuned for each new class, requiring a separate validation set
Question 2 Multiple Choice

What is the fundamental difference between how prototypical networks and MAML handle a new task at test time?

APrototypical networks require fine-tuning via gradient descent; MAML classifies by nearest prototype without gradient updates
BPrototypical networks classify by distance to learned class prototypes without gradient updates; MAML performs a few gradient steps to adapt to the new task
CPrototypical networks use second-order optimization; MAML uses only first-order distance metrics
DBoth require the same number of gradient steps at test time; they differ only in training procedure
Question 3 True / False

In episodic training for few-shot learning, the model is trained on the same fixed set of classes it will be tested on, just with fewer labeled examples per class.

TTrue
FFalse
Question 4 True / False

Prototypical networks require no gradient updates at test-time inference because the embedding network has been trained to create a space where computing the mean of K support examples captures a class's identity sufficiently for classification.

TTrue
FFalse
Question 5 Short Answer

What does it mean to say few-shot learning models 'learn how to learn,' and how does episodic training implement this goal?

Think about your answer, then reveal below.