Questions: Instance-Based Learning (k-NN)

5 questions to test your understanding

Score: 0 / 5
Question 1 Multiple Choice

A k-NN classifier is trained on a dataset with two features: age (range 0–80 years) and annual income (range $0–$500,000). No feature scaling is applied. What is the most likely consequence?

AThe model will fail to converge because k-NN requires normalized inputs to compute gradients
BIncome will dominate the distance calculation, effectively making age irrelevant to the predictions
CAge will dominate because biological age has more predictive power than income for most tasks
DBoth features contribute equally, because k-NN uses rank-order comparisons rather than raw distances
Question 2 Multiple Choice

A k-NN model with k=1 achieves 100% accuracy on training data but only 62% on held-out test data. Increasing k to 15 gives 88% training accuracy and 85% test accuracy. What best explains this pattern?

Ak=1 memorizes each training point perfectly — there is always a neighbor with distance zero — but overfits to noise; larger k smooths the decision boundary by averaging over more neighbors
Bk=1 is computationally faster, so it processes more training data before the time limit, learning more patterns
Ck=15 selects from a larger pool of training examples, effectively training on 15 times as much data
DIncreasing k introduces beneficial randomness that prevents the model from latching onto spurious correlations
Question 3 True / False

One advantage of k-NN over parametric models like logistic regression is that k-NN becomes faster to make predictions as the training set grows larger.

TTrue
FFalse
Question 4 True / False

Removing irrelevant features from a dataset can significantly improve k-NN accuracy, even if those same features would have negligible effect on a logistic regression model's performance.

TTrue
FFalse
Question 5 Short Answer

Explain what makes k-NN a 'lazy learner' and describe the key computational tradeoff this creates compared to 'eager' algorithms like logistic regression or decision trees.

Think about your answer, then reveal below.