Questions: Confusion Matrix and Classification Metrics

5 questions to test your understanding

Score: 0 / 5

Question 1 Multiple Choice

A disease affects 1% of a population. A diagnostic test achieves 99% accuracy by always predicting 'healthy' for every patient. What is this test's recall for detecting the disease?

A99% — matching its overall accuracy

B1% — equal to the disease prevalence

C0% — it correctly identifies zero sick patients

D100% — it correctly identifies all healthy patients as healthy

Question 2 Multiple Choice

A spam filter is evaluated on 9,200 emails: 600 spam correctly caught (TP), 400 ham misclassified as spam (FP), 8,000 ham correctly passed (TN), 200 spam that slipped through (FN). What is the filter's recall?

A0.60 — computed as TP / (TP + FP)

B0.75 — computed as TP / (TP + FN)

C0.93 — computed as (TP + TN) / total

D0.95 — computed as TN / (TN + FP)

Question 3 True / False

A classifier with 99% accuracy is necessarily better than one with 95% accuracy for a fraud detection task where primarily 1% of transactions are fraudulent.

TTrue

FFalse

Question 4 True / False

Increasing a binary classifier's classification threshold (requiring higher confidence before predicting 'positive') generally increases precision while decreasing recall.

TTrue

FFalse

Question 5 Short Answer

Explain why 'accuracy' is a misleading metric for a fraud detection system where 99% of transactions are legitimate, and identify two metrics that would be more informative.

Think about your answer, then reveal below.