5 questions to test your understanding
A fraud detection model reports 99.8% accuracy on a dataset where 0.2% of transactions are fraudulent. What is the most important concern about this result?
You have a medical diagnosis model with 50:1 class imbalance (healthy vs. diseased). Which intervention most directly addresses the training algorithm's bias toward predicting 'healthy' for every patient?
A model that achieves 99% accuracy on an imbalanced dataset is likely performing well on the minority class.
Lowering the classification threshold (e.g., from 0.5 to 0.1) in a probabilistic classifier increases recall for the minority class at the cost of more false positives.
Why is accuracy a misleading metric for imbalanced classification, and what alternative metrics should be used?