5 questions to test your understanding
A fraud detection model achieves 99.9% accuracy on a dataset where only 0.1% of transactions are fraudulent. What does this tell us about the model's AUC?
A classifier achieves AUC = 0.85. Which interpretation is correct?
An ROC curve is constructed by varying the classification threshold and recording how the true positive rate and false positive rate change at each threshold.
A model with AUC = 0.75 will achieve higher accuracy at nearly every possible threshold than a model with AUC = 0.65.
Why is AUC more informative than accuracy when evaluating a classifier on an imbalanced dataset? What does each metric actually measure?