Questions — Anomaly Detection Methods

Question 1 Multiple Choice

A data scientist builds an anomaly detection model for factory machine failures and says 'I'll use the statistically optimal threshold.' What is the fundamental problem with this statement?

A. Anomaly detection models do not produce continuous scores, so threshold selection is not applicable

B. Statistical optimality requires labeled anomaly data, which is never available in practice

C. There is no universally optimal threshold: the right cutoff depends on the business cost of false negatives versus false positives, which cannot be determined from the data alone

D. The threshold should always be set at 3 standard deviations from the mean, making the choice straightforward

Question 2 Multiple Choice

Why do isolation forests use the average depth at which a point is isolated in random decision trees as its anomaly score?

A. Deeper isolation indicates the point is in a denser region, requiring more splits to separate from similar points

B. Anomalies in sparse regions are isolated in very few random splits, while normal points in dense clusters require many splits, so a shorter average isolation path indicates a more anomalous point

C. Random trees with more splits achieve higher accuracy, so deeper isolation paths produce more reliable scores

D. Isolation depth is directly proportional to the z-score, providing a familiar statistical interpretation

Question 3 True / False

In anomaly detection for credit card fraud, it is generally better to use a lower detection threshold (more sensitive, more alerts) than in a manufacturing quality control application.

T. True

F. False

Question 4 True / False

The Local Outlier Factor (LOF) method uses a global density threshold to identify anomalies, which is why it performs better than isolation forests on datasets with clusters of varying density.

T. True

F. False

Question 5 Short Answer

Why is anomaly detection fundamentally different from a standard binary classification problem, and how does this difference affect how the methods are trained?

