Questions: Anomaly Detection Methods

5 questions to test your understanding

Score: 0 / 5
Question 1 Multiple Choice

A data scientist builds an anomaly detection model for factory machine failures and says 'I'll use the statistically optimal threshold.' What is the fundamental problem with this statement?

AAnomaly detection models do not produce continuous scores, so threshold selection is not applicable
BStatistical optimality requires labeled anomaly data, which is never available in practice
CThere is no universally optimal threshold — the right cutoff depends on the business cost of false negatives versus false positives, which cannot be determined from the data alone
DThe threshold should always be set at 3 standard deviations from the mean, making the choice straightforward
Question 2 Multiple Choice

Why do isolation forests use the average depth at which a point is isolated in random decision trees as its anomaly score?

ADeeper isolation indicates the point is in a denser region, requiring more splits to separate from similar points
BAnomalies in sparse regions are isolated in very few random splits, while normal points in dense clusters require many splits; short isolation paths inversely signal anomaly-ness
CRandom trees with more splits achieve higher accuracy, so deeper isolation paths produce more reliable scores
DIsolation depth is directly proportional to the z-score, providing a familiar statistical interpretation
Question 3 True / False

In anomaly detection for credit card fraud, it is generally better to use a lower detection threshold (more sensitive, more alerts) than in a manufacturing quality control application.

TTrue
FFalse
Question 4 True / False

The Local Outlier Factor (LOF) method uses a global density threshold to identify anomalies, which is why it performs better than isolation forests on datasets with clusters of varying density.

TTrue
FFalse
Question 5 Short Answer

Why is anomaly detection fundamentally different from a standard binary classification problem, and how does this difference affect how the methods are trained?

Think about your answer, then reveal below.