Questions — Model Interpretability and Explainability

Question 1 Multiple Choice

A medical AI model achieves 97% accuracy at detecting tumors from X-ray scans. Saliency maps show the model highlights a region in the corner of each image — the same region where a metal ruler used during imaging always appears. What does this scenario most directly illustrate about interpretability?

A. Saliency maps are fundamentally unreliable and should never be used to explain medical AI decisions

B. High accuracy guarantees that the model has learned clinically meaningful features

C. Interpretability tools require validation against ground truth: a plausible-looking explanation can still reflect a spurious correlation rather than the model's genuine reasoning

D. The model must be retrained on ruler-free images before any explanation can be trusted

Question 2 Multiple Choice

SHAP values are computed to explain why a specific applicant's loan application was denied. They show that the applicant's debt-to-income ratio was the most influential feature for this decision. Which type of interpretability does this represent?

A. Global intrinsic: it uses a transparent model structure that is interpretable by design

B. Global post-hoc: it summarizes feature importance across the entire model

C. Local post-hoc: it explains a single prediction using a method applied after training

D. Local intrinsic: it uses a simple model that is inherently transparent for individual predictions

Question 3 True / False

A perfectly faithful explanation of a neural network's prediction would be at least as difficult to interpret as the model itself.

T. True

F. False

Question 4 True / False

Interpretability methods are most valuable after a model fails in production, since there is no benefit to examining model reasoning on a well-performing system.

T. True

F. False

Question 5 Short Answer

Why is a perfectly faithful explanation of a neural network's prediction inherently self-defeating as a practical tool?

