Analysis of why respondents select incorrect options (distractors) reveals test quality issues and guides item revision. Effective distractors should be plausible to those lacking mastery but clearly inferior to the correct answer for those with knowledge. Weak distractors that are avoided by both high and low scorers reduce item discrimination and efficiency; removal or revision of such distractors can improve test quality.
Examine item response frequencies across ability groups (often 25th, 50th, 75th percentile scorers). Identify distractors that are not chosen by any group or chosen equally by all groups. Practice revising weak distractors to common misconceptions or likely errors that content experts expect.
From your study of item difficulty and discrimination, you know that a good item should be moderately difficult and should reliably separate high-ability from low-ability respondents. But a multiple-choice item doesn't live or die by its correct answer alone — the wrong options matter just as much. Distractor analysis asks: what are the incorrect options *doing* for the item, and are they doing it well?
A functioning distractor is one that attracts respondents who lack mastery while being clearly avoided by those who have it. Think of a well-designed distractor as a plausible error trap: it represents a misconception, a common computational mistake, or a related-but-wrong concept that someone who hasn't fully learned the material would reasonably select. For example, on a pharmacology exam, a distractor might name a drug with a similar mechanism but different indication — someone who half-remembers the content might choose it, but someone with solid knowledge won't. This is what you want: distractors that discriminate.
The diagnostic tool for distractor quality is the distractor frequency table — a breakdown of how often each option is chosen by respondents at different ability levels (typically the bottom, middle, and top quartiles). A functioning distractor shows a characteristic gradient: chosen most often by the bottom quartile, less often by the middle, rarely by the top. A non-functioning distractor (NFD) violates this pattern. The most common failure mode is the "transparent foil" — an option so obviously wrong that nobody picks it at any ability level. Another failure is the "inverse distractor" that attracts more high-ability than low-ability respondents, suggesting it is actually closer to correct than the keyed answer, or that the item has a flaw.
Fixing non-functioning distractors requires content expertise combined with statistical feedback. Statistics tell you *that* a distractor isn't working; content expertise tells you *why* and *what to replace it with*. Good revisions anchor replacements in common learner errors: survey your own students about what confuses them, review wrong answers on open-response versions of the same question, or consult subject matter experts about typical misconceptions. A four-option item with three functioning distractors is substantially more discriminating than one with only one functioning distractor — from a Classical Test Theory perspective, you are essentially running a different test depending on how many genuine traps the item contains. Distractor revision is therefore one of the highest-leverage activities in test development.