Questions: The Guessing Parameter and Three-Parameter IRT Models
5 questions to test your understanding
Score: 0 / 5
Question 1 Multiple Choice
A test developer expects the guessing parameter c for a 4-choice item to be approximately 0.25, but the estimated value is 0.13. The most likely explanation is:
AThe item is too easy, so even low-ability examinees answer correctly without guessing
BLow-ability examinees are attracted to specific distractors rather than choosing randomly across all options
CThe item has poor discrimination, which reduces the lower asymptote
DThe sample was too small to estimate c accurately, so it shrank toward zero
The 'pseudo' in pseudo-guessing reflects the fact that low-ability examinees do not choose uniformly at random. Well-constructed distractors attract systematic wrong responses — certain wrong answers look more plausible than others — so low-ability examinees cluster on specific incorrect options rather than spreading evenly. This means the empirical lower asymptote is typically 0.10–0.20 for 4-choice items, well below the 0.25 that pure random guessing would predict. Option A describes a different problem (low difficulty), and option C confuses discrimination with the asymptote.
Question 2 Multiple Choice
A psychometrician fits a 2PL model to multiple-choice data from a test with substantial guessing. What systematic bias should they expect in difficulty (b) estimates?
ADifficulty will be deflated because the model attributes lucky guesses to the item being easy
BDifficulty will be inflated because the model pushes the ICC's inflection point upward to fit the lower plateau
CDifficulty will be unaffected — the discrimination parameter absorbs guessing behavior
DDifficulty estimates will be random, since guessing is random by definition
The 2PL assumes the ICC descends all the way to zero as ability decreases. When there is a genuine lower plateau from guessing, the model tries to fit this floor by shifting the inflection point (b) upward — making the item look harder than it really is. The 3PL corrects this by modeling the asymptote directly with c, freeing b to reflect true difficulty. Option C is wrong because discrimination controls the slope, not the floor; the 2PL has no mechanism to absorb a non-zero lower asymptote.
Question 3 True / False
For a four-option multiple-choice item, the 3PL pseudo-guessing parameter c will typically be estimated at approximately 0.25 in real test data.
TTrue
FFalse
Answer: False
This is a common intuition — 1 in 4 options suggests 25% chance guessing — but empirical c estimates are typically 0.10–0.20, substantially lower. Low-ability examinees are not choosing randomly; they are systematically drawn to plausible distractors. The 0.25 value applies only if choices are uniformly random, which real test-taking behavior is not. This is exactly why the parameter is called 'pseudo-guessing' rather than 'guessing.'
Question 4 True / False
The added complexity of the 3PL model over the 2PL is only justified when guessing is a substantial, systematic feature of the data, and this should be determined by empirical model comparison.
TTrue
FFalse
Answer: True
The c parameter is weakly identified from data alone — it requires large samples and items where guessing genuinely occurs, and even then it is estimated with high variance, often requiring informative priors to stabilize. Using the 3PL when guessing is minimal or absent adds estimation instability without improving fit. Many operational testing programs use 2PL or Rasch models for most items, reserving the 3PL for clearly multiple-choice contexts with known guessing. Empirical comparison using indices like M2, RMSEA, or information criteria is the principled way to decide.
Question 5 Short Answer
Why is the lower asymptote parameter in the 3PL model called 'pseudo-guessing' rather than simply 'guessing'?
Think about your answer, then reveal below.
Model answer: Because the parameter captures the combined effect of random guessing and differential distractor attraction, not pure chance. Low-ability examinees do not choose uniformly at random — well-constructed distractors attract systematic wrong responses, so examinees cluster on specific incorrect options. The empirical lower asymptote is therefore lower than 1/k (where k is the number of options) would predict from pure random guessing. 'Pseudo-guessing' acknowledges that the lower asymptote reflects distractor quality and test-taking strategy as well as chance.
The distinction matters practically: if c simply equaled 1/k, test developers would have no reason to invest in distractor quality. The pseudo-guessing framing reveals that good distractors — ones that attract low-ability examinees for conceptual reasons — actually reduce c below the chance baseline, giving the test more information-discriminating power at low ability levels. It also means that c is partly a property of the items (distractor quality) and not just a universal constant for multiple-choice formats.