Questions: Base-Rate Integration and Bayesian Reasoning in Probability
5 questions to test your understanding
Score: 0 / 5
Question 1 Multiple Choice
A test for a disease affecting 1 in 1,000 people is 95% sensitive and 95% specific. A patient tests positive, and their doctor concludes there is roughly a 95% chance the patient has the disease. What error is the doctor making?
AIgnoring the low base rate — with 1-in-1,000 prevalence, far more false positives than true positives occur, making the true probability much lower than 95%
BConfusing sensitivity with specificity, which would actually increase the estimated probability
COverestimating the test's accuracy by relying on the manufacturer's claim
DApplying Bayes' theorem incorrectly by counting the base rate twice
The doctor is substituting the test's accuracy (95%) for the probability of disease given a positive result. With a 1-in-1,000 base rate in a population of 100,000: roughly 95 true positives but ~4,995 false positives, giving a positive predictive value of only about 2%. The test's accuracy is high, but the rarity of the disease means false positives dominate. Base-rate neglect here leads to severe overestimation of disease probability — and potentially harmful over-treatment.
Question 2 Multiple Choice
In the classic lawyer-engineer study, participants are told a group is 70% engineers and 30% lawyers, then read a brief description of Tom: methodical, enjoys logic puzzles, has few friends. Most estimate ~85% probability Tom is an engineer. What does this demonstrate about base-rate integration?
APeople over-rely on the description's resemblance to an 'engineer type,' effectively ignoring the 70% prior probability
BPeople correctly weight both the description and the base rate, producing a well-calibrated estimate
CPeople understand that the description is more informative than the base rate in this case
DThe 85% estimate is normatively correct because the description is highly diagnostic
If participants integrated the base rate correctly, the description would update the 70% prior upward — but modestly, depending on how much more likely the description is given 'engineer' vs. 'lawyer.' Pushing the estimate to 85%+ shows the base rate has been nearly ignored. People substitute the question 'How much does Tom resemble an engineer?' for 'What is the probability Tom is an engineer?' — a representativeness substitution that systematically discards the prior.
Question 3 True / False
Presenting base-rate information as natural frequencies (e.g., '5 out of 100 people') rather than percentages (e.g., '5% of people') improves base-rate integration in probabilistic reasoning.
TTrue
FFalse
Answer: True
Research consistently shows that frequency formats dramatically improve base-rate integration. When the base rate is embedded in a natural frequency format, it is easier to process and harder to ignore. This appears to reflect how our reasoning systems evolved — we track repeated discrete events better than abstract probabilities. The practical implication: communicating risk as frequencies is not just a stylistic preference but a measurable intervention that improves reasoning in patients, clinicians, and the general public.
Question 4 True / False
Base-rate neglect is an unavoidable feature of human cognition that can seldom be meaningfully reduced by training or by changing how information is presented.
TTrue
FFalse
Answer: False
False. Base-rate neglect varies substantially with presentation format and training. Natural frequency formats restore near-Bayesian performance in many tasks where percentage formats produce severe neglect. Explicit training in probabilistic reasoning and Bayesian updating also increases integration. The bias reflects a mismatch between presentation format and the representational formats our intuitive systems handle well — change the format, and the 'unavoidable' bias largely disappears.
Question 5 Short Answer
Why does a highly accurate diagnostic test still produce many false positives when used to screen a general population for a rare condition?
Think about your answer, then reveal below.
Model answer: Because the base rate (prior probability) of the condition is so low, even a small false-positive rate generates a large absolute number of false alarms. With a 1-in-1,000 base rate and 95% accuracy in a population of 100,000: about 95 true positives but ~4,995 false positives — so only about 2% of positive tests are true positives. High accuracy means the test reliably distinguishes sick from healthy given known disease status, but positive predictive value (probability of disease given a positive result) depends critically on how rare the disease is.
This is the key practical consequence of base-rate neglect. Ignoring the prior probability leads to massive overestimation of disease probability after a positive test. This is why population-wide screening for rare conditions requires careful consideration of false-positive rates and subsequent harm — not just test accuracy in isolation.