Questions: Quantiles, Percentiles, and the Five-Number Summary
5 questions to test your understanding
Score: 0 / 5
Question 1 Multiple Choice
A student scores 720 on a standardized test and is told they are at the 85th percentile. Which interpretation is correct?
AThe student scored 85% of the total possible points on the test
BApproximately 85% of test-takers scored at or below 720
CThe student answered 85 out of 100 questions correctly
DThe student scored 85 points above the median
The pth percentile identifies a position in the sorted distribution, not an absolute score. Being at the 85th percentile means roughly 85% of test-takers scored at or below 720 — this is the student's percentile rank. The actual score (720) is the percentile value; the 85% describes relative position. These can be entirely disconnected: 720 might represent only 60% of possible points yet still place a student in the 85th percentile if most others scored lower.
Question 2 Multiple Choice
Why is the IQR considered more robust than the standard deviation as a measure of spread?
AThe IQR is always a smaller number than the standard deviation, so it is more precise
BThe IQR spans only the middle 50% of sorted data, so extreme values at the tails cannot affect it
CThe IQR can be computed without sorting the data, making it less sensitive to ordering errors
DThe standard deviation is only defined for normally distributed data, while the IQR works for any distribution
Robustness means resistance to outliers. The IQR = Q3 − Q1 uses only the boundaries of the middle 50% of the data. No matter how extreme the minimum or maximum are, they cannot change Q1 or Q3 — the IQR is unaffected. The standard deviation squares deviations from the mean, which amplifies the influence of extreme values. Option D is a partial truth: the standard deviation is defined for all distributions, but it is less meaningful (not undefined) for skewed data — the real issue is sensitivity to outliers.
Question 3 True / False
The mean and the median are both measures of center, so they both fall at the 50th percentile of any dataset.
TTrue
FFalse
Answer: False
The median is always the 50th percentile by definition — it is the value that splits the sorted data in half. The mean is an arithmetic average and can be anywhere in the distribution. In right-skewed data (like incomes), a small number of very high values pull the mean far above the median. For example, a dataset where most values are around 30 but a few are in the thousands could have a median of 32 and a mean of 150. Only in perfectly symmetric distributions do the mean and median coincide.
Question 4 True / False
The five-number summary makes no assumptions about the shape of the underlying distribution.
TTrue
FFalse
Answer: True
This is what 'distribution-free' means. The five-number summary — minimum, Q1, median, Q3, maximum — is computed directly from sorted data positions, requiring no assumption about normality, symmetry, or any parametric form. This contrasts with inference based on the mean and standard deviation, which typically assumes approximate normality for interval estimates. The five-number summary is valid and meaningful for any data distribution, including heavily skewed or multimodal ones.
Question 5 Short Answer
Why might a five-number summary give a better description of income data than the mean and standard deviation alone?
Think about your answer, then reveal below.
Model answer: Income distributions are right-skewed: a small number of very high earners pull the mean far above what most people earn. The mean and standard deviation are sensitive to these outliers, making the mean unrepresentative of the typical income. The five-number summary — based on percentiles — shows where most incomes actually fall (Q1 to Q3) and reveals the skewness through asymmetric gaps (e.g., Q3−median much larger than median−Q1), without being distorted by extreme values.
When the median income is $52,000 and the mean is $80,000, the gap signals right-skew: a few high earners inflate the average. The five-number summary makes the structure visible without being misled. This is why policy discussions about 'typical' household income should reference median, not mean — and why the IQR better captures the spread most families experience.