A state mandates that all 4th graders scoring at the 50th percentile or above on a standardized reading test are considered 'proficient readers.' A policy analyst objects that this approach conflates two different frameworks. What is the analyst's concern?
AThe 50th percentile is too low a cutoff and should be raised to the 75th percentile
BUsing a percentile rank (norm-referenced) as if it defines a performance standard (criterion-referenced) is a category error — relative standing does not guarantee absolute competence
CThe test has not been validated for use with 4th graders and the norms may be outdated
DNorm-referenced and criterion-referenced scores are mathematically equivalent, so the distinction does not matter
This is the core validity problem the question targets. 'Proficiency' is a criterion-referenced concept — it means demonstrating a specified level of skill. But the 50th percentile is a norm-referenced score — it simply means performing better than half the reference group. If the reference group is low-performing overall, a student at the 50th percentile might still lack basic reading competence. The analyst is right that framing norm-referenced standing as a proficiency standard conflates the frameworks, producing misleading inferences. Proficiency requires independently defined standards, not rank-order comparisons.
Question 2 Multiple Choice
A clinical psychologist administers a personality inventory that produces ipsative scores ranking an individual's five trait scores from highest to lowest within their own profile. She wants to use these scores to compare two patients' levels of conscientiousness. What is the problem with this plan?
AIpsative scores cannot be used clinically — they are only valid for research
BConscientiousness is not a valid personality trait and should not be assessed
CIpsative scores sum to a constant within a person, so they reflect relative standing within an individual's profile, not absolute level — cross-person comparison is mathematically invalid
DThe comparison is fine as long as both patients were tested with the same version of the instrument
Ipsative scores measure relative prominence of traits within a single person's profile. Because scores are forced to sum to a constant (e.g., if conscientiousness is high, another trait must be lower), a high conscientiousness score means only that the individual is relatively more conscientious than their other traits — not that they are highly conscientious in an absolute sense. Two people can both rank conscientiousness first while having very different absolute levels. Cross-person comparison using ipsative scores is a validity violation — you cannot rank individuals on a trait when each person's scores are constrained relative to their own total.
Question 3 True / False
A student who scores at the 90th percentile on a calculus exam has definitively demonstrated mastery of calculus as defined by the course learning objectives.
TTrue
FFalse
Answer: False
The 90th percentile is a norm-referenced score — it tells us the student outperformed 90% of the reference group, but nothing about whether they achieved a criterion standard. If the entire reference group performs poorly, a student at the 90th percentile might still fall short of mastery. Conversely, if the group is high-achieving, even lower-percentile students might exceed the criterion. Criterion-referenced mastery requires comparison to a fixed performance standard, not to peer performance. Conflating these frameworks is one of the most common — and consequential — errors in applied measurement.
Question 4 True / False
Ipsative scores from a personality inventory can be used to compare two employees' absolute levels of a trait to determine who is more suited for a leadership role.
TTrue
FFalse
Answer: False
Ipsative scores cannot be used for cross-person comparison because they reflect within-person relative standing, not absolute levels. The mathematical constraint — all trait scores for one person sum to a constant — means a high ipsative conscientiousness score may coexist with very different absolute conscientiousness levels in different individuals. Using ipsative scores for personnel selection decisions (which require cross-person comparison) is a validity violation and can produce incorrect recommendations. Norm-referenced standard scores are appropriate for comparing individuals.
Question 5 Short Answer
Why can't ipsative scores be used to compare people on a given trait, even if two individuals complete the exact same personality test?
Think about your answer, then reveal below.
Model answer: Ipsative scores measure the relative prominence of each trait within a single individual's profile — they are computed by comparison within the person, not by reference to an external norm or standard. Because scores are constrained to sum to a constant within each person, a high score on one trait necessarily implies lower scores on others. This means the same numerical ipsative score on 'conscientiousness' could reflect a very high absolute level for one person and a moderate level for another, depending on how their other traits are distributed. Cross-person comparison requires that scores share a common external reference point, which ipsative scoring deliberately removes.
The mathematical constraint is key: ipsative scoring creates a zero-sum situation within each person's profile. Students should understand this not as a technical detail but as a fundamental limitation on the inferences ipsative scores can support — they are valid for understanding within-person priorities but invalid for between-person ranking, correlation with external criteria, or group statistics.