Questions: Criterion-Related Validity and Predictive Accuracy
5 questions to test your understanding
Score: 0 / 5
Question 1 Multiple Choice
A cognitive ability test for job applicants has a validity coefficient of r = 0.40 with supervisor ratings of job performance. What percentage of variance in job performance does this test account for?
A40%, because the validity coefficient directly represents the proportion of variance explained
B16%, because the coefficient of determination is r² = 0.16
C60%, because 1 – r represents the unexplained portion
D80%, because r = 0.40 indicates a strong relationship in applied settings
The validity coefficient r tells you the direction and strength of the linear relationship, but the proportion of variance in the criterion explained by the test is r² — the coefficient of determination. For r = 0.40, r² = 0.16, meaning the test accounts for 16% of variance in job performance. This is why r² is the more interpretable effect size. Option A is the most common misconception: treating r itself as if it were a proportion of variance.
Question 2 Multiple Choice
A researcher validates a new depression screening scale by administering it to 300 patients and simultaneously collecting clinician diagnostic ratings. She finds a strong correlation between scale scores and clinician judgments. This study establishes:
APredictive validity, because the scale forecasts future clinical outcomes
BConcurrent validity, because the scale scores and criterion are collected at the same point in time
CConstruct validity, because it shows the scale measures what it claims to measure
DIncremental validity, because the scale adds information beyond what clinicians already know
Concurrent validity is established when test scores and criterion measures are collected simultaneously. The defining feature is timing: no waiting period. Predictive validity requires administering the test first and then measuring the criterion outcome at a later date. While concurrent validity is faster and cheaper to establish, predictive validity is usually more important for selection and screening contexts because the practical value of a test is its ability to forecast, not just correlate with current status.
Question 3 True / False
A test that shows strong concurrent validity — a high correlation between test scores and current criterion status — is equally suitable for personnel selection as a test with strong predictive validity.
TTrue
FFalse
Answer: False
Concurrent validity and predictive validity differ in what they tell you. Concurrent validity tells you that the test correlates with a criterion measured at the same time, which may reflect that both are tapping the same present state. Predictive validity tells you that test scores administered now forecast performance measured later — which is the actual task in selection contexts. A test that correlates with current performance may not predict who will succeed in a role months or years later. The time lag matters enormously for applied purposes.
Question 4 True / False
Even a modest validity coefficient (e.g., r = 0.30) can justify using a selection test in high-stakes hiring decisions, depending on base rates, selection ratios, and the costs of selection errors.
TTrue
FFalse
Answer: True
Utility analysis shows that the practical value of a test depends on more than its validity coefficient. If the base rate of success is moderate (not too high or too low), the selection ratio is competitive (many applicants per position), and the cost of a hiring mistake is high, even r = 0.30 can produce substantial economic benefit. Conversely, a test with r = 0.50 may have little practical value if nearly all applicants succeed anyway (high base rate) or if positions always get filled (selection ratio = 1). Validity alone does not determine usefulness.
Question 5 Short Answer
Why is r² a more useful way to interpret a validity coefficient than r alone, and what implication does this have for tests with 'moderate' validity (r ≈ 0.40–0.50)?
Think about your answer, then reveal below.
Model answer: r² represents the proportion of criterion variance accounted for by the test, which is the direct measure of how much the test reduces prediction error. r is a correlation coefficient whose scale is not directly interpretable as a proportion. A validity of r = 0.40 seems substantial in isolation, but r² = 0.16 reveals that 84% of criterion variance is still unexplained — the test has real but limited predictive power. This matters for setting expectations: tests with 'good' validity coefficients still leave most of the criterion outcome to be explained by other factors. It also guards against overconfidence in prediction.
The common error is treating r as if it directly measures predictive accuracy or explained variance, when it measures only the linear association strength. Squaring r to obtain r² — the coefficient of determination — converts the correlation into a proportion-of-variance metric that is directly interpretable.