Questions — Differential Item Functioning and Test Bias Detection

Question 1 Multiple Choice

A math test item is answered correctly by 72% of male examinees and 54% of female examinees. A researcher concludes the item shows DIF. What is wrong with this reasoning?

ADIF can only be detected using IRT, not raw score comparisons, so the method is invalid

BDIF requires showing that the group performance difference persists after conditioning on ability — the score gap alone could reflect genuine group differences in the construct, not item-specific bias

CThe score gap is too small to constitute DIF; a gap of at least 25 percentage points is needed

DDIF analysis requires the two groups to be matched in sample size before comparison

Question 2 Multiple Choice

A test of English language proficiency includes an item that shows DIF against non-native speakers. Content reviewers find that the item uses a grammatical construction that is genuinely difficult for non-native speakers at any given proficiency level because it targets a specific feature of advanced English grammar. How should this DIF be classified?

AAs bias requiring immediate removal — any DIF against a minority group is by definition biased

BPotentially as legitimate DIF — the differential functioning may reflect the target construct itself rather than irrelevant content

CAs negligible — DIF only matters when it affects groups by more than one standard deviation

DAs an IRT calibration error requiring the item to be recalibrated using the non-native speaker subsample

Question 3 True / False

A group scoring significantly lower on an overall test than another group provides sufficient statistical evidence that specific items in the test show DIF against the lower-scoring group.

TTrue

FFalse

Question 4 True / False

The Mantel-Haenszel method detects DIF by stratifying examinees into ability-matched subgroups and testing whether each item's difficulty is consistent across demographic groups within each stratum.

TTrue

FFalse

Question 5 Short Answer

Why is 'conditioning on ability' the essential step in DIF detection, and what does the analysis fail to show without it?

Think about your answer, then reveal below.

Questions: Differential Item Functioning and Test Bias Detection