Questions — Item Selection and Item Pool Development for Tests

Question 1 Multiple Choice

A test developer selects the 20 items with the highest discrimination indices from a pilot pool of 60 items and declares the test complete. What is the most significant problem with this approach?

ATwenty items is too few — reliability requires at least 40 items

BUsing only the top discriminating items may produce a test that covers only a narrow slice of the construct, violating content validity

CDiscrimination indices are not meaningful until the final test is assembled

DHigh-discrimination items are typically too difficult for most test-takers

Question 2 Multiple Choice

An item on a medical licensing exam has a p-value of 0.97, meaning 97% of test-takers answered it correctly. What does this tell you about its contribution to the test?

AIt is an excellent item — all examinees answered it correctly, confirming mastery of this content area

BIt contributes almost nothing to reliability because it produces virtually no variance — nearly everyone passes it regardless of true ability

CIts discrimination index will be high because most examinees got it right

DIt should be removed only if it also has low face validity

Question 3 True / False

An item that every test-taker answers correctly contributes nothing to the reliability of the test.

TTrue

FFalse

Question 4 True / False

The goal of item selection is to maximize the average discrimination index across most selected items, regardless of other considerations.

TTrue

FFalse

Question 5 Short Answer

Why must test developers write an item pool two to three times larger than the final test, rather than writing exactly the items they intend to use?

Think about your answer, then reveal below.

Questions: Item Selection and Item Pool Development for Tests