A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Item Difficulty and Item Discrimination Analysis

Research Depth 99 in the knowledge graph ☐ I know this ☆ Set as goal

8topics build on this

501prerequisites beneath it

Classical Test Theory Foundations Item Response Functions and Item Characteristic Curves→→Classical and IRT-Based Item Analysis Compared Differential Item Functioning and Test Bias Detection +2 more

Core Idea

Item difficulty is the proportion of test-takers answering an item correctly; item discrimination is the correlation between item response and total score (point-biserial correlation). These indices identify problematic items that fail to contribute effectively to score precision and test reliability.

How It's Best Learned

Calculate p-values and discrimination indices for classroom or standardized test data. Create item analysis reports identifying items for revision or removal based on statistical evidence.

Common Misconceptions

Very high difficulty (p-value near 1.0) is always undesirable. Easy items can be valuable for confidence and accessibility. Similarly, low discrimination doesn't automatically warrant item removal; consider construct relevance and test purpose.

Explainer

Classical test theory and item response functions, which you've studied as prerequisites, both treat individual test items as the unit of analysis for understanding test quality. Item difficulty and discrimination are the two most basic numerical summaries of how a single item is performing — together they are the workhorses of practical test development, review, and revision.

Item difficulty in classical test theory is expressed as the p-value — not the statistical significance p-value, but the proportion of test-takers answering the item correctly. A p-value of 0.80 means 80% answered correctly; 0.30 means 30% did. The scale is counterintuitive: higher p-value means an easier item. For a test designed to discriminate across a wide range of ability, items near p = 0.50 contribute the most information because they split the group. Very easy items (p near 1.0) and very hard items (p near 0.0) tell you little about individual differences — almost everyone gets them right or wrong regardless of ability. But p-value targets must match test purpose: a mastery certification test may legitimately include many easy items if the threshold skill is expected of nearly all competent performers.

Item discrimination measures whether the item distinguishes between high and low scorers on the test overall. The most common index is the point-biserial correlation — the correlation between item response (0 = wrong, 1 = right) and total score. A high point-biserial (typically 0.30+ is considered good) means high scorers mostly got this item right and low scorers mostly got it wrong — the item is pulling in the same direction as the test. A near-zero discrimination means the item is essentially noise, contributing no information about the underlying construct. A *negative* discrimination is a red flag: high-scoring students are getting the item wrong more often than low scorers, which usually signals a miskeyed item (the wrong answer recorded as correct) or a genuinely ambiguous question.

The connection to item response theory (IRT) from your prerequisite is direct: IRT's difficulty parameter (*b*) is a more principled version of the p-value, estimated from the full item characteristic curve rather than a simple proportion. IRT's discrimination parameter (*a*) corresponds to the slope of the curve at the difficulty point — which is what the point-biserial is approximating in simpler form. Classical indices are computationally transparent and sufficient for most routine test review; IRT provides more information at the cost of greater complexity and larger sample requirements. In practice, item analysis combines both indices alongside expert review: statistics diagnose problems, but content knowledge determines the remedy.

Practice Questions 5 questions

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Angle Pairs: Complementary, Supplementary, and Vertical → Parallel Lines and Transversals → Corresponding Angles → Alternate Interior Angles → Triangle Angle Sum Theorem → Exterior Angle Theorem → Triangle Inequality Theorem → Similar Triangles: AA Similarity → Similar Triangles: SSS and SAS Similarity → Proportions in Similar Triangles → Right Triangle Trigonometry Introduction → Sine, Cosine, and Tangent Ratios → Trigonometric Ratios Review → Radian Measure → Converting Between Degrees and Radians → The Unit Circle → Graphing Sine and Cosine → Graphing Tangent and Reciprocal Trigonometric Functions → Derivatives of Trigonometric Functions → Antiderivatives → Indefinite Integrals → Basic Integration Rules → Riemann Sums → Definite Integral Definition → Probability Density Functions and Continuous Distributions → Cumulative Distribution Functions → Continuous Random Variables → Probability Density Functions → Expected Value → Weak Law of Large Numbers → Probability Axioms and Rules → Conditional Probability → Independence of Events → Sampling Distributions → Standard Error of Estimators → Hypothesis Testing: Framework and Logic → Classical Test Theory Foundations → Item Response Functions and Item Characteristic Curves → Item Difficulty and Item Discrimination Analysis

Longest path: 100 steps · 501 total prerequisite topics

Prerequisites (2)

Item Response Functions and Item Characteristic Curveshard Classical Test Theory Foundationshard

Leads To (4)

Classical and IRT-Based Item Analysis Comparedhard Differential Item Functioning and Test Bias Detectionhard Distractor Analysis and Item Optimizationhard Item Selection and Item Pool Development for Testshard