A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Measurement Validity: Construct and Criterion Evidence

College · Depth 115 in the knowledge graph

4 topics build on this

562 prerequisites beneath it

Core Idea

Construct validity asks: Does the measure assess the intended construct? Evidence comes from content validity, convergent validity (strong correlations with measures of the same or related constructs), discriminant validity (weak correlations with measures of unrelated constructs), and factor structure. Criterion validity asks: Does the measure predict relevant outcomes? Both are integral to score interpretation and use.

How It's Best Learned

Review validation studies for a psychological measure, extracting evidence of construct and criterion validity. Compare a measure with high internal consistency but low validity to understand that reliability ≠ validity. Practice evaluating whether a measure is valid for a new use.

Common Misconceptions

- Validity is inherent to a test.
- Validity is determined by a single correlate.
- High internal consistency ensures validity.
- Validity is about group means, not individual scores.
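
The misconception that high internal consistency ensures validity can be checked numerically. The following is a minimal sketch on simulated data, not a validation procedure: it assumes a hypothetical six-item scale whose items all reflect one nuisance trait that is independent of the construct the scale claims to measure. The names `cronbach_alpha`, `nuisance`, and `intended`, and all numeric settings, are illustrative assumptions. Cronbach's alpha is computed with the standard formula, k/(k-1) times (1 minus the sum of item variances divided by the variance of total scores).

```python
import math
import random
from statistics import variance

def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def cronbach_alpha(items):
    """items: list of k lists, one list of respondent scores per item."""
    k = len(items)
    totals = [sum(row) for row in zip(*items)]
    item_var_sum = sum(variance(col) for col in items)
    return k / (k - 1) * (1 - item_var_sum / variance(totals))

rng = random.Random(0)  # fixed seed so the sketch is repeatable
n, k = 2000, 6

# Assumption: every item reflects the same nuisance trait plus item-level noise,
# and the intended construct is statistically independent of that trait.
nuisance = [rng.gauss(0, 1) for _ in range(n)]
intended = [rng.gauss(0, 1) for _ in range(n)]
items = [[t + rng.gauss(0, 0.7) for t in nuisance] for _ in range(k)]
totals = [sum(row) for row in zip(*items)]

alpha = cronbach_alpha(items)
validity_r = pearson(totals, intended)
print(f"alpha = {alpha:.2f}, r with intended construct = {validity_r:.2f}")
```

Under these assumptions the items agree closely with one another, so alpha comes out high, while the total score is nearly uncorrelated with the intended construct. In real data the intended construct is not directly observable, which is why validity evidence has to come from several indirect sources.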

Explainer

Validity is often summarized as "does the test measure what it claims to measure?" but this framing obscures something important: validity is not a property of a test in isolation. It is a property of the interpretations and uses made from test scores. A depression measure might have strong validity evidence in clinical adult populations but poor validity when used with adolescents or in non-Western cultural contexts. From your study of reliability, you know that a measure can be highly consistent without measuring anything meaningful — a bathroom scale that consistently reads 10 pounds too heavy is reliable but systematically invalid.
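
The point that validity evidence belongs to a population and a use, not to the test itself, can be illustrated with a small simulation. This is a sketch under stated assumptions: `simulate_group` and its `loading` parameter are hypothetical, with `loading` standing in for how strongly the items reflect the target construct in a given population. Nothing here is derived from real data on any actual measure.

```python
import math
import random

def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def simulate_group(rng, n, loading):
    """Return (scores, criterion) for one simulated population.

    loading: hypothetical strength with which the test items reflect
    the target construct in this population.
    """
    latent = [rng.gauss(0, 1) for _ in range(n)]
    scores = [loading * t + rng.gauss(0, 0.5) for t in latent]
    criterion = [t + rng.gauss(0, 0.5) for t in latent]
    return scores, criterion

rng = random.Random(1)  # fixed seed so the sketch is repeatable

# Same test, two populations: the one it was developed in, and a new one
# where (by assumption) the items barely reflect the construct.
r_original = pearson(*simulate_group(rng, 2000, loading=1.0))
r_new = pearson(*simulate_group(rng, 2000, loading=0.1))

print(f"original population: r = {r_original:.2f}")
print(f"new population:      r = {r_new:.2f}")
```

The same scoring procedure yields a strong validity coefficient in one simulated group and a weak one in the other, which is the sense in which evidence gathered in one population does not automatically transfer to another.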

Construct validity is the umbrella concept. It asks: does the pattern of relationships this measure forms with other variables make sense given our theoretical understanding of the construct? Evidence accumulates through multiple lines. Content validity evaluates whether the items cover the theoretical domain adequately; a math anxiety scale that only asks about algebra anxiety has poor content coverage if the construct is meant to encompass all mathematical domains. Convergent validity asks whether the measure correlates with other measures of the same or similar constructs; a new depression scale should correlate strongly with established depression measures such as the BDI and PHQ-9. Discriminant validity (sometimes called divergent validity) asks the opposite: the measure should *not* correlate strongly with theoretically unrelated constructs, and it should correlate less strongly with related-but-distinct constructs than with measures of the same construct. Depression and anxiety are related, so some correlation is expected, but a depression scale with a .80 correlation with an anxiety scale raises questions about whether the two scales are measuring distinct constructs.
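
The convergent and discriminant checks described above amount to comparing correlations. Here is a minimal sketch on simulated data, with all variable names hypothetical. It assumes a latent depression score, a latent anxiety score that correlates 0.5 with it (an arbitrary illustrative value), and a variable unrelated to both.

```python
import math
import random

def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

rng = random.Random(2)  # fixed seed so the sketch is repeatable
n = 2000

# Latent traits (unobservable in practice; visible here only because we simulate).
depression = [rng.gauss(0, 1) for _ in range(n)]
# Assumption: anxiety shares a latent correlation of 0.5 with depression.
anxiety = [0.5 * d + math.sqrt(0.75) * rng.gauss(0, 1) for d in depression]

# Observed scores: each measure is its latent trait plus measurement noise.
new_scale = [d + rng.gauss(0, 0.5) for d in depression]
established_scale = [d + rng.gauss(0, 0.5) for d in depression]
anxiety_scale = [a + rng.gauss(0, 0.5) for a in anxiety]
unrelated_measure = [rng.gauss(0, 1) for _ in range(n)]

convergent_r = pearson(new_scale, established_scale)
related_r = pearson(new_scale, anxiety_scale)
discriminant_r = pearson(new_scale, unrelated_measure)

print(f"convergent (same construct):   {convergent_r:.2f}")
print(f"related but distinct (anxiety): {related_r:.2f}")
print(f"discriminant (unrelated):       {discriminant_r:.2f}")
```

The pattern to look for is an ordering: the same-construct correlation should be the largest, the related-construct correlation clearly smaller, and the unrelated correlation near zero. If the second value approached the first, that would be the warning sign the paragraph describes.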

Criterion validity is a separate but related question: does the measure predict relevant real-world outcomes? Concurrent validity examines correlation with a gold-standard criterion measured at the same time. For example, does a new brief cognitive screening tool correlate with a full neuropsychological battery administered simultaneously? Predictive validity examines whether the measure predicts future outcomes. For example, does a pre-employment personality scale predict actual job performance one year later? The distinction matters practically: a measure can have strong construct validity but weak predictive validity if the construct itself is only weakly related to the outcome you care about.
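
Concurrent and predictive validity coefficients are both correlations between test scores and a criterion; they differ in when the criterion is measured. The sketch below uses simulated data and hypothetical names (`test_score`, `criterion_now`, `outcome_later`). It assumes the later outcome depends only weakly on the construct (a coefficient of 0.3, chosen for illustration), to show how a measure can track its construct well and still predict an outcome poorly.

```python
import math
import random

def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

rng = random.Random(3)  # fixed seed so the sketch is repeatable
n = 2000

construct = [rng.gauss(0, 1) for _ in range(n)]

# The new test and a gold-standard criterion, both taken at the same time.
test_score = [c + rng.gauss(0, 0.5) for c in construct]
criterion_now = [c + rng.gauss(0, 0.5) for c in construct]

# Assumption: the future outcome is driven mostly by other factors,
# with only a weak contribution (0.3) from the construct.
outcome_later = [0.3 * c + rng.gauss(0, 1) for c in construct]

concurrent_r = pearson(test_score, criterion_now)
predictive_r = pearson(test_score, outcome_later)

print(f"concurrent validity coefficient: {concurrent_r:.2f}")
print(f"predictive validity coefficient: {predictive_r:.2f}")
```

In this setup the weak predictive coefficient is not a flaw in the test's measurement of the construct; it reflects the assumed weak link between the construct and the outcome.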

The unifying framework from contemporary psychometrics is that validity evidence is cumulative and argument-based. No single study "validates" a measure; rather, validation is an ongoing process of assembling a coherent validity argument — a chain of claims from test scores to interpretations to uses, with evidence supporting each link. When validity evidence is missing for a specific use case (a new population, a new purpose, a new context), the burden falls on the test user to either generate that evidence or acknowledge the inferential gap. This is why the phrase "this test is valid" is technically imprecise — the proper phrasing is always "the interpretation of these scores as measuring X in this population for this purpose has strong/weak validity evidence."

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Angle Pairs: Complementary, Supplementary, and Vertical → Parallel Lines and Transversals → Corresponding Angles → Alternate Interior Angles → Triangle Angle Sum Theorem → Exterior Angle Theorem → 
Triangle Inequality Theorem → Similar Triangles: AA Similarity → Similar Triangles: SSS and SAS Similarity → Proportions in Similar Triangles → Right Triangle Trigonometry Introduction → Sine, Cosine, and Tangent Ratios → Trigonometric Ratios Review → Radian Measure → Converting Between Degrees and Radians → The Unit Circle → Graphing Sine and Cosine → Graphing Tangent and Reciprocal Trigonometric Functions → Derivatives of Trigonometric Functions → Antiderivatives → Indefinite Integrals → Basic Integration Rules → Riemann Sums → Definite Integral Definition → Probability Density Functions and Continuous Distributions → Cumulative Distribution Functions → Continuous Random Variables → Probability Density Functions → Expected Value → Weak Law of Large Numbers → Probability Axioms and Rules → Conditional Probability → Conditional Distributions → Bivariate Normal Distribution → Normal Distribution → Standard Normal Distribution and Z-Scores → Hypothesis Testing Fundamentals → Experimental Research Design → Control and Experimental Groups → Random Assignment → Confounding Variables and Internal Validity → Blinding and Demand Characteristics → Validity in Psychological Measurement → Inferential Statistics in Psychology → Effect Size and Statistical Power → Sample Size Determination in Research Planning → Literature Review and Research Synthesis → Hypothesis Construction: Directional and Nondirectional Predictions → Operationalizing Independent and Dependent Variables → Construct Definition and Measurement Development → Construct Validity and Measurement Validity → Construct Validity and Operationalization of Psychological Constructs → Variables: Definition, Operationalization, and Measurement → Measurement Reliability: Types and Estimation → Measurement Validity: Construct and Criterion Evidence

Longest path: 116 steps · 562 total prerequisite topics

Prerequisites (3)

Variables: Definition, Operationalization, and Measurement (hard)
Measurement Reliability: Types and Estimation (soft)
Construct Validity and Measurement Validity (soft)

Leads To (1)

Implicit Association Test and Implicit Bias Measurement (soft)