External Validity and Generalization of Findings

College Depth 54 in the knowledge graph I know this Set as goal
Unlocks 59 downstream topics
validity generalization external

Core Idea

External validity is the degree to which findings generalize beyond the specific participants, settings, and times studied. Laboratory experiments with convenience samples and artificial tasks often have lower external validity than naturalistic or community-based studies. Balancing internal and external validity requires strategic trade-offs: tight experimental control strengthens causal inference but may reduce applicability to real-world contexts.

Explainer

Your prerequisite on internal validity established that a study has internal validity when its design supports a causal inference — when we can attribute the observed outcome to the manipulated variable rather than to confounds. But internal validity only answers "did X cause Y in this study?" External validity asks the harder follow-up: "so what?" — meaning, does the causal relationship found here hold in other places, with other people, at other times? A study can be perfectly internally valid and almost completely non-generalizable, and the history of psychology is full of cautionary examples.

There are three main threats to external validity. Population validity concerns whether findings generalize from the sample studied to other people. Psychology's most-criticized sampling problem is the WEIRD sample — participants from Western, Educated, Industrialized, Rich, and Democratic societies, often undergraduate students at research universities. Findings from such samples have repeatedly failed to replicate with different populations: the Mueller-Lyer illusion varies across cultures; conformity effects vary by individualist vs. collectivist contexts; even basic memory and perception phenomena show cross-cultural differences. Ecological validity concerns whether the laboratory setting captures the phenomenon as it operates in the real world. A memory study using lists of unrelated words is internally clean but may tell us little about how people remember personally meaningful events. Temporal validity concerns whether findings hold across time — social norms, technology, and cultural contexts change, and phenomena studied in one era may not replicate in another.

The central tension in research design is that the moves that maximize internal validity often threaten external validity, and vice versa. Random assignment to conditions, strict experimental control, standardized stimuli, and laboratory settings all increase confidence in causal inference but introduce artificiality. Naturalistic observation and field research capture behavior in its real context but sacrifice control. This is not a problem with a clean solution — it is a design trade-off that researchers navigate based on the question being asked. If you want to know *whether* a drug can work, a tightly controlled randomized trial is appropriate. If you want to know *whether* it works as prescribed in real clinical practice, you need effectiveness research in natural settings.

Replication is the scientific community's primary tool for establishing external validity over time. A single study, no matter how well designed, makes a narrow generalization claim. A finding that holds across multiple labs, diverse participant populations, varied operationalizations of the key construct, and different cultural contexts is far more likely to reflect a genuine phenomenon. The reproducibility crisis in psychology (many classic findings failed direct replication in large-sample attempts) renewed attention to external validity as distinct from internal validity — and reminded the field that p < .05 in one well-controlled study is only the beginning of the evidential story.

Practice Questions 5 questions

Prerequisite Chain

Counting to 10Counting to 20Understanding ZeroThe Number ZeroCounting to FiveOne-to-One CorrespondenceCombining Small Groups Within 5Addition Within 10Addition Within 20Two-Digit Addition Without RegroupingTwo-Digit Addition with RegroupingAddition Within 100Repeated Addition as MultiplicationMultiplication Facts Within 100Division as Equal SharingDivision as Grouping (Measurement Division)Division: Grouping (Repeated Subtraction) ModelDivision: Fair Sharing ModelDivision as Equal SharingDivision as GroupingBasic Division FactsDivision Facts Within 100Two-Digit by One-Digit DivisionDivision with RemaindersRemainders and Quotients in DivisionDivision Word ProblemsIntroduction to Long DivisionFactors and MultiplesPrime and Composite NumbersEquivalent FractionsRelating Fractions and DecimalsDecimal Place ValueReading and Writing DecimalsComparing and Ordering DecimalsAdding and Subtracting DecimalsMultiplying DecimalsDividing DecimalsDividing FractionsMixed Number ArithmeticOrder of OperationsInteger Order of OperationsVariable ExpressionsFunction Notation ReviewRandom Variables: Definition and ClassificationJoint and Marginal DistributionsConditional Distributions of Random VariablesRandom VariablesSampling DistributionsHypothesis Testing FundamentalsExperimental Research DesignControl and Experimental GroupsRandom AssignmentConfounding Variables and Internal ValidityInternal Validity and Threats to Causal InferenceExternal Validity and Generalization of Findings

Longest path: 55 steps · 253 total prerequisite topics

Prerequisites (2)

Leads To (2)