Internal Validity and Threats to Experimental Control

College Depth 53 in the knowledge graph I know this Set as goal
Unlocks 60 downstream topics
validity experimental-design causal-inference threats

Core Idea

Internal validity refers to the degree to which a study can demonstrate a true causal relationship between an independent and dependent variable, free from confounding influences. Threats to internal validity include history, maturation, testing, instrumentation, regression to the mean, and selection bias. Understanding these specific threats enables researchers to design controls that eliminate plausible alternative explanations for observed effects. Strong internal validity is essential for causal claims, though it may require trade-offs with ecological authenticity.

How It's Best Learned

Study classic examples where internal validity is compromised (e.g., the Hawthorne effect, practice effects from pre-testing). Analyze published experiments to identify which validity threats were addressed and which remain.

Common Misconceptions

Internal validity means the study is well-designed overall (actually, it specifically means causal conclusions are justified). A study with perfect internal validity automatically has high external validity (actually, gains in control often reduce generalizability).

Explainer

From your study of experimental research design, you know that the logic of experimentation is to manipulate one variable while holding everything else constant, then attribute any resulting change in the outcome to the manipulation. Internal validity is the formal name for the degree to which that inference is justified — whether the observed change in the dependent variable was truly caused by the independent variable and nothing else. Every threat to internal validity is a specific alternative explanation: a plausible reason why the outcome might have changed even if the manipulation had no effect.

The most important threats to learn, and the ones you will encounter in published research, are: history (an external event occurred during the study that could explain the outcome — a news story breaks while you're measuring attitudes, or a school fire drill interrupts your experiment); maturation (participants naturally change over time regardless of your intervention — children get older, people get tired, a condition resolves spontaneously); testing effects (taking the pretest sensitizes participants to the topic or teaches them the answers, so gains on the posttest reflect learning from the test itself rather than the intervention); and instrumentation (the measurement procedure changes between assessments — observers recalibrate their rating standards, a scale loses calibration, or the same rater becomes more lenient over time).

Two more threats require particular attention because they are less intuitively obvious. Regression to the mean occurs because participants selected for extreme scores — the most depressed patients, the lowest-performing students — are partly selected for measurement error that pushed them to that extreme. On retest, their scores move toward the population mean regardless of any intervention. If you enroll only the highest-scorers on a pre-test and see lower scores afterward, regression may explain it entirely. Selection bias occurs when the groups being compared differ systematically before the manipulation begins — in a pretest-posttest design without random assignment, the treatment group may have been more motivated to begin with.

Controlled experiments address these threats primarily through random assignment, which distributes all known and unknown individual differences equally across conditions at baseline. But random assignment does not eliminate every threat — history, testing effects, and instrumentation can still operate. Each threat has corresponding design solutions: control groups to absorb history and maturation effects, Solomon four-group designs to separate testing effects from treatment effects, inter-rater reliability checks and standardized protocols to address instrumentation. The important skill is not memorizing the list of threats but diagnosing which ones are plausible in a specific study and evaluating whether the design actually rules them out.

Practice Questions 5 questions

Prerequisite Chain

Counting to 10Counting to 20Understanding ZeroThe Number ZeroCounting to FiveOne-to-One CorrespondenceCombining Small Groups Within 5Addition Within 10Addition Within 20Two-Digit Addition Without RegroupingTwo-Digit Addition with RegroupingAddition Within 100Repeated Addition as MultiplicationMultiplication Facts Within 100Division as Equal SharingDivision as Grouping (Measurement Division)Division: Grouping (Repeated Subtraction) ModelDivision: Fair Sharing ModelDivision as Equal SharingDivision as GroupingBasic Division FactsDivision Facts Within 100Two-Digit by One-Digit DivisionDivision with RemaindersRemainders and Quotients in DivisionDivision Word ProblemsIntroduction to Long DivisionFactors and MultiplesPrime and Composite NumbersEquivalent FractionsRelating Fractions and DecimalsDecimal Place ValueReading and Writing DecimalsComparing and Ordering DecimalsAdding and Subtracting DecimalsMultiplying DecimalsDividing DecimalsDividing FractionsMixed Number ArithmeticOrder of OperationsInteger Order of OperationsVariable ExpressionsFunction Notation ReviewRandom Variables: Definition and ClassificationJoint and Marginal DistributionsConditional Distributions of Random VariablesRandom VariablesSampling DistributionsHypothesis Testing FundamentalsExperimental Research DesignControl and Experimental GroupsRandom AssignmentConfounding Variables and Internal ValidityInternal Validity and Threats to Experimental Control

Longest path: 54 steps · 252 total prerequisite topics

Prerequisites (3)

Leads To (2)