Instrumental Variables Estimation

Research Depth 75 in the knowledge graph I know this Set as goal
Unlocks 2 downstream topics
instrumental-variables endogeneity two-stage-least-squares IV-validity

Core Idea

Introduces instrumental variables as a solution to endogeneity when confounders are unobserved. Covers IV assumptions (relevance, exclusion restriction), two-stage least squares estimation, instrument validity testing, and weak instrument problems with examples from social research.

How It's Best Learned

Identify potential instruments in published studies, design IV estimations with real data, test instrument strength and validity assumptions, practice interpreting TSLS results.

Common Misconceptions

Explainer

You already know from linear regression that the ordinary least squares (OLS) estimator finds the line minimizing prediction error. And from causal inference in observational data, you know the central problem: OLS gives unbiased estimates *only* if the error term is uncorrelated with the treatment variable. When unmeasured confounders simultaneously affect the treatment and outcome, the regression coefficient is endogenous — it captures both the causal effect and the confounding, making it impossible to isolate either from the other.

The instrumental variables (IV) strategy solves this by finding a third variable — the instrument — that affects the treatment but affects the outcome *only through* the treatment. Think of a natural experiment: the Vietnam War draft lottery assigned military service by birth date. Birth date is essentially random with respect to earnings (it's not a confounder), yet it affected who served. Economists use birth date as an instrument for military service to estimate the causal effect of service on lifetime earnings. The instrument creates exogenous variation in the treatment — variation not contaminated by unobserved confounders.

This logic formalizes into two conditions. Relevance means the instrument must actually predict the treatment — the correlation between instrument and treatment must be nonzero and ideally strong. You can test this: regress treatment on instrument and check the F-statistic; a rule of thumb is F > 10. The exclusion restriction says the instrument affects the outcome only through the treatment, not through any other channel. This is *untestable* — it must be defended on theoretical and contextual grounds. A weak exclusion restriction argument is the most common fatal flaw in IV studies.

Two-Stage Least Squares (2SLS) is the standard estimator. In stage 1, regress treatment on the instrument to extract only the exogenous variation in treatment. In stage 2, regress outcome on the stage-1 fitted values — the "clean" part of treatment variation. The matrix algebra you know makes this tractable: the 2SLS estimator is (Z'X)⁻¹Z'y, where Z is the instrument matrix, X the treatment, and y the outcome. In practice, software handles this, but understanding the algebra clarifies what's being estimated: only the variation in treatment driven by the instrument identifies the causal effect.

There is a crucial limitation: IV estimates Local Average Treatment Effects (LATE) — the causal effect only for the subgroup whose treatment status was actually changed by the instrument (called "compliers"). People who always take the treatment regardless of the instrument, or never do regardless, are not informative for the 2SLS estimate. This means IV findings may not generalize to the full population. The tradeoff is real: IV gives you cleaner causal identification than OLS, but at the cost of estimating an effect for a sometimes-narrow, sometimes-uncharacterized subgroup.

What did you take from this?

Topics in reflective domains aren't scored by quiz answers. Read, reflect, and mark when you've thought it through.

Quiz me anyway →

Prerequisite Chain

Counting to 10Counting to 20Understanding ZeroThe Number ZeroCounting to FiveOne-to-One CorrespondenceCombining Small Groups Within 5Addition Within 10Addition Within 20Two-Digit Addition Without RegroupingTwo-Digit Addition with RegroupingAddition Within 100Repeated Addition as MultiplicationMultiplication Facts Within 100Division as Equal SharingDivision as Grouping (Measurement Division)Division: Grouping (Repeated Subtraction) ModelDivision: Fair Sharing ModelDivision as Equal SharingDivision as GroupingBasic Division FactsDivision Facts Within 100Two-Digit by One-Digit DivisionDivision with RemaindersRemainders and Quotients in DivisionDivision Word ProblemsIntroduction to Long DivisionFactors and MultiplesPrime and Composite NumbersEquivalent FractionsRelating Fractions and DecimalsDecimal Place ValueReading and Writing DecimalsComparing and Ordering DecimalsAdding and Subtracting DecimalsMultiplying DecimalsDividing DecimalsDividing FractionsMixed Number ArithmeticOrder of OperationsInteger Order of OperationsVariable ExpressionsCombining Like TermsOne-Step EquationsTwo-Step EquationsSolving Multi-Step EquationsEquations with Variables on Both SidesAngle Pairs: Complementary, Supplementary, and VerticalParallel Lines and TransversalsCorresponding AnglesAlternate Interior AnglesTriangle Angle Sum TheoremExterior Angle TheoremTriangle Inequality TheoremSimilar Triangles: AA SimilaritySimilar Triangles: SSS and SAS SimilarityProportions in Similar TrianglesRight Triangle Trigonometry IntroductionTrigonometric Ratios ReviewRadian MeasureConverting Between Degrees and RadiansThe Unit CircleGraphing Sine and CosineGraphing Tangent and Reciprocal Trigonometric FunctionsDerivatives of Trigonometric FunctionsAntiderivativesIndefinite IntegralsBasic Integration RulesRiemann SumsDefinite Integral DefinitionProbability Density Functions and Continuous DistributionsCumulative Distribution FunctionsContinuous Random VariablesProbability Density FunctionsCausal Inference from Observational DataInstrumental Variables Estimation

Longest path: 76 steps · 436 total prerequisite topics

Prerequisites (8)

Leads To (2)