A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Advanced Regression Discontinuity Design

Research · Depth 115 in the knowledge graph

1,007 prerequisites beneath it

Core Idea

Regression discontinuity design exploits threshold rules in policy assignment to estimate causal effects. When eligibility for treatment depends on crossing a cutoff (income threshold, test score, age), units just above and below the threshold are comparable except for treatment status. RDD requires no assumption of ignorability; instead, identification relies on the assumption that other determinants of the outcome vary smoothly across the threshold. Advanced RDD addresses multiple thresholds, bandwidth selection, and validity checks (density tests, covariate continuity).

Explainer

You've already grasped the core logic of RDD: when treatment assignment depends on crossing a threshold, units just above and below the cutoff are as-good-as randomly assigned near that threshold, and the jump in outcomes at the cutoff estimates the causal effect of treatment. This is powerful because it demands only one credible assumption — that other outcome determinants vary smoothly across the cutoff — rather than the full ignorability required by observational regression. Advanced RDD extends this logic to harder identification problems and more demanding validity requirements.
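In potential-outcomes notation, the sharp-design estimand described above can be sketched as follows. The symbols are introduced here for illustration and do not appear in the original text: $X$ is the running variable, $c$ the cutoff, $Y(1)$ and $Y(0)$ the treated and untreated potential outcomes, and treatment is assumed to switch on at or above $c$.

```latex
\tau_{\mathrm{SRD}}
  = \lim_{x \downarrow c} \mathbb{E}[Y \mid X = x]
  - \lim_{x \uparrow c} \mathbb{E}[Y \mid X = x]
  = \mathbb{E}\bigl[\,Y(1) - Y(0) \mid X = c\,\bigr]
```

The second equality holds only if $\mathbb{E}[Y(0) \mid X = x]$ and $\mathbb{E}[Y(1) \mid X = x]$ are continuous in $x$ at $c$, which is the smoothness assumption stated above.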

Bandwidth selection is where estimation becomes technically non-trivial. The RDD estimator works locally: you use only observations near the cutoff, where the as-if-random assumption is most credible. Observations far from the cutoff are informative about the regression function's shape but are weaker counterfactuals for units right at the threshold. The bandwidth trades off bias against variance: a wider bandwidth leans more heavily on the fitted functional form away from the cutoff, which raises potential bias, while a narrower bandwidth leaves fewer observations and therefore noisier estimates. The Calonico-Cattaneo-Titiunik (CCT) optimal bandwidth selector formalizes this tradeoff using a mean squared error criterion. In practice, researchers report estimates at the optimal bandwidth and check sensitivity by varying the bandwidth; results that evaporate at other bandwidths are fragile.
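The bandwidth sensitivity check can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the CCT procedure: the function name `local_linear_rd` is hypothetical, the data are simulated with a known jump of 2.0, and the bandwidths are hand-picked rather than chosen by an MSE-optimal selector. For real analyses, dedicated software such as the rdrobust package is the usual choice.

```python
import numpy as np

def local_linear_rd(x, y, cutoff, h):
    """Sharp RD jump: triangular-kernel local linear fit on each side of cutoff."""
    xc = np.asarray(x, dtype=float) - cutoff
    y = np.asarray(y, dtype=float)
    w = np.clip(1.0 - np.abs(xc) / h, 0.0, None)  # triangular kernel weights
    keep = w > 0
    xc, y, w = xc[keep], y[keep], w[keep]
    d = (xc >= 0).astype(float)                   # treated at or above the cutoff
    # Columns: intercept, jump at cutoff, slope below, change in slope above
    X = np.column_stack([np.ones_like(xc), d, xc, d * xc])
    sw = np.sqrt(w)                               # weighted least squares via sqrt weights
    beta, *_ = np.linalg.lstsq(X * sw[:, None], y * sw, rcond=None)
    return float(beta[1]), int(keep.sum())

# Simulated data (assumed for illustration): true jump of 2.0 at cutoff 0
rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, 4000)
y = 1.0 + 0.8 * x + 2.0 * (x >= 0) + rng.normal(0.0, 0.5, x.size)

# Sensitivity check: re-estimate across hand-picked bandwidths
for h in (0.1, 0.2, 0.4, 0.8):
    est, n_used = local_linear_rd(x, y, cutoff=0.0, h=h)
    print(f"h={h:.1f}  n={n_used:4d}  estimated jump={est:.3f}")
```

Narrower bandwidths use fewer observations, so their estimates should bounce around more. An estimate that changes sign or collapses toward zero as `h` moves is the fragility the paragraph above warns about.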

Validity diagnostics are not formalities; they constitute the empirical argument that your design is identifying a causal effect. The McCrary density test checks whether there is a discontinuity in the density of the running variable at the cutoff. If units can manipulate precisely which side of the threshold they fall on, the as-if-random assumption fails. For example, if administrators nudge borderline students over a scholarship cutoff, the density shows a suspicious spike just above it. Covariate continuity tests check that pre-determined baseline characteristics are continuous at the cutoff: a jump in prior income or age at the threshold (absent a theoretical explanation) signals contamination. Placebo cutoff tests apply the design at other values of the running variable where no treatment discontinuity exists; finding effects at placebo cutoffs suggests the real result may be spurious.
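The intuition behind the density check can be sketched as below. This is a deliberately crude stand-in, not the McCrary test itself: it only compares raw counts in a narrow window on each side of the cutoff and assumes the density is locally flat there. The function name `density_jump_check`, the cutoff of 60, and the simulated manipulation are all assumptions made for illustration.

```python
import math
import numpy as np

def density_jump_check(x, cutoff, h):
    """Crude manipulation check: compare counts within h on each side of cutoff."""
    x = np.asarray(x, dtype=float)
    below = int(np.sum((x >= cutoff - h) & (x < cutoff)))
    above = int(np.sum((x >= cutoff) & (x < cutoff + h)))
    n = below + above
    if n == 0:
        raise ValueError("no observations within h of the cutoff")
    # If the density is locally flat, each nearby unit is equally likely to be
    # on either side, so `above` is Binomial(n, 0.5); use a normal approximation.
    z = (above - below) / math.sqrt(n)
    p_value = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided
    return below, above, z, p_value

rng = np.random.default_rng(1)
scores = rng.uniform(0.0, 100.0, 5000)            # simulated running variable
print("no manipulation:  ", density_jump_check(scores, cutoff=60.0, h=2.0))

# Assumed manipulation: half of the units scoring 58-60 are bumped to 60-62
bumped = scores.copy()
near_miss = np.flatnonzero((bumped >= 58.0) & (bumped < 60.0))
moved = near_miss[: near_miss.size // 2]
bumped[moved] += 2.0
print("with manipulation:", density_jump_check(bumped, cutoff=60.0, h=2.0))
```

A small p-value in the manipulated sample reflects the excess mass just above the cutoff. Where the running variable's density slopes naturally near the threshold, this count comparison can mislead, which is one reason a local-polynomial density test is preferred in practice.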

Multiple thresholds arise when a policy applies different treatments at several cutoffs — income brackets for different subsidy levels, test score thresholds for different program tracks. Each threshold yields a local average treatment effect (LATE) for the subpopulation near that specific cutoff, and these estimates need not agree: treatment effects may vary by the level of the running variable. Comparing estimates across thresholds reveals treatment effect heterogeneity and can test whether the running variable moderates the effect. The discipline throughout advanced RDD is remembering what you are identifying: an effect for units at the margin, not a population average. Whether that local effect generalizes beyond the threshold is a substantive question about mechanism — and it cannot be answered by the design alone.
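A minimal sketch of threshold-by-threshold estimation follows, assuming a simulated test-score running variable with program tracks starting at 30 and 60. The function name `rd_jump`, the cutoffs, and the true effects of 1.0 and 3.0 are made up for illustration. The estimator is a plain uniform-kernel local linear fit, used here only to keep the example short.

```python
import numpy as np

def rd_jump(x, y, cutoff, h):
    """Jump at one cutoff from separate straight-line fits within h on each side."""
    left = (x >= cutoff - h) & (x < cutoff)
    right = (x >= cutoff) & (x <= cutoff + h)
    # np.polyfit with degree 1 returns [slope, intercept]; centering x at the
    # cutoff makes each intercept the fitted outcome at the threshold itself.
    _, at_cutoff_from_left = np.polyfit(x[left] - cutoff, y[left], 1)
    _, at_cutoff_from_right = np.polyfit(x[right] - cutoff, y[right], 1)
    return float(at_cutoff_from_right - at_cutoff_from_left)

# Simulated policy (assumed): program tracks begin at scores 30 and 60,
# with different true effects of 1.0 and 3.0
rng = np.random.default_rng(2)
score = rng.uniform(0.0, 100.0, 6000)
outcome = (0.05 * score + 1.0 * (score >= 30) + 3.0 * (score >= 60)
           + rng.normal(0.0, 0.5, score.size))

# h=10 keeps each estimation window clear of the other cutoff
for c in (30.0, 60.0):
    effect = rd_jump(score, outcome, c, h=10.0)
    print(f"cutoff={c:.0f}  estimated local effect={effect:.3f}")
```

Each estimate describes only units near its own cutoff. The bandwidth is kept below the gap between cutoffs so that neither window picks up the other threshold's jump.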

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Introduction to Exponents → Order of Operations → Integer Order of Operations → Variable Expressions → The Distributive Property → Variables and Expressions Review → Introduction to Polynomials → Adding and Subtracting Polynomials → Multiplying Polynomials → Factorial → Permutations → Combinations → Counting Principles: Addition and Multiplication Rules → Introduction to Graph Theory → Propositional Logic Foundations → Logical Equivalences → 
Boolean Algebra → Introduction to Propositional Logic → Introduction to Predicate Logic (First-Order Logic) → First-Order Logic Syntax → Terms and Atomic Formulas in FOL → Variable Binding and Scope → Open and Closed Formulas in First-Order Logic → Variable Substitution and Capture-Avoidance in First-Order Logic → Quantifier Instantiation Rules in First-Order Proof Systems → Universal Quantification: Meaning and Scope → Free Variables and Bound Variables → Substitution and Instantiation in Predicate Logic → Terms and Atomic Formulas → Formulas and Well-Formed Expressions → Structures and Interpretations → Model Interpretation and Satisfaction → Interpretation, Truth, and Satisfaction of Formulas → Logical Consequence and Entailment → Soundness Theorem and Validity of Proof Systems → Deductive Reasoning and Formal Proof Systems → First-Order Resolution → Propositional Resolution → Semantic Tableaux (Propositional) → Semantic Tableaux (First-Order) → Decidable Fragments of First-Order Logic → Gödel's Completeness Theorem for First-Order Logic → Gödel's Incompleteness Theorems → Introduction to Intuitionistic Logic → Introduction to Modal Logic → Compatibilism → Moral Responsibility → Moral Psychology → Moral Sentiments and Emotions → Care Ethics → Rational Choice and Ethics → Contractarian Moral Foundations → Moral Foundations and Intuitions → Moral Relativism → Introduction to Applied Ethics → Bioethics: Foundations → Medical Ethics & Patient Autonomy → Informed Consent & Research Ethics → Research Ethics: Human Subjects Protection → Ethnographic Fieldwork: Positionality and Research Ethics → Ethnographic Interviewing and Qualitative Data Collection → Advanced Ethnographic Methods → Longitudinal Qualitative Research Design → Advanced Regression Discontinuity Design

Longest path: 116 steps · 1,007 total prerequisite topics

Prerequisites (7)

Regression Discontinuity: Sharp and Fuzzy Designs (hard)
Causal Inference from Observational Data (soft)
Interrupted Time Series Design (soft)
Fixed and Random Effects Models (soft)
Focus Group Research Design (soft)
Longitudinal Qualitative Research Design (soft)
Count Data Regression: Poisson and Negative Binomial Models (soft)

Leads To (0)

No topics depend on this one yet.