A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Causal Inference from Observational Data

Graduate Depth 100 in the knowledge graph ☐ I know this ☆ Set as goal

67topics build on this

591prerequisites beneath it

Advanced Research Design Conditional Probability +4 more→→Advanced Regression Discontinuity Design Difference-in-Differences +8 more

Core Idea

Synthesizes strategies for inferring causation from observational data when randomization is impossible or unethical. Covers the causal hierarchy (association, experimental, natural experiment), potential outcomes framework, confounding, backdoor and frontdoor criteria, and conditions for causal identification.

How It's Best Learned

Draw directed acyclic graphs (DAGs) for research questions, identify confounders, write causal models, discuss identification assumptions, evaluate whether different designs meet assumptions.

Common Misconceptions

Correlation never implies causation
More controls always improve causal inference
Unconfoundedness can be tested with data

Explainer

You have learned to run regressions and interpret correlations. But correlation is not causation — and more usefully, there is now a rigorous mathematical framework for specifying exactly when and why an observed correlation can and cannot be interpreted causally. That framework is the subject of this topic.

The foundation is the potential outcomes framework, developed by Donald Rubin and extended by Judea Pearl and others. For any unit i and a binary treatment T, we define two potential outcomes: Y_i(1), what would happen to unit i if assigned to treatment, and Y_i(0), what would happen if not. The individual causal effect is the difference Y_i(1) − Y_i(0). The problem is that we observe only one of these — whichever treatment state actually occurred. The other is a counterfactual: what would have happened in a world that did not occur. This is the fundamental problem of causal inference: it is a logical impossibility, not a data gap. No sample size, no matter how large, allows you to observe both potential outcomes for the same unit at the same time.

Randomization solves this problem in expectation. If treatment assignment is truly random, then the treated and untreated groups are identical in expectation across all observed and unobserved characteristics. The observed difference in outcomes is then an unbiased estimate of the average treatment effect. But randomization is often impossible — you cannot randomly assign people to smoke, grow up poor, or experience a policy implemented everywhere simultaneously. Most data is observational, and observational data requires you to defend causal identification through explicit design arguments.

Directed acyclic graphs (DAGs) are the tool for making those arguments transparent. In a DAG, variables are nodes and causal relationships are directed arrows. A confounder is a common cause of both treatment and outcome that creates a non-causal association between them; it must be blocked by conditioning. A mediator lies on the causal path from treatment to outcome; conditioning on it blocks part of the effect you are trying to measure. A collider is caused by both treatment and outcome; conditioning on it opens a spurious path that was previously blocked — making the estimate worse, not better. The backdoor criterion formalizes which sets of variables, when conditioned on, close all non-causal paths without opening new ones. Getting this right requires understanding the data-generating process, not just running variable-selection algorithms.

Three common misconceptions are worth internalizing directly. First, "correlation never implies causation" is too strong a rule — under the right design assumptions, observational correlations can be interpreted causally. The question is always whether those assumptions are defensible, not whether causation is categorically off the table. Second, "add more controls" is not always better — colliders are the clearest counterexample, and there are others. Third, "unconfoundedness can be tested" is wrong by construction: unconfoundedness is an assumption about unmeasured variables, and unmeasured variables cannot be used to test assumptions about themselves. What can be done is sensitivity analysis — testing how large an unmeasured confounder would need to be to overturn your conclusion. Honest causal work states assumptions clearly, defends them on substantive grounds, and reports what would falsify them.