Causal Inference Methods in Biostatistics

Research Depth 189 in the knowledge graph I know this Set as goal
Unlocks 4 downstream topics
causal-inference counterfactual potential-outcomes confounding DAG SUTVA

Core Idea

Causal inference in biostatistics formalizes the question "does X cause Y?" using the potential outcomes framework (Rubin causal model): each subject has a potential outcome under treatment Y(1) and under control Y(0), but only one is observed — the fundamental problem of causal inference. The average treatment effect (ATE) is E[Y(1) - Y(0)]. In randomized trials, randomization ensures that observed treatment groups estimate potential outcomes without bias. In observational studies, confounding (common causes of treatment and outcome) prevents direct causal interpretation. Causal inference methods — propensity scores, instrumental variables, difference-in-differences, regression discontinuity — each address confounding under different assumptions. Directed acyclic graphs (DAGs) provide a visual language for encoding causal assumptions and identifying what must be adjusted for to estimate causal effects.

Explainer

The goal of causal inference is to determine whether a treatment or exposure causes a change in an outcome — not merely whether the two are associated. From your study of study design, you know that randomized experiments provide the strongest evidence for causation. The potential outcomes framework explains why: each subject has two potential outcomes, Y(1) under treatment and Y(0) under control. The causal effect for that individual is Y(1) - Y(0). The "fundamental problem of causal inference" is that we observe only one of these — a person either receives the treatment or does not, never both simultaneously.

Randomization solves this at the population level by ensuring that the group of treated subjects is a representative sample of the population's Y(1) values, and the control group samples Y(0). The difference in group means estimates the Average Treatment Effect (ATE): E[Y(1)] - E[Y(0)]. This works because random assignment makes treatment independent of all patient characteristics — measured and unmeasured — eliminating confounding.

In observational studies, treatment is not randomly assigned — patients who receive a treatment may differ systematically from those who do not. Confounders (variables that cause both treatment and outcome) create spurious associations. Directed acyclic graphs (DAGs) provide a rigorous visual language for representing causal relationships and identifying what must be controlled for. The backdoor criterion states that the causal effect of X on Y is identified if you condition on a set of variables that blocks all backdoor paths (non-causal paths from X to Y through confounders). DAGs also reveal what you should not condition on: colliders (variables caused by both treatment and outcome), which introduce bias when conditioned upon, and mediators (variables on the causal path from treatment to outcome), which absorb the very effect you are trying to estimate.

The various causal inference methods — propensity scores, instrumental variables, difference-in-differences, regression discontinuity — each address confounding under different assumptions about which variables are observed and how treatment assignment works. No method eliminates the need for assumptions; each makes different untestable assumptions transparent. Propensity scores assume no unmeasured confounders. Instrumental variables assume the existence of a variable that affects treatment but not outcome directly. Difference-in-differences assumes parallel trends. The choice of method depends on the data structure and the plausibility of its specific assumptions.

Practice Questions 4 questions

Prerequisite Chain

Counting to 10Counting to 20Understanding ZeroThe Number ZeroCounting to FiveOne-to-One CorrespondenceCombining Small Groups Within 5Addition Within 10Addition Within 20Two-Digit Addition Without RegroupingTwo-Digit Addition with RegroupingAddition Within 100Repeated Addition as MultiplicationMultiplication Facts Within 100Division as Equal SharingDivision as Grouping (Measurement Division)Division: Grouping (Repeated Subtraction) ModelDivision: Fair Sharing ModelDivision as Equal SharingDivision as GroupingBasic Division FactsDivision Facts Within 100Two-Digit by One-Digit DivisionDivision with RemaindersRemainders and Quotients in DivisionDivision Word ProblemsIntroduction to Long DivisionFactors and MultiplesPrime and Composite NumbersEquivalent FractionsRelating Fractions and DecimalsDecimal Place ValueReading and Writing DecimalsComparing and Ordering DecimalsAdding and Subtracting DecimalsMultiplying DecimalsDividing DecimalsDividing FractionsMixed Number ArithmeticOrder of OperationsInteger Order of OperationsVariable ExpressionsCombining Like TermsOne-Step EquationsTwo-Step EquationsSolving Multi-Step EquationsEquations with Variables on Both SidesAngle Pairs: Complementary, Supplementary, and VerticalParallel Lines and TransversalsCorresponding AnglesAlternate Interior AnglesTriangle Angle Sum TheoremExterior Angle TheoremTriangle Inequality TheoremSimilar Triangles: AA SimilaritySimilar Triangles: SSS and SAS SimilarityProportions in Similar TrianglesRight Triangle Trigonometry IntroductionTrigonometric Ratios ReviewRadian MeasureConverting Between Degrees and RadiansThe Unit CircleGraphing Sine and CosineGraphing Tangent and Reciprocal Trigonometric FunctionsDerivatives of Trigonometric FunctionsAntiderivativesIterated Integrals and Fubini's TheoremDouble Integrals in Cartesian CoordinatesDouble Integrals over Rectangular RegionsDouble Integrals in Polar CoordinatesDouble Integrals: Definition and SetupIterated Integrals and Fubini's TheoremDouble Integrals over Rectangular RegionsDouble Integrals over General RegionsApplications of Double Integrals: Area, Mass, and MomentsTriple Integrals in Cartesian CoordinatesTriple Integrals in Cylindrical and Spherical CoordinatesChange of Variables and the Jacobian DeterminantApplications of Triple Integrals: Volume and MassVector Fields and Their RepresentationsLine Integrals of Vector FieldsGreen's TheoremSurface Integrals and Flux of Vector FieldsSurface Integrals and Flux of Vector FieldsDivergence Theorem: Flux and OutflowDivergence TheoremElectric FluxGauss's LawConductors in Electrostatic EquilibriumCapacitance and CapacitorsDielectricsDielectric Constant and Relative PermittivityElectric Field Inside Dielectric MaterialsDielectric Materials and PolarizationDielectric Susceptibility and PermittivityEnergy Density in Electric FieldsElectric Current and Current DensityElectrical Resistance and ResistivityOhm's Law and Circuit ElementsElectromotive Force (EMF) and BatteriesKirchhoff's Circuit Laws: Voltage and CurrentDC Circuit Network Analysis MethodsTransient Response in RC CircuitsRC CircuitsLC and RLC CircuitsAC Circuits: FundamentalsImpedance and ReactanceAC Power and ResonanceElectromagnetic WavesThe Electromagnetic SpectrumBlackbody Radiation and Planck's LawPhotoelectric EffectThe Photon: Light as QuantaCompton ScatteringWave-Particle Dualityde Broglie WavelengthHeisenberg Uncertainty PrincipleWavefunction and the Born RuleThe Schrödinger EquationState Vectors and WavefunctionsQuantum SuperpositionQuantum EntanglementBell Theorem and Bell InequalitiesPostulates of Quantum MechanicsScattering TheoryIntroduction to Scattering TheoryPartial Wave Analysis in ScatteringSpin Angular MomentumElectron Spin and Intrinsic Magnetic MomentStern-Gerlach Experiment: Spin Quantization and MeasurementElectron Diffraction and Matter Wave PropertiesDavisson-Germer Experiment: Crystal Diffraction of ElectronsElectron Diffraction and Matter Wave InterferenceWavefunctions and Probability Density InterpretationQuantum Superposition and Linear Combinations of StatesQuantum Operators and ObservablesCanonical Commutation Relations and UncertaintyHeisenberg Uncertainty Principle and Measurement LimitsTime-Independent Schrödinger Equation and EigenvaluesHydrogen Atom in Quantum MechanicsSpectral Lines and Energy TransitionsSelection Rules for Atomic TransitionsLS and jj Coupling Schemes in Multi-Electron AtomsPauli Exclusion Principle and Antisymmetric WavefunctionsElectron Configuration and the Aufbau PrincipleThe Periodic Table and Atomic Electronic StructureThe Periodic TableElectron ConfigurationPeriodic TrendsIonization EnergyIonic BondingLewis StructuresResonance Structures and Delocalized ElectronsResonance and Formal ChargeMolecular Polarity and Dipole MomentsIntermolecular ForcesStates of Matter and Phase Changes: Melting, Boiling, and SublimationGas Laws and the Ideal Gas EquationGas Stoichiometry and Volume-Volume CalculationsThermochemistry and EnthalpyHeat Capacity and CalorimetryEntropy and Molecular DisorderSpontaneity and ΔGEntropy and Gibbs Free EnergyChemical EquilibriumAcid-Base ChemistryOrganic Reaction Mechanisms and Arrow PushingElectrophilic Addition to AlkenesAromaticity and BenzeneDNA StructureCentral Dogma of Molecular BiologyThe Genetic CodeDNA MutationsDNA Repair MechanismsCell Cycle Checkpoints and Cancer PreventionMitotic Spindle Checkpoint and Chromosome SegregationKinetochore Structure and FunctionMitochondria: Structure and FunctionCellular Respiration OverviewBacterial Metabolism OverviewAntibiotic Resistance MechanismsInfectious Disease EpidemiologyFoundations of EpidemiologyMeasuring Disease Frequency: Incidence and PrevalenceEpidemiologic Study DesignsStudy Design in BiostatisticsSurvival Analysis: Kaplan-Meier EstimationLog-Rank Test for Survival ComparisonCox Proportional Hazards ModelCausal Inference Methods in Biostatistics

Longest path: 190 steps · 942 total prerequisite topics

Prerequisites (3)

Leads To (3)