Survival Analysis: Kaplan-Meier Estimation

Graduate Depth 186 in the knowledge graph I know this Set as goal
Unlocks 10 downstream topics
survival Kaplan-Meier censoring survival-function time-to-event

Core Idea

Survival analysis studies time-to-event outcomes — time to death, disease recurrence, or hospital readmission. The defining challenge is censoring: some subjects have not yet experienced the event when observation ends, so their true event times are unknown but known to exceed the censored time. The Kaplan-Meier estimator is a nonparametric method that estimates the survival function S(t) — the probability of surviving beyond time t — by computing the cumulative product of conditional survival probabilities at each observed event time. It produces the characteristic step-function survival curve that declines at each event, properly accounting for censored observations by removing them from the risk set without treating them as events.

Explainer

Standard statistical methods assume you observe the outcome for every subject, but time-to-event data violates this assumption. In a 5-year clinical trial, some patients die (the event of interest), some are still alive when the trial ends (administratively censored), and some are lost to follow-up before the trial ends (right-censored). You know these censored patients survived at least until they were last observed, but you do not know their true event time. Ignoring censored observations — either by excluding them or by treating them as events — introduces serious bias. Survival analysis methods exist precisely to handle this incomplete information.

The Kaplan-Meier estimator constructs the survival function S(t) — the probability of surviving beyond time t — without assuming any particular distributional form. At each observed event time, it computes the conditional probability of surviving past that time given survival up to it: (number at risk - number of events) / number at risk. The cumulative survival probability is the product of all these conditional probabilities up to time t. The resulting step function starts at S(0) = 1 and decreases at each event time. Censored observations reduce the risk set (the denominator) at the censoring time but do not cause a step down — they contribute information about survival up to the moment they were last observed.

Reading a Kaplan-Meier curve is a core clinical skill. The median survival time is the time at which the curve crosses 0.50 — the time by which half the subjects have experienced the event. If the curve never reaches 0.50, the median is undefined (more than half the subjects survived the entire observation period). Confidence intervals for the survival function at any time point can be computed using Greenwood's formula, and these widen over time as the number at risk decreases. Tick marks on the curve indicate censoring events, showing where observations were lost. A curve with heavy censoring late in follow-up has wide uncertainty, even if it appears to plateau.

The critical assumption underlying the KM estimator is non-informative censoring: the reason a subject was censored must be unrelated to their prognosis. If patients who are getting sicker preferentially drop out (informative censoring), the remaining subjects are healthier than the full cohort, and the survival curve will be optimistically biased. This assumption cannot be tested from the data alone — it requires understanding why subjects were censored. The Kaplan-Meier estimator describes the survival experience of a single group; to compare survival between groups, you need the log-rank test, which is the next topic in this sequence.

Practice Questions 4 questions

Prerequisite Chain

Counting to 10Counting to 20Understanding ZeroThe Number ZeroCounting to FiveOne-to-One CorrespondenceCombining Small Groups Within 5Addition Within 10Addition Within 20Two-Digit Addition Without RegroupingTwo-Digit Addition with RegroupingAddition Within 100Repeated Addition as MultiplicationMultiplication Facts Within 100Division as Equal SharingDivision as Grouping (Measurement Division)Division: Grouping (Repeated Subtraction) ModelDivision: Fair Sharing ModelDivision as Equal SharingDivision as GroupingBasic Division FactsDivision Facts Within 100Two-Digit by One-Digit DivisionDivision with RemaindersRemainders and Quotients in DivisionDivision Word ProblemsIntroduction to Long DivisionFactors and MultiplesPrime and Composite NumbersEquivalent FractionsRelating Fractions and DecimalsDecimal Place ValueReading and Writing DecimalsComparing and Ordering DecimalsAdding and Subtracting DecimalsMultiplying DecimalsDividing DecimalsDividing FractionsMixed Number ArithmeticOrder of OperationsInteger Order of OperationsVariable ExpressionsCombining Like TermsOne-Step EquationsTwo-Step EquationsSolving Multi-Step EquationsEquations with Variables on Both SidesAngle Pairs: Complementary, Supplementary, and VerticalParallel Lines and TransversalsCorresponding AnglesAlternate Interior AnglesTriangle Angle Sum TheoremExterior Angle TheoremTriangle Inequality TheoremSimilar Triangles: AA SimilaritySimilar Triangles: SSS and SAS SimilarityProportions in Similar TrianglesRight Triangle Trigonometry IntroductionTrigonometric Ratios ReviewRadian MeasureConverting Between Degrees and RadiansThe Unit CircleGraphing Sine and CosineGraphing Tangent and Reciprocal Trigonometric FunctionsDerivatives of Trigonometric FunctionsAntiderivativesIterated Integrals and Fubini's TheoremDouble Integrals in Cartesian CoordinatesDouble Integrals over Rectangular RegionsDouble Integrals in Polar CoordinatesDouble Integrals: Definition and SetupIterated Integrals and Fubini's TheoremDouble Integrals over Rectangular RegionsDouble Integrals over General RegionsApplications of Double Integrals: Area, Mass, and MomentsTriple Integrals in Cartesian CoordinatesTriple Integrals in Cylindrical and Spherical CoordinatesChange of Variables and the Jacobian DeterminantApplications of Triple Integrals: Volume and MassVector Fields and Their RepresentationsLine Integrals of Vector FieldsGreen's TheoremSurface Integrals and Flux of Vector FieldsSurface Integrals and Flux of Vector FieldsDivergence Theorem: Flux and OutflowDivergence TheoremElectric FluxGauss's LawConductors in Electrostatic EquilibriumCapacitance and CapacitorsDielectricsDielectric Constant and Relative PermittivityElectric Field Inside Dielectric MaterialsDielectric Materials and PolarizationDielectric Susceptibility and PermittivityEnergy Density in Electric FieldsElectric Current and Current DensityElectrical Resistance and ResistivityOhm's Law and Circuit ElementsElectromotive Force (EMF) and BatteriesKirchhoff's Circuit Laws: Voltage and CurrentDC Circuit Network Analysis MethodsTransient Response in RC CircuitsRC CircuitsLC and RLC CircuitsAC Circuits: FundamentalsImpedance and ReactanceAC Power and ResonanceElectromagnetic WavesThe Electromagnetic SpectrumBlackbody Radiation and Planck's LawPhotoelectric EffectThe Photon: Light as QuantaCompton ScatteringWave-Particle Dualityde Broglie WavelengthHeisenberg Uncertainty PrincipleWavefunction and the Born RuleThe Schrödinger EquationState Vectors and WavefunctionsQuantum SuperpositionQuantum EntanglementBell Theorem and Bell InequalitiesPostulates of Quantum MechanicsScattering TheoryIntroduction to Scattering TheoryPartial Wave Analysis in ScatteringSpin Angular MomentumElectron Spin and Intrinsic Magnetic MomentStern-Gerlach Experiment: Spin Quantization and MeasurementElectron Diffraction and Matter Wave PropertiesDavisson-Germer Experiment: Crystal Diffraction of ElectronsElectron Diffraction and Matter Wave InterferenceWavefunctions and Probability Density InterpretationQuantum Superposition and Linear Combinations of StatesQuantum Operators and ObservablesCanonical Commutation Relations and UncertaintyHeisenberg Uncertainty Principle and Measurement LimitsTime-Independent Schrödinger Equation and EigenvaluesHydrogen Atom in Quantum MechanicsSpectral Lines and Energy TransitionsSelection Rules for Atomic TransitionsLS and jj Coupling Schemes in Multi-Electron AtomsPauli Exclusion Principle and Antisymmetric WavefunctionsElectron Configuration and the Aufbau PrincipleThe Periodic Table and Atomic Electronic StructureThe Periodic TableElectron ConfigurationPeriodic TrendsIonization EnergyIonic BondingLewis StructuresResonance Structures and Delocalized ElectronsResonance and Formal ChargeMolecular Polarity and Dipole MomentsIntermolecular ForcesStates of Matter and Phase Changes: Melting, Boiling, and SublimationGas Laws and the Ideal Gas EquationGas Stoichiometry and Volume-Volume CalculationsThermochemistry and EnthalpyHeat Capacity and CalorimetryEntropy and Molecular DisorderSpontaneity and ΔGEntropy and Gibbs Free EnergyChemical EquilibriumAcid-Base ChemistryOrganic Reaction Mechanisms and Arrow PushingElectrophilic Addition to AlkenesAromaticity and BenzeneDNA StructureCentral Dogma of Molecular BiologyThe Genetic CodeDNA MutationsDNA Repair MechanismsCell Cycle Checkpoints and Cancer PreventionMitotic Spindle Checkpoint and Chromosome SegregationKinetochore Structure and FunctionMitochondria: Structure and FunctionCellular Respiration OverviewBacterial Metabolism OverviewAntibiotic Resistance MechanismsInfectious Disease EpidemiologyFoundations of EpidemiologyMeasuring Disease Frequency: Incidence and PrevalenceEpidemiologic Study DesignsStudy Design in BiostatisticsSurvival Analysis: Kaplan-Meier Estimation

Longest path: 187 steps · 930 total prerequisite topics

Prerequisites (3)

Leads To (3)