A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Cramer-Rao Lower Bound

Research Depth 92 in the knowledge graph

4 topics build on this

548 prerequisites beneath it


Core Idea

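For any unbiased estimator T of θ, Var(T) ≥ 1/I(θ) under standard regularity conditions. The bound is not always attainable: equality holds iff T − θ is proportional to the score. An unbiased estimator that attains the bound is necessarily the uniformly minimum variance unbiased estimator (UMVUE), but a UMVUE need not attain it. The CRLB shows that the inverse Fisher information lower-bounds the variance of unbiased estimators, so Fisher information caps the achievable precision. MLEs are asymptotically efficient, achieving the CRLB in the limit.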

Explainer

From Fisher information, you know that I(θ) = E[(∂/∂θ log f(X; θ))²] measures how sharply the likelihood peaks around the true parameter: high Fisher information means the data is highly informative about θ, and the log-likelihood is tightly curved. From variance, you know Var(T) measures how spread out an estimator T is around its mean. The Cramér-Rao Lower Bound connects these two: it says that no unbiased estimator can have variance smaller than 1/I(θ). The more information the data carries, the lower this floor — and thus the more precisely θ can be estimated.
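To make the definition concrete, here is a minimal sketch that evaluates I(θ) = E[(∂/∂θ log f(X; θ))²] exactly for a single Bernoulli(p) observation and compares it with the closed form 1/(p(1 − p)). The Bernoulli model and the function name `bernoulli_fisher_information` are illustrative choices, not part of the text above.

```python
def bernoulli_fisher_information(p: float) -> float:
    """Fisher information of one Bernoulli(p) draw, computed as E[score^2]."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    total = 0.0
    for x in (0, 1):
        prob = p if x == 1 else 1.0 - p
        # Score: d/dp log f(x; p), where log f = x*log(p) + (1 - x)*log(1 - p)
        score = x / p - (1 - x) / (1.0 - p)
        total += prob * score ** 2
    return total


for p in (0.5, 0.1):
    closed_form = 1.0 / (p * (1.0 - p))
    print(f"p={p}: E[score^2]={bernoulli_fisher_information(p):.6f}, "
          f"1/(p(1-p))={closed_form:.6f}")
```

The information is smallest at p = 0.5 and grows as p approaches 0 or 1, so the variance floor 1/I(p) = p(1 − p) for one observation is largest at p = 0.5.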

The proof uses the Cauchy-Schwarz inequality. For any unbiased estimator T(X), the condition E[T(X)] = θ can be differentiated with respect to θ (under regularity conditions that allow differentiation under the integral sign) to give Cov(T, S) = 1, where S = ∂/∂θ log f(X; θ) is the score function. Because E[S] = 0, Var(S) = E[S²] = I(θ). Since Cov(T, S)² ≤ Var(T) · Var(S) = Var(T) · I(θ), substituting Cov(T, S) = 1 gives 1 ≤ Var(T) · I(θ), which is exactly Var(T) ≥ 1/I(θ). Unbiasedness (E[T] = θ) is what forces Cov(T, S) = 1; the bound is attained only when the Cauchy-Schwarz inequality holds with equality.
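The differentiation step can be written out as follows. This is a sketch for a continuous density f(x; θ), assuming derivatives and integrals may be interchanged; it uses the `align*` environment from amsmath.

```latex
\begin{align*}
\mathbb{E}[S]
  &= \int \frac{\partial}{\partial\theta}\log f(x;\theta)\, f(x;\theta)\,dx
   = \frac{\partial}{\partial\theta}\int f(x;\theta)\,dx = 0, \\
\mathbb{E}[T S]
  &= \int T(x)\,\frac{\partial}{\partial\theta} f(x;\theta)\,dx
   = \frac{\partial}{\partial\theta}\int T(x)\, f(x;\theta)\,dx
   = \frac{\partial}{\partial\theta}\,\theta = 1, \\
\operatorname{Cov}(T,S)
  &= \mathbb{E}[T S] - \mathbb{E}[T]\,\mathbb{E}[S] = 1, \\
1 = \operatorname{Cov}(T,S)^2
  &\le \operatorname{Var}(T)\,\operatorname{Var}(S)
   = \operatorname{Var}(T)\, I(\theta).
\end{align*}
```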

Equality Var(T) = 1/I(θ) holds if and only if T − θ is proportional to the score, i.e., T − θ = c(θ) · S for some function c(θ). This can happen only in one-parameter exponential families, and only when θ is (an affine function of) the mean of the sufficient statistic, in which case the suitably scaled sufficient statistic attains the bound. For example, the sample mean X̄ from a normal distribution N(μ, σ²) has Var(X̄) = σ²/n, and the Fisher information about μ from n observations is n/σ², so Var(X̄) = 1/I(μ) exactly and X̄ is an efficient estimator.
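For the normal example, the equality condition T − θ = c(θ) · S can be checked directly. The sketch below assumes σ² is known and X₁, …, Xₙ are i.i.d. N(μ, σ²).

```latex
\begin{align*}
\log f(x_i;\mu) &= -\frac{(x_i-\mu)^2}{2\sigma^2} - \tfrac{1}{2}\log(2\pi\sigma^2), \\
S &= \sum_{i=1}^{n} \frac{\partial}{\partial\mu}\log f(X_i;\mu)
   = \sum_{i=1}^{n} \frac{X_i-\mu}{\sigma^2}
   = \frac{n(\bar{X}-\mu)}{\sigma^2}, \\
I(\mu) &= \operatorname{Var}(S)
   = \frac{n^2}{\sigma^4}\cdot\frac{\sigma^2}{n}
   = \frac{n}{\sigma^2}, \\
\bar{X}-\mu &= \frac{\sigma^2}{n}\,S
   \quad\Longrightarrow\quad c(\mu) = \frac{\sigma^2}{n} = \frac{1}{I(\mu)}.
\end{align*}
```

Here X̄ − μ is an exact multiple of the score, which is why the Cauchy-Schwarz step holds with equality.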

For more complex models, the CRLB defines a benchmark for efficiency: the efficiency of an unbiased estimator T is the ratio (1/I(θ)) / Var(T), which lies in (0, 1]. Maximum likelihood estimators are generally not exactly efficient (or even unbiased) in finite samples, but under regularity conditions they are asymptotically efficient: as n → ∞, √n(T_MLE − θ) converges in distribution to N(0, 1/I₁(θ)), where I₁(θ) is the Fisher information of a single observation. The MLE's variance is therefore approximately 1/(n · I₁(θ)) = 1/I(θ) for large n, which is the CRLB for n i.i.d. observations. This asymptotic efficiency is a key justification for using MLEs in practice.
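The efficiency ratio can be estimated by simulation. The sketch below compares two unbiased estimators of μ for N(μ, σ²) data: the sample mean (which is also the MLE) and the sample median. The median is not discussed above and is brought in here only as a contrast; the function name, sample size, replication count, and seed are all illustrative assumptions.

```python
import random
import statistics


def estimator_variances(n, reps, mu=0.0, sigma=1.0, seed=0):
    """Monte Carlo variances of the sample mean and sample median."""
    rng = random.Random(seed)
    means, medians = [], []
    for _ in range(reps):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        means.append(statistics.fmean(sample))
        medians.append(statistics.median(sample))
    return statistics.variance(means), statistics.variance(medians)


n, reps, sigma = 101, 4000, 1.0
crlb = sigma ** 2 / n  # 1 / I(mu), with I(mu) = n / sigma^2
var_mean, var_median = estimator_variances(n, reps, sigma=sigma)

# Efficiency = CRLB / Var(T); these are noisy Monte Carlo estimates.
eff_mean = crlb / var_mean
eff_median = crlb / var_median
print(f"CRLB              = {crlb:.5f}")
print(f"mean:   variance  = {var_mean:.5f}, efficiency ~ {eff_mean:.3f}")
print(f"median: variance  = {var_median:.5f}, efficiency ~ {eff_median:.3f}")
```

The estimated efficiency of the mean should come out close to 1 (it can land slightly above 1 because of simulation noise), while the median's should be noticeably lower; for normal data its large-sample efficiency is 2/π ≈ 0.64.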


Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Angle Pairs: Complementary, Supplementary, and Vertical → Parallel Lines and Transversals → Corresponding Angles → Alternate Interior Angles → Triangle Angle Sum Theorem → Exterior Angle Theorem → 
Triangle Inequality Theorem → Similar Triangles: AA Similarity → Similar Triangles: SSS and SAS Similarity → Proportions in Similar Triangles → Right Triangle Trigonometry Introduction → Sine, Cosine, and Tangent Ratios → Trigonometric Ratios Review → Radian Measure → Converting Between Degrees and Radians → The Unit Circle → Graphing Sine and Cosine → Graphing Tangent and Reciprocal Trigonometric Functions → Derivatives of Trigonometric Functions → Antiderivatives → Indefinite Integrals → Basic Integration Rules → Riemann Sums → Definite Integral Definition → Probability Density Functions and Continuous Distributions → Cumulative Distribution Functions → Continuous Random Variables → Probability Density Functions → Expected Value → Expectation (Measure-Theoretic) → Fisher Information → Cramer-Rao Lower Bound

Longest path: 93 steps · 548 total prerequisite topics

Prerequisites (2)

Fisher Information (hard); Variance and Higher Moments (Rigorous) (hard)

Leads To (2)

Asymptotic Normality of MLEs (soft); Uniformly Minimum Variance Unbiased Estimation (UMVUE) (hard)