← Graph View All Domains

A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Consistency of Estimators

Graduate Depth 110 in the knowledge graph ☐ I know this ☆ Set as goal

2topics build on this

723prerequisites beneath it

See this on the map →

Convergence in Probability Maximum Likelihood Estimation (Theory)→→Asymptotic Normality of MLEs

Core Idea

An estimator θ̂ₙ is consistent if θ̂ₙ converges in probability to θ as n → ∞. Consistency is a minimum requirement for reasonable estimators—as sample size grows, the estimator should approach the truth. Under regularity conditions, MLEs and method of moments estimators are consistent.

Explainer

An estimator is a rule for turning data into a guess about an unknown parameter. For that rule to be useful, it should at minimum do better with more data — intuitively, collecting millions of observations should get you very close to the truth. Consistency formalizes this requirement using the language of convergence in probability that you already know.

Recall that θ̂ₙ converges in probability to θ means: for any ε > 0, the probability P(|θ̂ₙ − θ| > ε) → 0 as n → ∞. In words, the chance that your estimate is far from the truth becomes negligible as the sample grows. This is weaker than almost-sure convergence (which says the estimate *will* eventually be close with probability 1 along every path), but it is the standard benchmark for estimators. A consistent estimator might produce a bad estimate for any specific sample — you could get unlucky — but the probability of a bad estimate vanishes as n grows.

The most important consistency results are for the sample mean and for MLEs. The sample mean X̄ₙ is consistent for the population mean μ by the Weak Law of Large Numbers, which is itself a direct consequence of convergence in probability. For MLEs, consistency follows from general regularity conditions (differentiability of the log-likelihood, identifiability of the model, compactness arguments) and is one reason MLEs are the default estimator in most settings. A useful sufficient condition: if an estimator is unbiased (E[θ̂ₙ] = θ) and its variance vanishes (Var(θ̂ₙ) → 0), then by Chebyshev's inequality it is consistent. But note that consistency does not require unbiasedness — a biased estimator can still be consistent if the bias shrinks to zero with n.

What consistency does *not* guarantee is equally important. Consistency is an asymptotic property — it says nothing about performance at any finite sample size. An estimator could be badly biased for small n yet perfectly consistent. And consistency gives no rate: it does not tell you how quickly the estimate approaches the truth. That information lives in asymptotic normality (the next topic), which tells you √n(θ̂ₙ − θ) converges in distribution to a normal, quantifying the speed of convergence and enabling confidence intervals. Think of consistency as the entry requirement for an estimator — necessary but far from sufficient for a complete understanding of its behavior.

Practice Questions 5 questions

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Angle Pairs: Complementary, Supplementary, and Vertical → Parallel Lines and Transversals → Corresponding Angles → Alternate Interior Angles → Triangle Angle Sum Theorem → Exterior Angle Theorem → Triangle Inequality Theorem → Similar Triangles: AA Similarity → Similar Triangles: SSS and SAS Similarity → Proportions in Similar Triangles → Right Triangle Trigonometry Introduction → Sine, Cosine, and Tangent Ratios → Trigonometric Ratios Review → Radian Measure → Converting Between Degrees and Radians → The Unit Circle → Graphing Sine and Cosine → Graphing Tangent and Reciprocal Trigonometric Functions → Derivatives of Trigonometric Functions → Antiderivatives → Indefinite Integrals → Basic Integration Rules → Riemann Sums → Definite Integral Definition → Fundamental Theorem of Calculus Part 1 → Fundamental Theorem of Calculus Part 2 → U-Substitution → Partial Fraction Decomposition for Integration → Improper Integrals - Convergence → Integral Test → P-Series → Comparison Test → Limit Comparison Test → Series Convergence Test Strategy → Power Series → Radius and Interval of Convergence → Taylor Series → Moment Generating Functions → Characteristic Functions → Convergence in Distribution → Stationary Distributions → Convergence of Markov Chains → Convergence in Probability → Almost Sure Convergence → Relationships Between Modes of Convergence → Weak Law of Large Numbers → Strong Law of Large Numbers → Central Limit Theorem (Rigorous via Characteristic Functions) → Maximum Likelihood Estimation (Theory) → Consistency of Estimators

Longest path: 111 steps · 723 total prerequisite topics

Prerequisites (2)

Convergence in Probabilityhard Maximum Likelihood Estimation (Theory)soft

Leads To (1)

Asymptotic Normality of MLEshard