Weak Law of Large Numbers

Graduate Depth 89 in the knowledge graph I know this Set as goal
Unlocks 40 downstream topics
law-of-large-numbers limit-theorems probability

Core Idea

If {Xₙ} are i.i.d. random variables with finite mean μ, then Sₙ/n = (X₁ + ... + Xₙ)/n converges in probability to μ. The key assumption is finite variance (or more generally, applying Chebyshev's inequality). The weak LLN guarantees that sample means stabilize around the true mean, justifying empirical estimation.

Explainer

The Weak Law of Large Numbers formalizes the intuition behind empirical estimation: average many independent observations of the same random quantity, and the average should be close to the true mean. You know from prerequisites what "close" means rigorously: Sₙ/n → μ in probability means that for any ε > 0, P(|Sₙ/n − μ| > ε) → 0 as n → ∞. The WLLN says exactly this happens for i.i.d. sequences with finite mean μ.

The standard proof under the additional assumption of finite variance σ² is a direct application of Chebyshev's inequality, which you know from prerequisites: P(|Y − E[Y]| > ε) ≤ Var(Y)/ε². Apply it to Y = Sₙ/n. The expected value of Sₙ/n is μ — since E[Xᵢ] = μ and expectation is linear, E[Sₙ/n] = μ. The variance of Sₙ/n is σ²/n — since the Xᵢ are independent (which allows variance to add: Var(Sₙ) = nσ²), we get Var(Sₙ/n) = σ²/n. Chebyshev then gives P(|Sₙ/n − μ| > ε) ≤ σ²/(nε²) → 0. The key roles are clear: independence (via sigma-algebra independence from your prerequisites) makes variances additive, and Chebyshev converts that variance bound into a probability bound.

The convergence is not pointwise: the sample averages don't necessarily converge at every individual outcome ω. They could fluctuate forever along particular sample paths. What goes to zero is the probability of being far from μ — which is exactly what convergence in probability captures. The Strong LLN (which builds on this result) strengthens the conclusion to almost sure convergence: P(Sₙ/n → μ) = 1, meaning the averages converge at every outcome except possibly a set of probability zero. The weak version is easier to prove and suffices for most statistical applications: it rigorously justifies using a sample mean to estimate a population mean.

The WLLN is also the theoretical backbone of Monte Carlo methods. To estimate E[g(X)] for some function g, generate i.i.d. samples X₁, ..., Xₙ and compute the average (g(X₁) + ··· + g(Xₙ)) / n. The WLLN guarantees this converges to the true expectation. The convergence rate is σ²/n, where σ² = Var(g(X)) — crucially, this depends on the variance of the function, not on the dimension of the problem. For high-dimensional integrals where deterministic quadrature rules scale exponentially in dimension, this dimension-independence makes Monte Carlo the method of choice.

Practice Questions 5 questions

Prerequisite Chain

Counting to 10Counting to 20Understanding ZeroThe Number ZeroCounting to FiveOne-to-One CorrespondenceCombining Small Groups Within 5Addition Within 10Addition Within 20Two-Digit Addition Without RegroupingTwo-Digit Addition with RegroupingAddition Within 100Repeated Addition as MultiplicationMultiplication Facts Within 100Division as Equal SharingDivision as Grouping (Measurement Division)Division: Grouping (Repeated Subtraction) ModelDivision: Fair Sharing ModelDivision as Equal SharingDivision as GroupingBasic Division FactsDivision Facts Within 100Two-Digit by One-Digit DivisionDivision with RemaindersRemainders and Quotients in DivisionDivision Word ProblemsIntroduction to Long DivisionFactors and MultiplesPrime and Composite NumbersEquivalent FractionsRelating Fractions and DecimalsDecimal Place ValueReading and Writing DecimalsComparing and Ordering DecimalsAdding and Subtracting DecimalsMultiplying DecimalsDividing DecimalsDividing FractionsMixed Number ArithmeticOrder of OperationsInteger Order of OperationsVariable ExpressionsCombining Like TermsOne-Step EquationsTwo-Step EquationsSolving Multi-Step EquationsEquations with Variables on Both SidesAngle Pairs: Complementary, Supplementary, and VerticalParallel Lines and TransversalsCorresponding AnglesAlternate Interior AnglesTriangle Angle Sum TheoremExterior Angle TheoremTriangle Inequality TheoremSimilar Triangles: AA SimilaritySimilar Triangles: SSS and SAS SimilarityProportions in Similar TrianglesRight Triangle Trigonometry IntroductionTrigonometric Ratios ReviewRadian MeasureConverting Between Degrees and RadiansThe Unit CircleGraphing Sine and CosineGraphing Tangent and Reciprocal Trigonometric FunctionsDerivatives of Trigonometric FunctionsAntiderivativesIndefinite IntegralsBasic Integration RulesRiemann SumsDefinite Integral DefinitionFundamental Theorem of Calculus Part 1Fundamental Theorem of Calculus Part 2U-SubstitutionPartial Fraction Decomposition for IntegrationImproper Integrals - ConvergenceIntegral TestP-SeriesComparison TestLimit Comparison TestAbsolute vs. Conditional ConvergencePower SeriesTaylor PolynomialsTaylor SeriesMoment Generating FunctionsCharacteristic FunctionsConvergence in DistributionStationary DistributionsConvergence of Markov ChainsConvergence in ProbabilityWeak Law of Large Numbers

Longest path: 90 steps · 499 total prerequisite topics

Prerequisites (3)

Leads To (2)