← Graph View All Domains

A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Moment Generating Functions

Research Depth 98 in the knowledge graph ☐ I know this ☆ Set as goal

111topics build on this

602prerequisites beneath it

See this on the map →

Variance and Higher Moments (Rigorous)Probability Mass Functions +1 more→→Branching Processes Characteristic Functions

Core Idea

The moment generating function (MGF) is M(t) = E[e^tX], defined for t in some neighborhood of 0. If M(t) exists, all moments can be recovered: E[Xᵏ] = M^(k)(0). The MGF uniquely determines the distribution, and convergence of MGFs implies convergence of distributions.

Explainer

The moment generating function is an encoding trick: it packages all the moments of a distribution into a single function of one variable t. The definition M(t) = E[e^tX] looks mysterious at first, but the connection to moments becomes transparent through the Taylor series you already know. Recall that e^tX = 1 + tX + t²X²/2! + t³X³/3! + ... Taking expectations term by term: M(t) = 1 + t·E[X] + t²·E[X²]/2! + t³·E[X³]/3! + ... This is the ordinary power series for M(t) with coefficients E[Xᵏ]/k!. Differentiating k times and evaluating at t = 0 plucks out E[Xᵏ], which is exactly why the kth derivative at zero gives the kth moment: M^(k)(0) = E[Xᵏ].

This makes computing variance and higher moments from prerequisites much easier for well-known distributions. For the exponential distribution with rate λ, M(t) = λ/(λ − t) for t < λ. Differentiating: M'(t) = λ/(λ − t)², so E[X] = M'(0) = 1/λ. Differentiating again: M''(t) = 2λ/(λ − t)³, giving E[X²] = 2/λ² and Var(X) = 2/λ² − (1/λ)² = 1/λ². One function generates everything. For the normal distribution N(μ, σ²), the MGF is M(t) = exp(μt + σ²t²/2) — a compact encoding that makes normal calculations tractable.

The deeper power of the MGF is that it uniquely determines the distribution: two distributions with the same MGF (when it exists on an open interval around 0) are identical. This is analogous to how a function is determined by all its derivatives at a point (when the Taylor series converges). This uniqueness property is the key to proving limit theorems: if you can show that the MGF of a sequence of distributions converges to M(t) = exp(μt + σ²t²/2) (the normal MGF), then the distributions themselves converge to normal. This is one route to the Central Limit Theorem — show that the MGF of the standardized sum converges pointwise to the standard normal MGF.

One important caveat: the MGF may fail to exist (the expectation E[e^tX] may be infinite) for heavy-tailed distributions like the Cauchy. This is why the characteristic function (replacing t with it, using complex exponentials) is more generally applicable and is the preferred tool in rigorous probability theory — the characteristic function always exists because |e^itX| = 1. Think of the MGF as the practical, computable tool for distributions with finite moments, and the characteristic function as its more powerful but less elementary extension.

Practice Questions 5 questions

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Angle Pairs: Complementary, Supplementary, and Vertical → Parallel Lines and Transversals → Corresponding Angles → Alternate Interior Angles → Triangle Angle Sum Theorem → Exterior Angle Theorem → Triangle Inequality Theorem → Similar Triangles: AA Similarity → Similar Triangles: SSS and SAS Similarity → Proportions in Similar Triangles → Right Triangle Trigonometry Introduction → Sine, Cosine, and Tangent Ratios → Trigonometric Ratios Review → Radian Measure → Converting Between Degrees and Radians → The Unit Circle → Graphing Sine and Cosine → Graphing Tangent and Reciprocal Trigonometric Functions → Derivatives of Trigonometric Functions → Antiderivatives → Indefinite Integrals → Basic Integration Rules → Riemann Sums → Definite Integral Definition → Fundamental Theorem of Calculus Part 1 → Fundamental Theorem of Calculus Part 2 → U-Substitution → Partial Fraction Decomposition for Integration → Improper Integrals - Convergence → Integral Test → P-Series → Comparison Test → Limit Comparison Test → Series Convergence Test Strategy → Power Series → Radius and Interval of Convergence → Taylor Series → Moment Generating Functions

Longest path: 99 steps · 602 total prerequisite topics

Prerequisites (3)

Variance and Higher Moments (Rigorous)hard Taylor Seriessoft Probability Mass Functionssoft

Leads To (2)

Branching Processeshard Characteristic Functionssoft