A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Binomial Distribution

College Depth 94 in the knowledge graph ☐ I know this ☆ Set as goal

321topics build on this

482prerequisites beneath it

Discrete Random Variables Independence and the Multiplication Rule +4 more→→Binomial Option Pricing and Replicating Portfolios Confidence Intervals for Proportions +3 more

Core Idea

The binomial distribution models the number of successes in n independent Bernoulli trials, each with success probability p. Its PMF is P(X = k) = C(n,k) × p^k × (1-p)^n-k, where C(n,k) is the binomial coefficient. The mean is np and variance is np(1-p). Binomial distributions arise whenever we count successes in a fixed number of identical, independent trials.

How It's Best Learned

Derive the binomial formula from first principles using counting and independence. Explore how the distribution changes with n and p using simulation or calculation.

Common Misconceptions

Assuming binomial applies without independent trials or equal p. Confusing binomial coefficients with probabilities. Misremembering whether variance is np or np(1-p).

Explainer

You already know what a discrete random variable is — a variable that takes specific countable values, each with a defined probability — and you understand the multiplication rule for independent events. The binomial distribution is what emerges when you combine those two ideas in the most natural setting: repeated independent trials with the same outcome structure each time.

A single trial with two outcomes (success with probability p, failure with probability 1 − p) is called a Bernoulli trial. The binomial distribution counts successes across n such trials performed independently. To derive the PMF, ask: what is the probability of exactly k successes in n trials? One specific sequence with k successes and (n − k) failures has probability p^k · (1 − p)^{n − k} by the multiplication rule for independent events. But there are C(n, k) ways to arrange k successes among n positions (the binomial coefficient, "n choose k"), and each arrangement has the same probability. So P(X = k) = C(n, k) · p^k · (1 − p)^{n − k}. The formula is not handed down from above — it follows directly from counting arrangements and multiplying independent probabilities.

The mean E[X] = np has a clean intuition: if each trial succeeds with probability p, you expect np successes out of n. To see this formally, write X = X₁ + X₂ + ... + Xₙ where each Xᵢ is a Bernoulli(p) indicator variable. Since E[Xᵢ] = p for each i, linearity of expectation gives E[X] = np. The variance Var(X) = np(1 − p) follows similarly from the independence of the Xᵢ's: variances add for independent variables, and Var(Xᵢ) = p(1 − p) for each Bernoulli trial. Notice that variance is largest when p = 1/2 (maximum uncertainty) and shrinks toward zero as p approaches 0 or 1 (near certainty).

Two conditions must hold for the binomial to be appropriate: fixed n (the number of trials is set in advance) and constant, independent p (each trial has the same success probability and the trials do not influence each other). Drawing without replacement from a small population violates independence; trials where the probability of success shifts over time violate the constant-p condition. When these conditions hold — flipping a fair coin 20 times, testing whether each of 100 components is defective, counting how many of 50 email recipients click a link — the binomial distribution is the exact model. As n grows large, the binomial distribution approaches the normal distribution, which is why the normal appears as a limit for sums of independent random variables and why the binomial is your bridge to that next major topic.

Practice Questions 5 questions

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Angle Pairs: Complementary, Supplementary, and Vertical → Parallel Lines and Transversals → Corresponding Angles → Alternate Interior Angles → Triangle Angle Sum Theorem → Exterior Angle Theorem → Triangle Inequality Theorem → Similar Triangles: AA Similarity → Similar Triangles: SSS and SAS Similarity → Proportions in Similar Triangles → Right Triangle Trigonometry Introduction → Sine, Cosine, and Tangent Ratios → Trigonometric Ratios Review → Radian Measure → Converting Between Degrees and Radians → The Unit Circle → Graphing Sine and Cosine → Graphing Tangent and Reciprocal Trigonometric Functions → Derivatives of Trigonometric Functions → Antiderivatives → Indefinite Integrals → Basic Integration Rules → Riemann Sums → Definite Integral Definition → Probability Density Functions and Continuous Distributions → Cumulative Distribution Functions → Continuous Random Variables → Probability Density Functions → Expected Value → Weak Law of Large Numbers → Probability Axioms and Rules → Conditional Probability → Independence and the Multiplication Rule → Binomial Distribution

Longest path: 95 steps · 482 total prerequisite topics

Prerequisites (6)

Discrete Random Variableshard Independence and the Multiplication Rulehard Expected Valuesoft Expected Value and Variancesoft Probability with Combinatoricssoft Variance and Standard Deviation of Random Variablessoft

Leads To (5)

Binomial Option Pricing and Replicating Portfoliossoft Confidence Intervals for Proportionshard Confidence Intervals for Proportionssoft Enzyme Cooperativity and Hill Coefficientsoft Exoplanet Detection Methodssoft