A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Confidence Intervals for Proportions

College Depth 100 in the knowledge graph ☐ I know this ☆ Set as goal

512prerequisites beneath it

Binomial Distribution Central Limit Theorem: Rigor and Applications +1 more→

Core Idea

Sample proportion p̂=X/n has approximately N(p, p(1−p)/n) distribution when np≥10 and n(1−p)≥10. CI: p̂±z_{α/2}√(p̂(1−p̂)/n). Exact methods (Clopper-Pearson) preferred when normality conditions fail.

Explainer

You know from the Central Limit Theorem that sample means of i.i.d. observations are approximately normally distributed for large n. A sample proportion p̂ = X/n is a special case: X counts successes in n Bernoulli trials, so X ~ Binomial(n, p). Each trial contributes either 0 or 1 to the sum, and p̂ is the mean of these 0-1 observations. By the CLT, p̂ ≈ N(p, p(1−p)/n) — the true proportion p is the mean of the Bernoulli, and p(1−p) is its variance, so the standard error of p̂ is √(p(1−p)/n).

The confidence interval formula follows directly from this approximation. A 95% confidence interval for a Normal mean is point estimate ± 1.96 × (standard error). Since we don't know p (that's what we're estimating), we plug in p̂ in its place: CI = p̂ ± z_{α/2} √(p̂(1−p̂)/n). Here z_{α/2} is the z-critical value for the desired confidence level — 1.96 for 95%, 2.576 for 99%. The margin of error is the ± part: it tells you the half-width of the interval.

The conditions np ≥ 10 and n(1−p) ≥ 10 (sometimes stated as np ≥ 5) ensure the Binomial is well-approximated by the Normal. Intuitively, if p = 0.01 and n = 50, then you'd expect only 0.5 successes on average — the distribution is heavily skewed toward zero, and the Normal approximation is poor. These conditions require enough expected successes *and* expected failures for the distribution to look roughly symmetric and bell-shaped. When they fail, the Normal-based interval can have poor coverage — the actual proportion of intervals containing the true p may be much less than the nominal 95%.

In that case, the Clopper-Pearson interval (also called the "exact" binomial interval) uses the Binomial distribution directly rather than the Normal approximation. It constructs the interval by finding the values of p that make the observed count X neither too extreme in the lower tail nor the upper tail. Clopper-Pearson is conservative — its actual coverage is always at least the nominal level — but it tends to be wider than necessary. This is the fundamental tradeoff: the approximate Normal interval is narrower and simpler but unreliable for small n or extreme p; the exact interval is always valid but wider.

A useful fact: the margin of error is maximized when p̂ = 0.5, giving maximum margin = z_{α/2} / (2√n). For a 95% CI and n = 1000, this is approximately 1.96/(2·31.6) ≈ 0.031 — about 3 percentage points. This is why political polls with "margin of error ±3%" typically use roughly 1,000 respondents. Doubling the precision (halving the margin) requires quadrupling n — the square root in the denominator means precision is expensive to buy with sample size alone.

Practice Questions 5 questions

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Angle Pairs: Complementary, Supplementary, and Vertical → Parallel Lines and Transversals → Corresponding Angles → Alternate Interior Angles → Triangle Angle Sum Theorem → Exterior Angle Theorem → Triangle Inequality Theorem → Similar Triangles: AA Similarity → Similar Triangles: SSS and SAS Similarity → Proportions in Similar Triangles → Right Triangle Trigonometry Introduction → Sine, Cosine, and Tangent Ratios → Trigonometric Ratios Review → Radian Measure → Converting Between Degrees and Radians → The Unit Circle → Graphing Sine and Cosine → Graphing Tangent and Reciprocal Trigonometric Functions → Derivatives of Trigonometric Functions → Antiderivatives → Indefinite Integrals → Basic Integration Rules → Riemann Sums → Definite Integral Definition → Probability Density Functions and Continuous Distributions → Cumulative Distribution Functions → Continuous Random Variables → Probability Density Functions → Expected Value → Weak Law of Large Numbers → Probability Axioms and Rules → Conditional Probability → Conditional Distributions → Bivariate Normal Distribution → Normal Distribution: Properties and Fundamentals → Central Limit Theorem: Rigor and Applications → Confidence Intervals: General Framework → Confidence Intervals for Proportions → Confidence Intervals for Population Means → Confidence Intervals for Proportions

Longest path: 101 steps · 512 total prerequisite topics

Prerequisites (3)

Binomial Distributionhard Central Limit Theorem: Rigor and Applicationshard Confidence Intervals for Population Meanssoft

Leads To (0)

No topics depend on this one yet.