A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Variance and Standard Deviation

College Depth 60 in the knowledge graph ☐ I know this ☆ Set as goal

1,695topics build on this

269prerequisites beneath it

Expected Value: Theory and Properties→→Accuracy, Precision, and Error Covariance and Correlation of Random Variables +3 more

variance spread

Core Idea

Variance σ²=Var(X)=E[(X−μ)²]=E[X²]−μ² measures spread. Standard deviation σ=√Var(X) is in original units. Var(aX+b)=a²Var(X). For independent variables, Var(X+Y)=Var(X)+Var(Y). Variance characterizes dispersion around the mean.

Explainer

You already know that the expected value E[X] = μ is the probability-weighted average of a random variable — the center of mass of the distribution. But two distributions can share the same mean yet behave very differently. A coin that pays $1 with certainty has the same expected value as a coin that pays $0 or $2 with equal probability, but the second one is riskier. Variance is the tool that quantifies that spread.

Variance is defined as Var(X) = E[(X − μ)²]. The logic: subtract the mean from each outcome to get the deviation, square it so negatives don't cancel positives, then take the expectation. Squaring is the canonical choice — it penalizes large deviations quadratically and produces a mathematically clean theory. The computational shortcut E[X²] − μ² follows directly from expanding the square: E[(X−μ)²] = E[X² − 2μX + μ²] = E[X²] − 2μ² + μ² = E[X²] − μ². Use whichever form is easier for a given distribution.

The squaring introduces a units problem: if X is in dollars, variance is in dollars-squared. Standard deviation σ = √Var(X) restores original units and is typically what you report. But variance is what you use in formulas, because it has the crucial additivity property: for *independent* random variables, Var(X + Y) = Var(X) + Var(Y). This property doesn't hold for standard deviation (√(σ_X² + σ_Y²) ≠ σ_X + σ_Y), which is why variance is the fundamental object even if standard deviation is more interpretable.

The scaling rule Var(aX + b) = a²Var(X) is worth internalizing. Shifting a distribution by a constant b doesn't change its spread — variance ignores location. Scaling by a factor a stretches all deviations by a, so squared deviations scale by a². This means if you double the measurement scale of a variable, its variance quadruples. This rule is essential for standardizing random variables: if you form Z = (X − μ)/σ, then Var(Z) = (1/σ²)·Var(X) = (1/σ²)·σ² = 1. Standard deviation is the natural unit of spread, and standardization sets it to 1.

Variance connects to everything downstream. The Chebyshev inequality (which you'll study next) uses variance to bound how much probability can lie far from the mean — a distribution with small variance can't put much probability far from its center. Covariance, which measures joint spread of two variables, is the generalization of variance to pairs: Cov(X, X) = Var(X). Understanding variance as squared expected deviation from the mean is the conceptual foundation for all of these extensions.

Practice Questions 5 questions

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Making 10 as an Addition Strategy → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts Through 10 → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Opposites and Additive Inverses → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Introduction to Exponents → Order of Operations → Integer Order of Operations → Variable Expressions → Function Notation Review → Random Variables: Definition and Classification → Probability Mass Functions and Discrete Distributions → Expected Value: Theory and Properties → Variance and Standard Deviation

Longest path: 61 steps · 269 total prerequisite topics

Prerequisites (1)

Expected Value: Theory and Propertieshard

Leads To (5)

Accuracy, Precision, and Errorsoft Covariance and Correlation of Random Variablessoft Investment Risk and Returnsoft Measurement Uncertainty Budgetingsoft Moment Generating Functionshard