A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Sampling Distributions of Statistics

College Depth 58 in the knowledge graph ☐ I know this ☆ Set as goal

1,603topics build on this

267prerequisites beneath it

Random Variables: Definition and Classification→→Advanced Survey Design Bootstrap Methods for Statistical Inference +4 more

sampling-distribution

Core Idea

A sampling distribution is the probability distribution of a sample statistic (mean, proportion, variance) computed from repeated random samples. It describes how statistics vary from sample to sample—crucial for inference. Does not depend on sample size in the way many misconceive.

Explainer

You already know that a random variable is a quantity whose value is determined by a random process. A sample statistic — a mean, a proportion, a variance — is itself a random variable. It takes a different value every time you draw a new sample from the same population. The sampling distribution is simply the probability distribution of that statistic: it tells you, across all possible samples of a given size, how likely each value of the statistic is.

To make this concrete, imagine a population of exam scores with a true mean of 72 and a standard deviation of 10. If you draw one random sample of 30 students and compute the sample mean, you might get 71.4. Draw another 30 students and you might get 73.1. Do this thousands of times, collect every sample mean, and plot the histogram — that histogram approximates the sampling distribution of the sample mean. Notice that the sampling distribution is a distribution *about the statistic itself*, not about individual scores. Its center, spread, and shape are separate questions from those of the original population.

The sampling distribution's shape and spread depend on two things: the population distribution and the sample size n. When n is small, the sampling distribution of the mean inherits more of the population's quirks (skewness, heavy tails). As n increases, something remarkable happens — the sampling distribution of the mean tends toward a normal distribution regardless of the population's shape. That is the content of the Central Limit Theorem, which builds directly on this concept. What matters here is understanding *why* the sampling distribution exists as an object: it captures the variability introduced by the random act of sampling, not variability in the population itself.

A crucial precision: the sampling distribution exists even when you only draw one sample in practice. It is a theoretical object — the distribution you would observe *if* you could repeat the experiment many times. This is the foundation of all frequentist inference. When a textbook says "the standard error of the mean is σ/√n," it is describing the standard deviation of the sampling distribution of the sample mean. Every confidence interval and hypothesis test is a statement about where in the sampling distribution the observed statistic falls — which is why mastering this concept unlocks everything that follows in statistical inference.

Practice Questions 5 questions

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Making 10 as an Addition Strategy → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts Through 10 → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Opposites and Additive Inverses → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Introduction to Exponents → Order of Operations → Integer Order of Operations → Variable Expressions → Function Notation Review → Random Variables: Definition and Classification → Sampling Distributions of Statistics

Longest path: 59 steps · 267 total prerequisite topics

Prerequisites (1)

Random Variables: Definition and Classificationhard

Leads To (6)

Advanced Survey Designsoft Bootstrap Methods for Statistical Inferencehard Central Limit Theoremhard Distribution of the Sample Meanhard Populations, Sampling Methods, and Representativenesssoft Randomized Experiments in Development Economicshard