A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

One-Way ANOVA: Theory and F-Test

College · Depth 104 in the knowledge graph

2 topics build on this

519 prerequisites beneath it

anova

Core Idea

One-way ANOVA tests the null hypothesis H₀: μ₁ = μ₂ = ... = μₖ, asking whether k group means differ more than chance predicts. The F-statistic equals MS_Between / MS_Within, with degrees of freedom (k−1, N−k), where N is the total number of observations. MS_Between captures variation among group means, while MS_Within estimates pooled within-group error. The test assumes equal variances across groups, approximate normality within groups, and independence of observations. A large F value indicates the group means differ more than expected from within-group variability alone.

How It's Best Learned

Start with a concrete example—compare exam scores across three teaching methods. Compute group means, then manually calculate SS_Between and SS_Within to build intuition for what the F-ratio measures before relying on software output.
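
The hand calculation described above can be sketched in a few lines of Python. The scores and the method names (`lecture`, `flipped`, `online`) are made-up illustration data, not results from any study, and only the standard library is used so that every step stays visible.

```python
from statistics import mean

# Hypothetical exam scores for three teaching methods (made-up data).
scores = {
    "lecture": [78, 82, 85, 75, 80],
    "flipped": [85, 88, 90, 84, 93],
    "online": [70, 72, 68, 75, 65],
}

all_scores = [x for group in scores.values() for x in group]
grand_mean = mean(all_scores)
group_means = {name: mean(group) for name, group in scores.items()}

# Between-group sum of squares: group means around the grand mean,
# each weighted by its group size.
ss_between = sum(
    len(group) * (group_means[name] - grand_mean) ** 2
    for name, group in scores.items()
)

# Within-group sum of squares: observations around their own group mean.
ss_within = sum(
    (x - group_means[name]) ** 2
    for name, group in scores.items()
    for x in group
)

# Total sum of squares: observations around the grand mean.
ss_total = sum((x - grand_mean) ** 2 for x in all_scores)

k = len(scores)            # number of groups
n_total = len(all_scores)  # total number of observations, N

ms_between = ss_between / (k - 1)
ms_within = ss_within / (n_total - k)
f_ratio = ms_between / ms_within

print(f"SS_Between = {ss_between:.2f}, SS_Within = {ss_within:.2f}")
print(f"SS_Total   = {ss_total:.2f}")
print(f"F({k - 1}, {n_total - k}) = {f_ratio:.2f}")
```

For this made-up data the group means are 80, 88, and 70, giving SS_Between ≈ 813.33, SS_Within = 170, and F(2, 12) ≈ 28.71. The between-group term dominates because the three means sit far apart relative to the spread inside each group.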

Common Misconceptions

A significant F-test does not tell you which groups differ—post-hoc tests are needed. ANOVA is also reasonably robust to mild normality violations, so the normality assumption is not as fragile as students often fear.

Explainer

You already know the two-sample t-test for comparing two group means. But what happens when you have three, four, or more groups? The natural impulse is to run all pairwise t-tests; with k groups, that means k(k−1)/2 tests. The problem is multiple comparisons: each test has a false-positive rate of α, and across many tests, the probability of at least one spurious significant result grows rapidly. With five groups and ten pairwise t-tests at α = 0.05, the family-wise error rate is roughly 40% (1 − 0.95¹⁰ ≈ 0.40 if the tests are treated as independent). One-way ANOVA solves this by performing a single omnibus test that compares all group means simultaneously, keeping the overall error rate at α.
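
The growth of the family-wise error rate can be checked with a short calculation. This is a rough sketch that treats the pairwise tests as independent, which is only an approximation because pairwise t-tests share groups; the function name `familywise_error_rate` is a label chosen here for illustration.

```python
from math import comb

def familywise_error_rate(k, alpha=0.05):
    """Return (number of pairwise tests, approximate family-wise error rate).

    Assumes the tests are independent, so the chance of at least one
    false positive among m tests is 1 - (1 - alpha)^m.
    """
    m = comb(k, 2)  # k(k-1)/2 pairwise comparisons
    return m, 1 - (1 - alpha) ** m

for k in (3, 4, 5):
    m, fwer = familywise_error_rate(k)
    print(f"k={k}: {m} pairwise tests, FWER ~ {fwer:.3f}")
```

With five groups this gives ten tests and a rate of about 0.401, matching the figure quoted above.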

The core idea is a decomposition of total variability. Take all N observations, compute the grand mean (the mean of all data regardless of group), and measure total variation around it: SS_Total = Σ(xᵢ − x̄_grand)². This total variation splits cleanly into two additive components. SS_Between measures how much the group means vary around the grand mean — it captures the "signal" attributable to group membership. SS_Within measures how much individual observations vary around their own group mean — it captures the "noise" or baseline variability that exists even within homogeneous groups. The identity SS_Total = SS_Between + SS_Within holds exactly, partitioning every bit of variation into explained (between) and unexplained (within).
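
The reason the split is exact can be seen in a short derivation. The double-index notation is an assumption introduced here: x_{ij} is the j-th observation in group i, n_i is the size of group i, \bar{x}_i is the mean of group i, and \bar{x} is the grand mean (written x̄_grand above).

```latex
\begin{align*}
SS_{\text{Total}}
  &= \sum_{i=1}^{k} \sum_{j=1}^{n_i} (x_{ij} - \bar{x})^2 \\
  &= \sum_{i=1}^{k} \sum_{j=1}^{n_i}
     \bigl[(x_{ij} - \bar{x}_i) + (\bar{x}_i - \bar{x})\bigr]^2 \\
  &= \underbrace{\sum_{i=1}^{k} \sum_{j=1}^{n_i} (x_{ij} - \bar{x}_i)^2}_{SS_{\text{Within}}}
   + \underbrace{\sum_{i=1}^{k} n_i (\bar{x}_i - \bar{x})^2}_{SS_{\text{Between}}}
   + 2 \sum_{i=1}^{k} (\bar{x}_i - \bar{x})
     \underbrace{\sum_{j=1}^{n_i} (x_{ij} - \bar{x}_i)}_{=\,0}
\end{align*}
```

The cross term vanishes because deviations from a group's own mean always sum to zero within that group, which leaves only the within and between components.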

The F-statistic is the ratio of two mean squares: F = MS_Between / MS_Within, where each SS is divided by its degrees of freedom to make the quantities comparable. MS_Between uses k − 1 degrees of freedom (k group means minus one constraint from the grand mean). MS_Within uses N − k degrees of freedom (N observations minus k group means estimated). Under the null hypothesis that all population means are equal, both mean squares estimate the same population variance σ², so F should be approximately 1. When at least one group mean genuinely differs, MS_Between inflates — the group mean differences add to the between-group variance — while MS_Within stays anchored to within-group noise. A large F-ratio therefore signals that group means differ more than random sampling alone would predict.
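
The claim that F hovers near 1 under the null hypothesis and inflates otherwise can be illustrated with a small simulation. This is a sketch under stated assumptions: three groups of 20 normal observations with a common standard deviation of 10, and population means chosen arbitrarily for illustration. The helper names `f_statistic` and `simulate_mean_f` are hypothetical.

```python
import random

def f_statistic(groups):
    """Compute the one-way ANOVA F-ratio MS_Between / MS_Within."""
    values = [x for g in groups for x in g]
    grand_mean = sum(values) / len(values)
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    k, n_total = len(groups), len(values)
    return (ss_between / (k - 1)) / (ss_within / (n_total - k))

def simulate_mean_f(group_means, n_per_group=20, sigma=10.0, n_sims=2000, seed=0):
    """Average F over repeated samples drawn from normal populations."""
    rng = random.Random(seed)
    f_values = []
    for _ in range(n_sims):
        groups = [
            [rng.gauss(mu, sigma) for _ in range(n_per_group)]
            for mu in group_means
        ]
        f_values.append(f_statistic(groups))
    return sum(f_values) / len(f_values)

# All population means equal: the null hypothesis is true.
null_mean_f = simulate_mean_f([50, 50, 50])

# One population mean shifted: the null hypothesis is false.
shifted_mean_f = simulate_mean_f([50, 50, 60])

print(f"average F when H0 is true:  {null_mean_f:.2f}")
print(f"average F when H0 is false: {shifted_mean_f:.2f}")
```

Under the null the long-run average of F is (N−k)/(N−k−2), slightly above 1, so an average a little over 1 is expected here. When one mean is shifted the average F is several times larger, which is the inflation of MS_Between described above.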

A significant F-test tells you "not all means are equal" but does not identify which specific groups differ. This is an existence result, not a location result. To determine which pairs of means are significantly different, you need post-hoc tests such as Tukey's HSD, which perform all pairwise comparisons with a correction that controls the family-wise error rate. The ANOVA framework assumes approximately normal distributions within each group, equal variances across groups (homoscedasticity), and independence of observations. The F-test is fairly robust to departures from normality at moderate sample sizes, but unequal variances can distort it, especially when group sizes are also unequal. Welch's ANOVA provides a correction analogous to Welch's t-test when the equal-variance assumption fails.
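
To make the Welch correction concrete, here is a minimal sketch of the statistic in its usual textbook form: each group is weighted by n_i / s_i², and the denominator degrees of freedom are adjusted downward. The function name `welch_anova` and the scores are hypothetical, every group is assumed to have at least two observations and nonzero variance, and the p-value (which requires the F distribution with the returned degrees of freedom) is left to a statistics library.

```python
from statistics import mean, variance

def welch_anova(groups):
    """Return (F*, df1, df2) for Welch's one-way ANOVA."""
    k = len(groups)
    sizes = [len(g) for g in groups]
    means = [mean(g) for g in groups]
    # Weight each group by n_i / s_i^2 (sample variance, n_i - 1 denominator).
    weights = [n / variance(g) for n, g in zip(sizes, groups)]
    total_weight = sum(weights)
    weighted_mean = sum(w * m for w, m in zip(weights, means)) / total_weight

    numerator = sum(
        w * (m - weighted_mean) ** 2 for w, m in zip(weights, means)
    ) / (k - 1)
    lam = sum(
        (1 - w / total_weight) ** 2 / (n - 1) for w, n in zip(weights, sizes)
    )
    denominator = 1 + 2 * (k - 2) / (k ** 2 - 1) * lam

    f_star = numerator / denominator
    df1 = k - 1
    df2 = (k ** 2 - 1) / (3 * lam)  # generally not an integer
    return f_star, df1, df2

# Made-up scores: the second group is far more variable than the others.
groups = [
    [78, 82, 85, 75, 80],
    [60, 95, 88, 70, 99, 81],
    [70, 72, 68, 75, 65],
]
f_star, df1, df2 = welch_anova(groups)
print(f"Welch F* = {f_star:.2f} on ({df1}, {df2:.1f}) degrees of freedom")
```

Groups with large variance receive small weights, so a single noisy group has less influence on the weighted grand mean and on the statistic than it would in the classical F-test.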

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Angle Pairs: Complementary, Supplementary, and Vertical → Parallel Lines and Transversals → Corresponding Angles → Alternate Interior Angles → Triangle Angle Sum Theorem → Exterior Angle Theorem → 
Triangle Inequality Theorem → Similar Triangles: AA Similarity → Similar Triangles: SSS and SAS Similarity → Proportions in Similar Triangles → Right Triangle Trigonometry Introduction → Sine, Cosine, and Tangent Ratios → Trigonometric Ratios Review → Radian Measure → Converting Between Degrees and Radians → The Unit Circle → Graphing Sine and Cosine → Graphing Tangent and Reciprocal Trigonometric Functions → Derivatives of Trigonometric Functions → Antiderivatives → Indefinite Integrals → Basic Integration Rules → Riemann Sums → Definite Integral Definition → Probability Density Functions and Continuous Distributions → Cumulative Distribution Functions → Continuous Random Variables → Probability Density Functions → Expected Value → Weak Law of Large Numbers → Probability Axioms and Rules → Conditional Probability → Independence of Events → Sampling Distributions → Standard Error of Estimators → Hypothesis Testing: Framework and Logic → P-values and Statistical Significance → Effect Size and Practical Significance → Hypothesis Testing: Framework and Logic → Z-Tests and T-Tests for Means → One-Sample Z-Test for Means → One-Sample and Two-Sample T-Tests → One-Way ANOVA → One-Way ANOVA: Theory and F-Test

Longest path: 105 steps · 519 total prerequisite topics

Prerequisites (2)

F-Distribution: Comparing Variances (hard)
One-Way ANOVA (soft)

Leads To (1)

Generalizability Theory and Multi-Faceted Reliability (soft)