One-Way ANOVA: Theory and F-Test

College Depth 80 in the knowledge graph I know this Set as goal
Unlocks 2 downstream topics
anova

Core Idea

One-way ANOVA tests the null hypothesis H₀: μ₁ = μ₂ = ... = μₖ, asking whether k group means differ more than chance predicts. The F-statistic equals MS_Between / MS_Within, with degrees of freedom (k−1, n−k). MS_Between captures variation among group means, while MS_Within estimates pooled within-group error. The test assumes equal variances across groups, approximate normality within groups, and independence of observations. A large F value indicates the group means differ more than expected from within-group variability alone.

How It's Best Learned

Start with a concrete example—compare exam scores across three teaching methods. Compute group means, then manually calculate SS_Between and SS_Within to build intuition for what the F-ratio measures before relying on software output.

Common Misconceptions

A significant F-test does not tell you which groups differ—post-hoc tests are needed. ANOVA is also reasonably robust to mild normality violations, so the normality assumption is not as fragile as students often fear.

Explainer

You already know the two-sample t-test for comparing two group means. But what happens when you have three, four, or more groups? The natural impulse is to run all pairwise t-tests — with k groups, that means k(k−1)/2 tests. The problem is multiple comparisons: each test has a false-positive rate of α, and across many tests, the probability of at least one spurious significant result grows rapidly. With five groups and ten pairwise t-tests at α = 0.05, the family-wise error rate climbs toward 40%. One-way ANOVA solves this by performing a single omnibus test that compares all group means simultaneously, keeping the overall error rate at α.

The core idea is a decomposition of total variability. Take all N observations, compute the grand mean (the mean of all data regardless of group), and measure total variation around it: SS_Total = Σ(xᵢ − x̄_grand)². This total variation splits cleanly into two additive components. SS_Between measures how much the group means vary around the grand mean — it captures the "signal" attributable to group membership. SS_Within measures how much individual observations vary around their own group mean — it captures the "noise" or baseline variability that exists even within homogeneous groups. The identity SS_Total = SS_Between + SS_Within holds exactly, partitioning every bit of variation into explained (between) and unexplained (within).

The F-statistic is the ratio of two mean squares: F = MS_Between / MS_Within, where each SS is divided by its degrees of freedom to make the quantities comparable. MS_Between uses k − 1 degrees of freedom (k group means minus one constraint from the grand mean). MS_Within uses N − k degrees of freedom (N observations minus k group means estimated). Under the null hypothesis that all population means are equal, both mean squares estimate the same population variance σ², so F should be approximately 1. When at least one group mean genuinely differs, MS_Between inflates — the group mean differences add to the between-group variance — while MS_Within stays anchored to within-group noise. A large F-ratio therefore signals that group means differ more than random sampling alone would predict.

A significant F-test tells you "not all means are equal" but does not identify which specific groups differ. This is an existence result, not a location result. To determine which pairs of means are significantly different, you need post-hoc tests such as Tukey's HSD, which perform all pairwise comparisons with a correction that controls the family-wise error rate. The ANOVA framework assumes approximately normal distributions within each group, equal variances across groups (homoscedasticity), and independence of observations. The normality assumption is fairly robust for moderate sample sizes, but unequal variances can distort the F-test — Welch's ANOVA provides a correction analogous to Welch's t-test when this assumption fails.

Practice Questions 5 questions

Prerequisite Chain

Counting to 10Counting to 20Understanding ZeroThe Number ZeroCounting to FiveOne-to-One CorrespondenceCombining Small Groups Within 5Addition Within 10Addition Within 20Two-Digit Addition Without RegroupingTwo-Digit Addition with RegroupingAddition Within 100Repeated Addition as MultiplicationMultiplication Facts Within 100Division as Equal SharingDivision as Grouping (Measurement Division)Division: Grouping (Repeated Subtraction) ModelDivision: Fair Sharing ModelDivision as Equal SharingDivision as GroupingBasic Division FactsDivision Facts Within 100Two-Digit by One-Digit DivisionDivision with RemaindersRemainders and Quotients in DivisionDivision Word ProblemsIntroduction to Long DivisionFactors and MultiplesPrime and Composite NumbersEquivalent FractionsRelating Fractions and DecimalsDecimal Place ValueReading and Writing DecimalsComparing and Ordering DecimalsAdding and Subtracting DecimalsMultiplying DecimalsDividing DecimalsDividing FractionsMixed Number ArithmeticOrder of OperationsInteger Order of OperationsVariable ExpressionsCombining Like TermsOne-Step EquationsTwo-Step EquationsSolving Multi-Step EquationsEquations with Variables on Both SidesAngle Pairs: Complementary, Supplementary, and VerticalParallel Lines and TransversalsCorresponding AnglesAlternate Interior AnglesTriangle Angle Sum TheoremExterior Angle TheoremTriangle Inequality TheoremSimilar Triangles: AA SimilaritySimilar Triangles: SSS and SAS SimilarityProportions in Similar TrianglesRight Triangle Trigonometry IntroductionTrigonometric Ratios ReviewRadian MeasureConverting Between Degrees and RadiansThe Unit CircleGraphing Sine and CosineGraphing Tangent and Reciprocal Trigonometric FunctionsDerivatives of Trigonometric FunctionsAntiderivativesIndefinite IntegralsBasic Integration RulesRiemann SumsDefinite Integral DefinitionProbability Density Functions and Continuous DistributionsCumulative Distribution FunctionsContinuous Random VariablesNormal DistributionCentral Limit TheoremConfidence Intervals for MeansZ-Tests and T-Tests for MeansOne-Sample Z-Test for MeansOne-Sample and Two-Sample T-TestsOne-Way ANOVAOne-Way ANOVA: Theory and F-Test

Longest path: 81 steps · 379 total prerequisite topics

Prerequisites (2)

Leads To (1)