5 questions to test your understanding
A researcher runs an ANOVA with 4 groups, finds a significant omnibus F-test, then examines all 6 possible pairwise comparisons to locate the effect. What correction is most appropriate for these comparisons?
A researcher conducts 100 statistical tests at α = .05. Assuming all null hypotheses are true (no real effects exist), approximately how many tests are expected to yield a 'significant' result?
The Bonferroni correction controls the familywise error rate by making each individual test harder to pass, but this comes at the cost of reduced statistical power to detect true effects.
A researcher who specifies exactly two theoretically motivated comparisons before data collection needs to apply the same stringent correction as a researcher who conducts 100 post-hoc comparisons on the same dataset.
Why does the multiple comparisons problem arise when conducting many statistical tests, and why can't it be fully fixed by applying corrections after data collection has occurred?