Effect Sizes, Practical Significance, and Results Reporting

Research Depth 93 in the knowledge graph I know this Set as goal
Unlocks 1 downstream topic
effect-size practical-significance result-reporting interpretation

Core Idea

Effect sizes (Cohen's d, r, eta-squared) quantify the magnitude of differences or relationships. They are comparable across studies and samples, making them crucial for meta-analysis and interpretation. Practical significance considers both statistical significance and effect magnitude: a statistically significant but negligible effect may be theoretically uninteresting. Reporting both p-values and effect sizes with confidence intervals enables full understanding.

How It's Best Learned

Calculate and interpret effect sizes for published studies. Compare studies with identical p-values but different effect sizes. Discuss when small effects are scientifically valuable and when large effects are expected.

Common Misconceptions

Explainer

From your study of statistical inference and significance testing, you know that a p-value answers a specific and narrow question: given that the null hypothesis is true, how probable is a result at least as extreme as the one observed? A small p-value tells you the result is unlikely under the null—that's evidence something real is happening. What it does not tell you is *how much* is happening. Two studies can both achieve p < 0.001 while one shows a large, clinically meaningful difference and the other shows a difference so small it has no practical consequence whatsoever. This is the gap that effect sizes fill.

An effect size is a standardized, scale-free measure of the magnitude of a relationship or difference. The most common in psychology is Cohen's d, which expresses a mean difference between two groups in standard deviation units: d = (M₁ − M₂) / SD_pooled. A d of 0.2 means the groups differ by two-tenths of a standard deviation—a small effect. A d of 0.8 means they differ by nearly a full standard deviation—a large effect by conventional benchmarks. Cohen's d is interpretable across studies because it removes the original measurement scale: a drug that raises test scores by 3 points on a 100-point scale and a drug that raises them by 6 points on a 200-point scale could have the same d if the SDs scale proportionally. Pearson's r (the correlation coefficient) also serves as an effect size for relationships, and eta-squared (η²) describes the proportion of variance explained in ANOVA designs—analogous to R² in regression.

The critical conceptual point is that statistical significance and effect size are logically independent. A tiny effect can be highly significant with a large sample (because significance depends on sample size), and a large effect can fail to reach significance with a small sample. In a study of thousands of participants, you might detect a statistically significant difference in IQ between people born in January versus July—but if d = 0.03, this result has essentially no practical meaning. Conversely, a clinical trial with only 20 patients might find a 40% reduction in symptoms (a massive effect) that fails to reach p < 0.05 purely due to low power. The p-value answers "Is this real?" Effect size answers "Does it matter?"

Reporting standards in psychology have shifted toward requiring both, along with confidence intervals. A well-reported result looks like: "The intervention group outperformed the control group by 8.4 points (d = 0.61, 95% CI [0.32, 0.90], p = .003)." This tells the reader that the effect is likely real (significant), medium-large in magnitude (d = 0.61), and the plausible range of the true effect excludes zero (the CI doesn't include d = 0). Confidence intervals around effect sizes are particularly valuable because they communicate both the estimate and the precision of that estimate—a very wide CI around a large d signals that the true effect size is uncertain despite the observed result. Together, p-values, effect sizes, and confidence intervals give a complete picture that neither statistic alone provides, and this complete picture is what modern psychological science demands.

Practice Questions 5 questions

Prerequisite Chain

Counting to 10Counting to 20Understanding ZeroThe Number ZeroCounting to FiveOne-to-One CorrespondenceCombining Small Groups Within 5Addition Within 10Addition Within 20Two-Digit Addition Without RegroupingTwo-Digit Addition with RegroupingAddition Within 100Repeated Addition as MultiplicationMultiplication Facts Within 100Division as Equal SharingDivision as Grouping (Measurement Division)Division: Grouping (Repeated Subtraction) ModelDivision: Fair Sharing ModelDivision as Equal SharingDivision as GroupingBasic Division FactsDivision Facts Within 100Two-Digit by One-Digit DivisionDivision with RemaindersRemainders and Quotients in DivisionDivision Word ProblemsIntroduction to Long DivisionFactors and MultiplesPrime and Composite NumbersEquivalent FractionsRelating Fractions and DecimalsDecimal Place ValueReading and Writing DecimalsComparing and Ordering DecimalsAdding and Subtracting DecimalsMultiplying DecimalsDividing DecimalsDividing FractionsMixed Number ArithmeticOrder of OperationsInteger Order of OperationsVariable ExpressionsCombining Like TermsOne-Step EquationsTwo-Step EquationsSolving Multi-Step EquationsEquations with Variables on Both SidesAngle Pairs: Complementary, Supplementary, and VerticalParallel Lines and TransversalsCorresponding AnglesAlternate Interior AnglesTriangle Angle Sum TheoremExterior Angle TheoremTriangle Inequality TheoremSimilar Triangles: AA SimilaritySimilar Triangles: SSS and SAS SimilarityProportions in Similar TrianglesRight Triangle Trigonometry IntroductionTrigonometric Ratios ReviewRadian MeasureConverting Between Degrees and RadiansThe Unit CircleGraphing Sine and CosineGraphing Tangent and Reciprocal Trigonometric FunctionsDerivatives of Trigonometric FunctionsAntiderivativesIndefinite IntegralsBasic Integration RulesRiemann SumsDefinite Integral DefinitionFundamental Theorem of Calculus Part 1Fundamental Theorem of Calculus Part 2U-SubstitutionPartial Fraction Decomposition for IntegrationImproper Integrals - ConvergenceIntegral TestP-SeriesComparison TestLimit Comparison TestAbsolute vs. Conditional ConvergencePower SeriesTaylor PolynomialsTaylor SeriesMoment Generating FunctionsCharacteristic FunctionsConvergence in DistributionStationary DistributionsConvergence of Markov ChainsConvergence in ProbabilityAlmost Sure ConvergenceStrong Law of Large NumbersCentral Limit Theorem (Rigorous via Characteristic Functions)Inferential Statistics, Hypothesis Testing, and P-ValuesEffect Sizes, Practical Significance, and Results Reporting

Longest path: 94 steps · 617 total prerequisite topics

Prerequisites (2)

Leads To (1)