Moderation (or interaction) occurs when the effect of an independent variable on a dependent variable depends on the level of another variable (the moderator), revealing boundary conditions or contexts for effects. For example, the effect of a cognitive training intervention on memory may be stronger for older adults than younger adults, with age moderating the intervention effect. Moderation analysis examines 'for whom' or 'under what conditions' effects occur, revealing important individual and contextual factors. Testing moderation requires sufficient statistical power and can be tested in both experimental and correlational designs using regression with multiplicative interaction terms or multilevel modeling.
Analyze an existing dataset examining whether an effect differs across levels of a moderator variable. Use simple slopes analysis to interpret significant interactions.
A significant main effect must exist for a significant interaction (actually, interactions can occur without main effects). Moderation proves that a variable is causal (actually, moderation is a pattern of association that is consistent with moderation but does not prove causation from observational data).
You know from inferential statistics that a main effect describes the average relationship between a predictor and an outcome across all levels of other variables in the model. Moderation asks a different question: not "is there an effect of X on Y?" but "does the effect of X on Y *change* depending on the value of some third variable Z?" When the answer is yes, Z is a moderator — a boundary condition on the effect. Moderation analysis reveals not just *that* something works, but *for whom* or *under what circumstances* it works.
The conceptual core is an interaction: the effect of the independent variable is not constant but varies across levels of the moderator. In a clinical trial testing a stress-reduction intervention, you might find that the intervention reduces cortisol overall (main effect). But if older adults show a 15-point reduction while younger adults show a 2-point reduction, the intervention effect is moderated by age. The "effect" is not a single number — it is a function of the moderator's level. Interactions can operate in many ways: one group benefits while another does not (fan-out interaction), one group benefits while another is harmed (crossover or disordinal interaction), or the effect is strong in one condition and flat in another. Disordinal interactions are particularly important because they undermine the meaning of the main effect: a positive main effect might actually consist of a positive effect in one group and a negative effect in another, making the average misleading.
In regression, moderation is modeled by creating a product term: X × Z is added to the equation alongside the main effects of X and Z. The coefficient on the product term is the interaction. If that coefficient is statistically significant, the effect of X on Y depends on Z. A key technical point: the main effect coefficients in a model with an interaction term are *not* interpretable as the overall average effects — they are the estimated effects at specific values of the moderator (usually zero, which may not be a meaningful value). This is why simple slopes analysis is the follow-up of choice: you estimate the effect of X on Y separately at different values of Z (often: Z = mean, Z = +1 SD, Z = −1 SD), which translates the interaction coefficient into a set of interpretable conditional effects.
Moderation analysis has high practical value but also high false-positive risk. Interaction effects require substantially larger samples than main effects to detect reliably — roughly four times the sample size for the same power, in simple cases. Low-powered moderation studies frequently produce spurious interactions that fail to replicate. Best practice is to specify the moderator hypothesis in advance (confirmatory rather than exploratory), use adequately powered samples, and report effect sizes for the interaction alongside *p*-values. An interaction that crosses significance only because it was found after testing twelve candidate moderators is very likely noise. The question "for whom does this work?" is genuinely important — but the answer requires the same disciplined inference as any other claim.
No topics depend on this one yet.