Questions — Subgroup Analysis and Treatment Effect Heterogeneity

Question 1 Multiple Choice

A clinical trial reports that among men, the treatment significantly reduced mortality (p = 0.03), while among women the effect was not statistically significant (p = 0.12). The investigators conclude that the treatment benefits men but not women. What is the most serious flaw in this reasoning?

AThe trial should have enrolled more women to increase power

BComparing p-values within subgroups does not test whether the effects actually differ — a formal interaction test is required

CThe treatment effect in men is too small to be clinically meaningful at p = 0.03

DSubgroup analyses are never valid and should not be reported

Question 2 Multiple Choice

Which of the following subgroup findings would be most credible and worth investigating further?

AA subgroup finding discovered after unblinding that was not pre-specified, with no biological rationale and a borderline interaction p-value of 0.04

BA pre-specified subgroup based on a known mechanistic pathway, with a significant formal interaction test, replicated in an independent dataset

CA statistically significant effect in one subgroup of 20 patients, with a non-significant effect in all other subgroups

DA subgroup analysis with 15 comparisons where 2 show significant interaction, consistent with chance at α = 0.05

Question 3 True / False

Pre-specifying which subgroups will be analyzed before data collection or unblinding is primarily a bureaucratic requirement rather than a substantive methodological safeguard.

TTrue

FFalse

Question 4 True / False

A drug with a null average treatment effect might still be beneficial for a well-defined subpopulation.

TTrue

FFalse

Question 5 Short Answer

Why is a formal test for interaction necessary when analyzing subgroup effects, rather than simply comparing p-values from separate within-subgroup analyses?

Think about your answer, then reveal below.

Questions: Subgroup Analysis and Treatment Effect Heterogeneity