Questions — Multiple Imputation for Missing Data

Question 1 Multiple Choice

A clinical trial has 30% missing outcome data. The analyst performs complete-case analysis, using only the 70% of patients with complete data. Under what condition is this approach unbiased?

AWhen the data are Missing at Random (MAR)

BWhen the data are Missing Completely at Random (MCAR) — missingness is unrelated to any variable, observed or unobserved

CComplete-case analysis is always unbiased because it uses real observed data

DWhen more than 50% of data are observed

Question 2 Short Answer

Single imputation (replacing each missing value with one predicted value, like the mean) produces unbiased point estimates under MAR. However, it still underestimates uncertainty. Why?

Think about your answer, then reveal below.

Question 3 True / False

Data are Missing Not at Random (MNAR) when the probability of missingness depends on the unobserved value itself. For example, patients with severe depression are less likely to return for follow-up questionnaires. Multiple imputation under MAR assumptions will produce biased results in this scenario.

TTrue

FFalse

Question 4 Multiple Choice

A colleague uses 5 imputations for a multiple imputation analysis, arguing this is sufficient based on Rubin's original recommendation. Is this still considered adequate?

AYes — 5 imputations is always sufficient

BNo — current guidance recommends 20-50 or more imputations, especially when the fraction of missing information is high, to stabilize standard error estimates and p-values

CThe number of imputations does not affect the results

DOnly 1 imputation is needed if the imputation model is correct

Questions: Multiple Imputation for Missing Data