Questions — Causal Inference Methods in Biostatistics

Question 1 Multiple Choice

The fundamental problem of causal inference states that we can never observe both potential outcomes for the same individual. Why does randomization solve this problem at the population level even though it cannot solve it at the individual level?

ARandomization ensures every individual is observed under both conditions

BRandomization ensures that the treatment and control groups are, on average, exchangeable — the treated group's observed outcomes estimate Y(1) for the population, and the control group's observed outcomes estimate Y(0), because assignment is independent of potential outcomes

CRandomization eliminates all confounders including those that cannot be measured

DBoth B and C

Question 2 Multiple Choice

A directed acyclic graph (DAG) shows that Socioeconomic Status (SES) causes both Exercise (treatment) and Heart Disease (outcome). A researcher adjusts for SES in a regression. According to the DAG, is this sufficient to identify the causal effect of Exercise on Heart Disease?

AYes, if SES is the only confounder — adjusting for it blocks the backdoor path from Exercise to Heart Disease through SES

BNo — adjusting for confounders is never sufficient in observational data

CYes, but only if Exercise is randomized

DNo — you should also adjust for all variables in the dataset

Question 3 True / False

Adjusting for a collider (a variable caused by both treatment and outcome) in a regression introduces bias rather than removing it.

TTrue

FFalse

Question 4 Short Answer

Explain the SUTVA (Stable Unit Treatment Value Assumption) and give a biostatistical example of when it would be violated.

Think about your answer, then reveal below.

Questions: Causal Inference Methods in Biostatistics