Questions: Directed Acyclic Graphs for Causal Modeling
5 questions to test your understanding
Score: 0 / 5
Question 1 Multiple Choice
A study examines whether a drug (D) causes recovery (R). Both D and R independently increase the likelihood of hospitalization (H): D → H ← R. A researcher adjusts for H to control for 'health severity.' What is the consequence?
AThis correctly removes confounding by health severity, improving the estimate
BThis introduces a spurious association between D and R by conditioning on a collider
CThis blocks a mediating pathway, causing underestimation of the drug's total effect
DThis has no effect because H is downstream of both D and R
H is a collider — arrows point INTO it from both D and R. Colliders do not create open paths between their causes by default. But conditioning on H opens a spurious non-causal path between D and R, introducing bias that wasn't there before. Among hospitalized patients, the absence of the drug becomes evidence for the disease (since something must explain the hospitalization), creating an artificial negative association. Adjusting for a collider makes the estimate worse, not better — this is the counterintuitive core of collider bias.
Question 2 Multiple Choice
A researcher wants to estimate the total effect of exercise (E) on cardiovascular disease (CVD), where part of the effect runs through reduced blood pressure: E → BP → CVD. She includes blood pressure (BP) in her regression. What does her model estimate?
AThe total causal effect of exercise on CVD
BThe direct effect of exercise on CVD not mediated through blood pressure
CThe effect of blood pressure on CVD, controlling for exercise
DAn unbiased total effect estimate with reduced variance from the extra covariate
BP is a mediator on the causal path E → BP → CVD. Adjusting for a mediator blocks the indirect pathway, so the model captures only the direct effect of E on CVD via routes that do not go through BP. To estimate the total effect of exercise, BP should NOT be adjusted for. This is one of the most common mistakes in observational epidemiology: including every 'relevant' variable without asking whether a variable is a mediator, confounder, or collider in the DAG.
Question 3 True / False
In a DAG analysis, adjusting for more variables generally provides a better causal estimate because each additional covariate removes another source of potential confounding.
TTrue
FFalse
Answer: False
This is the central error DAGs are designed to prevent. Adjusting for a collider opens a spurious non-causal path, introducing bias. Adjusting for a mediator blocks the causal path you want to estimate. The correct adjustment set depends on the causal structure — the DAG — not on maximizing the number of covariates. 'Adjust for everything' is not a valid causal inference strategy; it can actively worsen estimates.
Question 4 True / False
The backdoor criterion identifies adjustment sets that block all non-causal paths from exposure to outcome while leaving causal paths intact.
TTrue
FFalse
Answer: True
The backdoor criterion requires: (1) no variable in the adjustment set is a descendant of the exposure, and (2) the set blocks every backdoor path — every non-causal path entering the exposure node from behind (indicating a common cause). When a set satisfies these criteria, adjusting for it gives the causal effect without blocking causal paths or opening collider paths. Multiple valid adjustment sets may exist for the same DAG, and the DAG helps you choose the most practical one.
Question 5 Short Answer
What makes a collider different from a confounder in a DAG, and why does conditioning on a collider cause bias rather than remove it?
Think about your answer, then reveal below.
Model answer: A confounder has arrows pointing OUT to both exposure and outcome (a common cause), creating an open non-causal path that conditioning blocks. A collider has arrows pointing IN from both exposure and outcome (or their ancestors) — a common effect. No path passes through a collider by default; the path is already closed. Conditioning on a collider opens this previously closed path, creating a spurious association between its causes. The intuition: among patients selected by being hospitalized (a collider of disease and drug exposure), knowing a patient wasn't given the drug makes the disease more likely — producing a spurious drug-disease correlation that doesn't exist in the full population.
Collider bias is the hardest DAG concept for students to internalize because conditioning normally removes associations rather than creating them. The key is that for confounders the path is open and conditioning closes it; for colliders the path is closed and conditioning opens it — exactly the opposite.