Questions: G-Estimation and Structural Nested Models
5 questions to test your understanding
Score: 0 / 5
Question 1 Multiple Choice
A researcher studies the effect of antiretroviral therapy (ART) on mortality in HIV patients. ART dosing is adjusted over time based on CD4 count, which is also an independent predictor of mortality. The researcher adjusts for CD4 count in a standard regression of mortality on ART dose. What is the primary problem with this approach?
ACD4 count is a mediator, so it must be excluded from any regression model
BAdjusting for CD4 count creates collider stratification bias: it blocks the confounding path from CD4 to ART, but simultaneously opens a backdoor path through prior ART that distorts the effect estimate
CStandard regression cannot handle continuous exposures like ART dose
DCD4 count is not a valid confounder because it is measured after treatment begins
The classic failure of standard regression with time-varying confounders: CD4 is a confounder of the ART-mortality relationship at time t (must be controlled), but it is also a mediator of prior ART's effect (controlling for it blocks part of the causal pathway and opens a backdoor through past ART). Option 0 is wrong — mediation doesn't automatically call for exclusion. G-methods were developed specifically to handle this impasse.
Question 2 Multiple Choice
In G-estimation with a structural nested model, how is the causal parameter ψ identified from observed data?
ABy regressing the outcome on exposure and all measured covariates in a single multivariable model
BBy finding the value of ψ that makes the 'de-treated' potential outcome independent of the observed exposure, conditional on the covariate history
CBy matching treated and untreated individuals on all baseline characteristics
DBy inverting the propensity score to create an exposure-weighted pseudo-population
The structural nested model specifies a mapping from observed Y to Y₀ (the outcome under no treatment) as a function of ψ. G-estimation finds the ψ where Y₀ is uncorrelated with exposure conditional on past covariates — the independence condition that identifies the causal effect. Options 0 and 2 describe standard regression and matching, which fail in this setting. Option 3 describes inverse probability weighting, a different g-method.
Question 3 True / False
G-estimation can estimate the causal effect of a time-varying treatment even when a time-varying covariate is simultaneously a confounder of the current exposure-outcome relationship and a consequence of prior exposure.
TTrue
FFalse
Answer: True
This is precisely what G-estimation was designed for. By parameterizing the counterfactual directly (the structural nested model) and solving for the ψ that achieves the independence condition, G-estimation avoids the need to adjust for the problematic covariate in a regression — sidestepping the collider stratification bias that standard adjustment would introduce.
Question 4 True / False
In the presence of time-varying confounders, including most measured covariates in a standard multivariable regression at each time point is a valid strategy for estimating the causal effect of a time-varying treatment.
TTrue
FFalse
Answer: False
This is the central misconception. When a time-varying covariate is both a confounder of the current treatment effect AND a consequence of prior treatment, adjusting for it in standard regression blocks part of the causal pathway and opens collider-induced backdoor paths. G-methods (G-estimation, marginal structural models, G-computation) were developed to handle this structural problem, which cannot be solved by simply 'adding more covariates' to a regression.
Question 5 Short Answer
Why does time-varying confounding that is also time-varying mediation break standard regression-based confounding control, and what is the key move G-estimation makes to work around this?
Think about your answer, then reveal below.
Model answer: Standard regression forces a choice: adjust for the covariate (controls confounding but blocks the causal path and induces collider bias) or don't adjust (leaves confounding uncontrolled). G-estimation avoids this by not adjusting for the covariate in a regression at all. Instead, it writes a structural nested model for the counterfactual outcome under no treatment (Y₀ = Y − ψ·A), then finds the ψ such that Y₀ is independent of A given past covariate history — using the covariate only in a propensity model for A, not in a direct outcome regression.
The key insight is that G-estimation separates the two uses of the covariate: it models the covariate's relationship to exposure (propensity) to achieve the independence condition, without conditioning on it as a predictor of outcome. This breaks the vicious cycle that standard regression cannot escape.