Questions: First-Difference Estimator for Panel Data
5 questions to test your understanding
Score: 0 / 5
Question 1 Multiple Choice
A researcher has two-period panel data and worries that workers with higher innate ability earn more AND are more likely to get promoted (an omitted variable). The first-difference estimator removes this bias because:
AIt controls for time-varying confounders by averaging across periods
BAbility is the same value for the same person in both periods, so it cancels when you subtract period 1 from period 2
CThe differenced equation includes ability as an explicit control variable
DTaking differences increases sample size, reducing bias from outliers
The individual fixed effect αᵢ (here, ability) appears identically in both period equations. When you subtract: Yᵢ₂ − Yᵢ₁ = (αᵢ − αᵢ) + β(Xᵢ₂ − Xᵢ₁) + (εᵢ₂ − εᵢ₁). The αᵢ terms cancel exactly — a person's ability does not change between periods. This is within-unit identification: comparing each person to themselves. Note that FD does NOT remove time-varying confounders; those remain in Δεᵢ.
Question 2 Multiple Choice
With T=10 periods and serially uncorrelated errors, which estimator is generally preferred over first-differences?
AFirst-differences, because it creates more observations by using T−1 differences
BThe within (demeaning) estimator, because it uses all T observations and is more efficient
CPooled OLS, because panel structure is only needed when errors are correlated
DFirst-differences and within are always equivalent with T > 2 periods
With uncorrelated errors, the within estimator that demeans each unit around its time average uses all T observations per unit and is more statistically efficient than FD, which uses only T−1 differences and discards level information. However, when errors follow a random walk, FD produces white-noise differenced errors while within errors become correlated — reversing the efficiency ranking. The choice depends on the error structure.
Question 3 True / False
The first-difference estimator eliminates most sources of omitted variable bias, not just bias from time-invariant confounders.
TTrue
FFalse
Answer: False
FD removes bias from time-invariant omitted variables (captured by αᵢ) because they are identical in both periods and cancel in the difference. Time-varying omitted variables — confounders that change between periods and are correlated with ΔX — survive differencing and remain in Δεᵢ. For example, if workers who got training also received simultaneous wage subsidies, that subsidy change is a time-varying confounder FD cannot eliminate.
Question 4 True / False
In a two-period panel, the first-difference estimator and the within (demeaning) estimator produce numerically identical coefficient estimates.
TTrue
FFalse
Answer: True
With exactly T=2 periods, demeaning a unit around its two-period mean is algebraically equivalent to taking the first difference — both reduce to the same calculation. The equivalence breaks down with T > 2 because FD uses T−1 differences while within demeaning uses all T observations around a unit mean.
Question 5 Short Answer
Why does the first-difference estimator become imprecise when the outcome variable is highly persistent (changes very little from period to period)?
Think about your answer, then reveal below.
Model answer: FD identifies β from ΔYᵢ = βΔXᵢ + Δεᵢ — regression of outcome changes on predictor changes. If Y barely moves between periods, most ΔYᵢ values cluster near zero and there is very little variation in the dependent variable to identify β. The signal-to-noise ratio collapses: the small systematic movements in ΔY are swamped by even modest noise in Δε. A highly persistent outcome means the within-unit changes that FD relies on are too small to be reliably measured.
This structural weakness motivates GMM-based panel estimators like Arellano-Bond, which use lagged levels as instruments for the differenced equation — recovering the level variation that FD discards.