Questions: Marginal Effects and Partial Effects Measurement
5 questions to test your understanding
Score: 0 / 5
Question 1 Multiple Choice
A researcher estimates a probit model of employment and reports the coefficient on 'college degree' (a binary variable) as 0.8. What does this number directly represent?
AThe probability that a college graduate is employed compared to a non-graduate
BThe percentage-point increase in employment probability for college graduates
CThe change in the latent index (log-odds scale) for a college graduate versus a non-graduate
DThe average marginal effect of college education, interpretable as an 8-percentage-point increase
In a probit model, coefficients are on the scale of the latent index (the argument of the normal CDF), not on the probability scale. A coefficient of 0.8 means the index increases by 0.8 for college graduates — which corresponds to some increase in probability that depends on where on the CDF you are evaluating. To get the probability-scale effect, you need to compute a marginal effect: multiply by the normal density φ(Xβ) at the relevant covariate values.
Question 2 Multiple Choice
Researchers compute the marginal effect at the mean (MEM) for a binary gender variable (sample mean ≈ 0.52) in a logit model. A colleague argues they should use the average marginal effect (AME) instead. What is the colleague's strongest argument?
AAME is computationally simpler because it requires only one evaluation of the model
BMEM requires evaluating at the median rather than the mean, making it systematically biased
CNo individual in the sample has gender = 0.52, so evaluating a nonlinear function at this non-existent point can produce a misleading estimate
DAME and MEM always produce identical estimates, so MEM is redundant and AME is the conventional standard
The MEM evaluates the predicted effect at the sample mean of all covariates. For a binary variable like gender, the sample mean (0.52) corresponds to no real individual. Evaluating a nonlinear function (like the logistic CDF) at a fictional average point differs from the average of evaluating it at each real individual — Jensen's inequality tells us f(E[X]) ≠ E[f(X)] when f is nonlinear. The AME computes effects at each actual observation and then averages, avoiding this conceptual problem.
Question 3 True / False
In a linear regression model, the coefficient on a variable is the marginal effect on the outcome. The same interpretation applies to coefficients in a logit regression.
TTrue
FFalse
Answer: False
In linear regression, the coefficient is directly the marginal effect on the outcome because the relationship is linear. In logit (and probit, Poisson, etc.), the coefficient is on an internal transformed scale — log-odds for logit, the latent index for probit, log-count for Poisson. The marginal effect on the probability (or count) requires computing the derivative of the predicted outcome with respect to the covariate, which involves the derivative of the link function and varies by observation.
Question 4 True / False
For a binary regressor in a logit model, the average marginal effect can be estimated by computing each individual's difference in predicted probabilities when the regressor switches from 0 to 1, then averaging those differences across the sample.
TTrue
FFalse
Answer: True
This 'recycled predictions' approach is the standard way to compute AME for discrete variables. For each observation, you compute two predicted probabilities — one with the binary variable set to 0, one set to 1 — and take the difference. Averaging these individual differences gives the AME. It correctly handles the heterogeneity in marginal effects that arises from observations at different points on the logistic curve.
Question 5 Short Answer
Why is the average marginal effect (AME) generally preferred over the marginal effect at the mean (MEM) in applied work with nonlinear models? Explain the conceptual difference.
Think about your answer, then reveal below.
Model answer: The AME computes the marginal effect for each actual observation in the sample, then averages across real individuals. The MEM evaluates the marginal effect at a hypothetical 'average individual' constructed from sample means. In nonlinear models, these differ because the marginal effect depends on the covariate values — people near the middle of the response curve have larger marginal effects than those near the extremes. By averaging over actual people, AME correctly captures this heterogeneity. MEM relies on a non-existent 'average person,' which can be a misleading summary point when covariates are binary or highly skewed.
The key mathematical issue is Jensen's inequality: for a nonlinear function g, E[g(X)] ≠ g(E[X]). MEM computes g(E[X]); AME computes E[g(X)]. AME is the more interpretable quantity for most policy questions because it represents 'the average effect on the actual population' rather than 'the effect on a hypothetical average person.'