Fairness in machine learning addresses how to build algorithms that treat individuals equitably and do not unfairly discriminate based on sensitive attributes (race, gender, age, etc.). Fairness definitions are formalized through competing metrics: demographic parity (equal representation across groups), equalized odds (equal error rates across groups), calibration (predictions equally reliable across groups), individual fairness (similar individuals treated similarly), and causal fairness (controlling for discrimination via causal graphs). Trade-offs between fairness and accuracy, and between different fairness notions, require careful analysis. Achieving fairness requires understanding sources of bias (data bias, model bias, deployment bias) and applying pre-processing, in-processing, or post-processing mitigation techniques.
Fairness in machine learning is both a technical and societal challenge: how do we build algorithms that make equitable decisions without systematically disadvantaging groups based on protected attributes (race, gender, age, etc.)? The challenge is technical because fairness is not a single well-defined concept but a collection of competing definitions, each appropriate in different contexts and mathematically incompatible with others.
Fairness Definitions: Practitioners have formalized multiple fairness notions:
1. Demographic Parity: P(pred=1 | group=A) = P(pred=1 | group=B). The positive prediction rate is equal across groups. Appropriate when outcomes should be independent of group membership. Problematic when legitimate outcome differences exist.
2. Equalized Odds (Equal Opportunity): FPR and FNR are equal across groups: P(pred=1 | Y=0, group=A) = P(pred=1 | Y=0, group=B) and P(pred=0 | Y=1, group=A) = P(pred=0 | Y=1, group=B). Errors affect groups equally. Appropriate when you want equal error rates regardless of outcome distribution.
3. Calibration: P(Y=1 | pred=1, group=A) = P(Y=1 | pred=1, group=B). Predictions are equally reliable/well-calibrated across groups. Appropriate for decision-making where you need reliable confidence estimates.
4. Individual Fairness: Similar individuals (on relevant attributes) are treated similarly, regardless of group membership. Requires defining a similarity metric and ensures consistency. Appropriate when arbitrary discrimination is the concern.
5. Causal Fairness: Control for direct causal discrimination while allowing indirect effects. Use causal graphs to distinguish fair (due to qualifications) vs. unfair (due to bias) outcome differences. Appropriate when you want to identify and eliminate discriminatory causation.
The Fairness-Accuracy Trade-off: Enforcing fairness often reduces overall accuracy. For example, demographic parity might require predicting more positive outcomes for an under-represented group even if the model is less confident, raising false positive rates. This trade-off is unavoidable in many settings, and practitioners must decide which is more important: overall accuracy or fair distribution of errors/decisions.
Incompatibility Results: Different fairness definitions cannot be simultaneously satisfied when base rates differ. For example, with different outcome rates across groups, satisfying both demographic parity and calibration is impossible. Similarly, equalized odds and demographic parity are generally incompatible unless outcome rates are identical across groups. This impossibility means choosing a fairness definition requires understanding its implications.
Sources of Bias:
1. Data Bias: Training data reflects historical discrimination (e.g., hiring data where women were historically under-hired). The model learns to replicate this discrimination.
2. Model Bias: The model's capacity and structure can introduce disparities (e.g., a model optimized for overall accuracy may perform worse for under-represented groups).
3. Deployment Bias: How the model is used in practice can create disparities (e.g., a model is applied differently across groups, or feedback loops reinforce historical biases).
Mitigation Approaches:
1. Pre-processing: Clean and rebalance training data to remove historical bias before model training.
2. In-processing: Modify the learning objective to incorporate fairness constraints during training (adversarial debiasing, fair regularization, constrained optimization).
3. Post-processing: Adjust model predictions or thresholds to satisfy fairness constraints after training (threshold optimization per group).
4. Causal Approaches: Use causal graphs to identify and control for discriminatory pathways while allowing legitimate outcome differences (e.g., education level can affect hiring decisions, but race cannot).
Theoretical Results:
Practical Implementation:
Challenges and Open Questions:
Emerging Directions:
Fairness in ML is not a solved problem but an active, multidisciplinary challenge combining machine learning, causal inference, ethics, and policy. Practitioners must recognize that technical solutions alone are insufficient; fairness requires ongoing dialogue with stakeholders and commitment to equitable outcomes.
No topics depend on this one yet.