Fairness in Machine Learning Theory

Research Depth 74 in the knowledge graph I know this Set as goal
fairness bias discrimination algorithmic-accountability ml-ethics

Core Idea

Fairness in machine learning addresses how to build algorithms that treat individuals equitably and do not unfairly discriminate based on sensitive attributes (race, gender, age, etc.). Fairness definitions are formalized through competing metrics: demographic parity (equal representation across groups), equalized odds (equal error rates across groups), calibration (predictions equally reliable across groups), individual fairness (similar individuals treated similarly), and causal fairness (controlling for discrimination via causal graphs). Trade-offs between fairness and accuracy, and between different fairness notions, require careful analysis. Achieving fairness requires understanding sources of bias (data bias, model bias, deployment bias) and applying pre-processing, in-processing, or post-processing mitigation techniques.

Explainer

Fairness in machine learning is both a technical and societal challenge: how do we build algorithms that make equitable decisions without systematically disadvantaging groups based on protected attributes (race, gender, age, etc.)? The challenge is technical because fairness is not a single well-defined concept but a collection of competing definitions, each appropriate in different contexts and mathematically incompatible with others.

Fairness Definitions: Practitioners have formalized multiple fairness notions:

1. Demographic Parity: P(pred=1 | group=A) = P(pred=1 | group=B). The positive prediction rate is equal across groups. Appropriate when outcomes should be independent of group membership. Problematic when legitimate outcome differences exist.

2. Equalized Odds (Equal Opportunity): FPR and FNR are equal across groups: P(pred=1 | Y=0, group=A) = P(pred=1 | Y=0, group=B) and P(pred=0 | Y=1, group=A) = P(pred=0 | Y=1, group=B). Errors affect groups equally. Appropriate when you want equal error rates regardless of outcome distribution.

3. Calibration: P(Y=1 | pred=1, group=A) = P(Y=1 | pred=1, group=B). Predictions are equally reliable/well-calibrated across groups. Appropriate for decision-making where you need reliable confidence estimates.

4. Individual Fairness: Similar individuals (on relevant attributes) are treated similarly, regardless of group membership. Requires defining a similarity metric and ensures consistency. Appropriate when arbitrary discrimination is the concern.

5. Causal Fairness: Control for direct causal discrimination while allowing indirect effects. Use causal graphs to distinguish fair (due to qualifications) vs. unfair (due to bias) outcome differences. Appropriate when you want to identify and eliminate discriminatory causation.

The Fairness-Accuracy Trade-off: Enforcing fairness often reduces overall accuracy. For example, demographic parity might require predicting more positive outcomes for an under-represented group even if the model is less confident, raising false positive rates. This trade-off is unavoidable in many settings, and practitioners must decide which is more important: overall accuracy or fair distribution of errors/decisions.

Incompatibility Results: Different fairness definitions cannot be simultaneously satisfied when base rates differ. For example, with different outcome rates across groups, satisfying both demographic parity and calibration is impossible. Similarly, equalized odds and demographic parity are generally incompatible unless outcome rates are identical across groups. This impossibility means choosing a fairness definition requires understanding its implications.

Sources of Bias:

1. Data Bias: Training data reflects historical discrimination (e.g., hiring data where women were historically under-hired). The model learns to replicate this discrimination.

2. Model Bias: The model's capacity and structure can introduce disparities (e.g., a model optimized for overall accuracy may perform worse for under-represented groups).

3. Deployment Bias: How the model is used in practice can create disparities (e.g., a model is applied differently across groups, or feedback loops reinforce historical biases).

Mitigation Approaches:

1. Pre-processing: Clean and rebalance training data to remove historical bias before model training.

2. In-processing: Modify the learning objective to incorporate fairness constraints during training (adversarial debiasing, fair regularization, constrained optimization).

3. Post-processing: Adjust model predictions or thresholds to satisfy fairness constraints after training (threshold optimization per group).

4. Causal Approaches: Use causal graphs to identify and control for discriminatory pathways while allowing legitimate outcome differences (e.g., education level can affect hiring decisions, but race cannot).

Theoretical Results:

Practical Implementation:

Challenges and Open Questions:

Emerging Directions:

Fairness in ML is not a solved problem but an active, multidisciplinary challenge combining machine learning, causal inference, ethics, and policy. Practitioners must recognize that technical solutions alone are insufficient; fairness requires ongoing dialogue with stakeholders and commitment to equitable outcomes.

Practice Questions 4 questions

Prerequisite Chain

Counting to 10Counting to 20Understanding ZeroThe Number ZeroCounting to FiveOne-to-One CorrespondenceCombining Small Groups Within 5Addition Within 10Addition Within 20Two-Digit Addition Without RegroupingTwo-Digit Addition with RegroupingAddition Within 100Repeated Addition as MultiplicationMultiplication Facts Within 100Division as Equal SharingDivision as Grouping (Measurement Division)Division: Grouping (Repeated Subtraction) ModelDivision: Fair Sharing ModelDivision as Equal SharingDivision as GroupingBasic Division FactsDivision Facts Within 100Two-Digit by One-Digit DivisionDivision with RemaindersRemainders and Quotients in DivisionDivision Word ProblemsIntroduction to Long DivisionFactors and MultiplesPrime and Composite NumbersEquivalent FractionsRelating Fractions and DecimalsDecimal Place ValueReading and Writing DecimalsComparing and Ordering DecimalsAdding and Subtracting DecimalsMultiplying DecimalsDividing DecimalsDividing FractionsMixed Number ArithmeticOrder of OperationsInteger Order of OperationsVariable ExpressionsCombining Like TermsOne-Step EquationsTwo-Step EquationsSolving Multi-Step EquationsEquations with Variables on Both SidesAngle Pairs: Complementary, Supplementary, and VerticalParallel Lines and TransversalsCorresponding AnglesAlternate Interior AnglesTriangle Angle Sum TheoremExterior Angle TheoremTriangle Inequality TheoremSimilar Triangles: AA SimilaritySimilar Triangles: SSS and SAS SimilarityProportions in Similar TrianglesRight Triangle Trigonometry IntroductionTrigonometric Ratios ReviewRadian MeasureConverting Between Degrees and RadiansThe Unit CircleGraphing Sine and CosineGraphing Tangent and Reciprocal Trigonometric FunctionsDerivatives of Trigonometric FunctionsAntiderivativesIndefinite IntegralsBasic Integration RulesRiemann SumsDefinite Integral DefinitionProbability Density Functions and Continuous DistributionsBayesian Statistics: Prior, Posterior, Credible IntervalsIntroduction to Bayesian InferenceCausal Inference in Machine LearningFairness in Machine Learning Theory

Longest path: 75 steps · 443 total prerequisite topics

Prerequisites (2)

Leads To (0)

No topics depend on this one yet.