Static fairness rules can worsen long-term inequality while creating high short-term costs, but modest investment policies can erase disparities at low utility loss; the paper shows analytically that fair-seeming constraints may backfire over time and that dynamic interventions offer a cheaper path to equality.
Algorithmic decision-making in high-stakes settings can have profound impacts on individuals and populations. While much prior work studies fairness in static settings, recent results show that enforcing static fairness constraints may exacerbate long-run disparities. Motivated by this tension, we study a stylized sequential selection problem in which a decision-maker repeatedly selects individuals, affecting both immediate utility and the population distribution over time. We introduce notions of group fairness for both the short and long term and theoretically analyze the trade-off between fairness and utility via the Price of Fairness (PoF). We characterize optimal and fair policies in the short term and show that the PoF can be large even when group distributions are nearly identical. In contrast, we show that long-term disparities can vanish under simple investment policies that achieve a low PoF. We also empirically validate these theoretical observations using both synthetic and real datasets.
Summary
Main Finding
In a stylized sequential selection model (e.g., lending), imposing a short-term, mean-based group fairness constraint can force arbitrarily large utility loss (Price of Fairness, PoF ≈ 1) even when group score distributions are very similar. Optimal (utility-maximizing) and fair short-term policies are threshold policies. Crucially, the severe fairness–utility trade-off is largely a short-horizon artifact: in a multi-step (long-term) setting, simple “investment” policies that prioritize improving scores for disadvantaged groups can drive disparities to vanish while incurring only a small PoF. These theoretical results are supported by simulations on synthetic data and a real FICO credit-score dataset.
Key Points
- Model: sequential selection over individuals with one-dimensional scores x ∈ X, two groups A and B with distributions DA, DB, selection probabilities πg(x), success probability p(x) increasing in x, utility U+, U− and score updates C+, C−.
- Fairness notion: bound the post-deployment difference in group mean scores (δ′ ≤ α). This is a state-based (score) fairness notion aimed at restorative/substantive equality.
- Score categories: C1 (profitable & improving), C2 (extractive), C3 (investment candidates), C4 (unprofitable & degrading). Under Assumption 2, extractive category C2 is empty.
- Short-term results:
- Optimal (utility-maximizing) single-step policy is a group-agnostic threshold: select all scores with nonnegative expected utility (Proposition 1).
- An optimal α-fair single-step policy (when feasible) can be implemented by per-group thresholding (Theorem 1).
- Price of Fairness (PoF) is defined as worst-case fractional utility loss: PoF = max_I [1 − Fair-OPT(I)/OPT(I)].
- PoF can be arbitrarily close to 1 as α → 0 in the single-step setting (Theorem 2): enforcing tight mean-equality can almost fully eliminate attainable utility in worst-case instances.
- Even restricting to “non-degrading” (rational) policies, high PoF can occur; the bad worst cases can be constructed even when DA and DB are very close in total variation (Theorem 3).
- Long-term (multi-step) results:
- When decisions affect future score distributions, a class of simple investment policies (prioritize allocations that improve scores of disadvantaged groups) can guarantee low PoF and can eliminate long-run disparities under reasonable assumptions.
- Thus, long-horizon intervention design can reconcile fairness and utility far better than one-shot fairness constraints.
- Empirical validation: synthetic experiments and experiments on real-world FICO score data confirm theoretical patterns (high short-term PoF, low long-term PoF under investment policies).
Data & Methods
- Formal model:
- Two groups A, B with weights wA, wB and score distributions DA, DB over X (often X = [0, 1] or integer scores).
- Selection policy πg(x) ∈ [0, 1]; utility per selected individual E[u(x)] = p(x)U+ + (1 − p(x))U−; expected score change E[∆(x)] = p(x)C+ + (1 − p(x))C−.
- Important assumptions: p(x) monotone increasing (Assumption 1); U+/U− ≥ C+/C− (Assumption 2) to rule out profitable extraction that harms population; further assumptions for multi-step technical results (discrete scores, stability of profitable candidates, group advantage not due to fewer unprofitable individuals).
- Theoretical approach:
- Classification of scores into four categories (C1–C4); proof that optimal policies are threshold-based; characterization of optimal fair policies.
- Price of Fairness (PoF) defined as worst-case fractional utility loss across instances.
- Construction of worst-case instances to prove high PoF in single-step (Theorem 2) and even when distributions are close (Theorem 3).
- Analysis of multi-step stochastic dynamics: define policies over T rounds, study “investment” policies and show they can control PoF and convergence of group means.
- Empirical methods:
- Simulations on synthetic instances designed to illustrate constructed worst cases and typical regimes.
- Experiments using real FICO credit-score datasets to show practical relevance and to verify the advantage of long-term investment policies.
- Proofs and technical details are provided (propositions, theorems, appendices) to support claims.
Implications for AI Economics
- Short-run fairness constraints can be very costly: regulators or platforms that enforce one-shot equality-in-outcome constraints (e.g., equal mean scores immediately after allocation) may impose large welfare losses, and in worst cases almost eliminate private utility. Economists should therefore evaluate fairness interventions with explicit attention to time horizon.
- Dynamic policies and investment view reduce the fairness–utility trade-off: treating fairness as a dynamic, restorative process (investing in disadvantaged groups so their state improves) can reconcile equity and efficiency. This suggests policy instruments (subsidies, targeted investments, credit-building programs) that alter transition dynamics are often superior to blunt one-period quotas.
- Measurement matters: state-based fairness (equalizing underlying score distributions) ties fairness to future opportunities and avoids some negative feedback loops that outcome-based fairness constraints can create. For economic modelling, specifying what state variables AI systems alter (wealth, credit history, human capital) is crucial.
- Platform and firm incentives: absent commitment or multi-period planning, profit-maximizing firms may prefer myopic policies that maximize short-term utility and exacerbate disparities. Regulatory design should consider mechanisms that align private incentives with long-run social equality (e.g., reward structures for improving borrower scores).
- Welfare and social planning: PoF as a metric is useful to quantify trade-offs, but social planners need to weigh private utility loss against social benefits from reduced disparities. Discounting and intertemporal welfare aggregation change optimal regulatory choices—longer societal horizons favor investment-based fairness.
- Research and policy agenda:
- Empirically calibrate dynamic transition models in real markets (credit, hiring, education) to quantify realistic PoFs and benefits of investment policies.
- Extend to multiple groups, strategic behavior, signaling, and competition between firms to study equilibrium effects.
- Study alternative fairness metrics (variance, tail risks, multi-dimensional states) and their dynamic properties.
- Design implementable policy tools (contracts, subsidies, regulation of scoring systems) to encourage long-run restorative interventions.
- Bottom-line takeaway for AI economists and policymakers: evaluate fairness constraints in dynamic contexts—short-horizon fairness may be expensive or counterproductive, but appropriately designed longer-horizon investment interventions can achieve equity with little efficiency loss.
Assessment
Claims (7)
| Claim | Direction | Confidence | Outcome | Details |
|---|---|---|---|---|
| Enforcing static fairness constraints may exacerbate long-run disparities. Inequality | negative | high | long-run disparities between groups |
0.12
|
| We introduce notions of group fairness for both the short and long term. Other | neutral | high | definitions of group fairness (short-term, long-term) |
0.06
|
| We theoretically analyze the trade-off between fairness and utility via the Price of Fairness (PoF). Decision Quality | neutral | high | trade-off between utility (decision-maker objective) and fairness constraints (PoF) |
0.12
|
| We characterize optimal and fair policies in the short term. Decision Quality | neutral | high | policy optimality under short-term fairness constraints |
0.12
|
| The Price of Fairness can be large even when group distributions are nearly identical. Decision Quality | negative | high | utility loss due to fairness constraints (PoF) |
0.12
|
| Long-term disparities can vanish under simple investment policies that achieve a low Price of Fairness. Inequality | positive | high | long-term group disparity (population distribution convergence) and PoF |
0.12
|
| We empirically validate these theoretical observations using both synthetic and real datasets. Decision Quality | mixed | high | empirical consistency of theoretical findings (PoF behavior and long-term disparity dynamics) on datasets |
0.06
|