A formal framework separates AI that amplifies human judgment from AI that progressively usurps it, proposing four operational metrics (CAI*, D, HRI, HCDR) to diagnose whether hybrid performance is sustainable; it warns that maximizing short-term capability can undermine long-term human expertise and calls for a cognitive sustainability constraint in system design.

Cognitive Amplification vs Cognitive Delegation in Human-AI Systems: A Metric Framework

Eduardo Di Santi · March 19, 2026

arxiv theoretical n/a evidence 7/10 relevance Source PDF

The paper introduces a formal framework and four operational metrics to distinguish cognitive amplification—AI that enhances human performance while preserving expertise—from cognitive delegation, and argues for designing human-AI systems under a cognitive sustainability constraint to avoid long-term erosion of human competence.

Artificial intelligence is increasingly embedded in human decision-making, where it can either enhance human reasoning or induce excessive cognitive dependence. This paper introduces a conceptual and mathematical framework for distinguishing cognitive amplification, in which AI improves hybrid human-AI performance while preserving human expertise, from cognitive delegation, in which reasoning is progressively outsourced to AI systems. To characterize these regimes, we define a set of operational metrics: the Cognitive Amplification Index (CAI*), the Dependency Ratio (D), the Human Reliance Index (HRI), and the Human Cognitive Drift Rate (HCDR). Together, these quantities provide a low-dimensional metric space for evaluating not only whether human-AI systems achieve genuine synergistic performance, but also whether such performance is cognitively sustainable for the human component over time. The framework highlights a central design tension in human-AI systems: maximizing short-term hybrid capability does not necessarily preserve long-term human cognitive competence. We therefore argue that human-AI systems should be designed under a cognitive sustainability constraint, such that gains in hybrid performance do not come at the cost of degradation in human expertise.

Summary

Main Finding

The paper proposes an operational metric framework to distinguish two regimes of human–AI interaction: cognitive amplification (AI improves hybrid performance while preserving or enhancing human expertise) versus cognitive delegation (humans progressively outsource reasoning to AI, degrading human expertise). It introduces a low-dimensional state space—principally the Cognitive Amplification Index (CAI*), Dependency Ratio (D) (and its complement Human Reliance Index HRI), and the Human Cognitive Drift Rate (HCDR)—and argues human–AI systems should be designed under a cognitive-sustainability constraint (maximize hybrid capability subject to HCDR ≥ 0).

Key Points

Conceptual distinction
- Cognitive amplification: AI and human produce synergistic gains; human expertise preserved or improved.
- Cognitive delegation: AI-dominated workflows produce short-run gains but can erode human ability over time.
Core metrics (operational definitions)
- QH = human-only capability (task/domain-specific)
- QA = AI-only capability
- QHA = hybrid human+AI capability
- Cognitive Amplification Index:
  - CAI* = (QHA − max(QH, QA)) / max(QH, QA)
  - CAI* > 0 indicates genuine hybrid synergy over the best component.
- Dependency Ratio:
  - D = QA / QHA
  - High D (close to 1) indicates AI-dominated hybrid performance.
- Human Reliance Index: HRI = 1 − D = (QHA − QA) / QHA
- Human Cognitive Drift Rate:
  - HCDR = (QH(t2) − QH(t1)) / (t2 − t1)
  - HCDR ≥ 0: cognition preserved or improved; HCDR < 0: cognitive atrophy (delegation).
Regime mapping
- Use (D, CAI*) to map hybrid states into human-dominant, balanced, or AI-dominated regimes; HCDR determines long-run sustainability.
Design recommendations to favor amplification and avoid delegation
- Interfaces that force user hypothesis generation or explanation, present calibrated uncertainty and alternatives, support exploration, and embed periodic AI-off assessment blocks.
- Instrumentation/telemetry to continuously estimate CAI*, D, HCDR.
Illustrative examples (diagnostics, pharmacovigilance) show cases with CAI* > 0 but high D, implying short-run hybrid gains yet risk of long-term human skill loss.
Normative proposal: optimize QHA subject to HCDR ≥ 0 (cognitive sustainability constraint).

Data & Methods

Nature of contribution: conceptual + formal metric framework. No large-scale empirical validation in the paper; several controlled illustrative calculations are provided.
Mathematical model components
- Idealized synergy model (conceptual): QHA = QH + QA + α QH QA, with α > 0 representing interaction strength (not directly empirical).
- Practical operationalization centers on measurable task performance Q (accuracy, recall, F1, error rates, time-to-solution) within a defined domain.
Measurement strategy proposed
- Estimate QH, QA, QHA with task-level metrics in controlled experiments or field telemetry.
- Compute CAI*, D, HRI directly from observed Q values.
- Track QH periodically in "AI-off" blocks to measure HCDR over time.
Illustrative numeric examples
- Example: QH = 0.70, QA = 0.80, QHA = 0.92 ⇒ CAI* ≈ 0.15; D ≈ 0.87; HRI ≈ 0.13. Interpretation: strong hybrid gain but AI-dominated dependence.
Assumptions and limitations
- Q is domain- and task-specific; no single universal intelligence metric.
- Thresholds for D/HRI (e.g., D > 0.8 indicates AI-dominance) are operational, not universal constants.
- HCDR requires longitudinal measurement and controlled AI-off assessments; susceptible to confounders (selection, learning-by-doing, task heterogeneity).
- The α-synergy term is conceptual and may not capture complex interaction dynamics (complementarities, substitution, nonlinearities) in practice.
- Empirical validation and causal identification (distinguish substitution vs. skill accumulation) remain for future work.

Implications for AI Economics

Human capital dynamics
- HCDR formalizes a potential depreciation process for task-specific human cognitive capital when AI is heavily used. Economists should treat AI adoption not only as a productivity shock but also as a possible negative professional-skill externality.
- Short-run productivity gains (higher QHA) can mask long-run declines in human skill supply, affecting wages, promotion ladders, and the returns to training.
Task allocation and comparative advantage
- CAI and D provide measurable inputs to models of task reallocation: high CAI with low D supports complementary reallocation; high CAI* with high D signals automation with potential human obsolescence.
- Firms choosing between automation and augmentation face trade-offs between immediate output and maintaining in-house expertise (a private vs social optimum divergence if worker skill loss imposes external costs).
Firm investment and adoption incentives
- If AI adoption creates private short-term gains but causes long-run human skill erosion (negative HCDR), firms may underinvest in skill-preserving designs (scaffolded interfaces, training, AI-off assessments) absent regulation or appropriate incentives.
- Pricing of AI services and contracting may need to account for the implicit depreciation of human capital; long-term service contracts could internalize maintenance of human capability.
Labor market and distributional effects
- Persistent delegation regimes can reduce demand for high-skill labor in certain tasks, compressing wages and changing occupational task mixes; conversely, amplification regimes can raise effective productivity and complement higher-level tasks.
- Monitoring CAI*/HCDR across firms/sectors could help predict shifts in skill demand and occupational upgrading/deskilling patterns.
Regulation and policy design
- The proposed cognitive-sustainability constraint (maximize QHA s.t. HCDR ≥ 0) suggests regulatory interventions: require periodic human competency checks, mandate human-in-the-loop designs that preserve skill, or require telemetry and reporting of D/CAI*/HCDR in safety-critical industries.
- Procurement and certification standards for AI in regulated sectors could embed minimum HRI or maximum D thresholds and mandatory AI-off evaluation protocols.
Measurement and empirical research agenda for economists
- Incorporate CAI*, D, HRI, HCDR into empirical studies: link AI adoption to worker performance trajectories, wages, training investments, and firm-level productivity.
- Use randomized or quasi-experimental designs (e.g., phased AI rollouts, mandated AI-off assessments) to identify causal effects on QH(t) and longer-run labor market outcomes.
- Build datasets tracking assisted vs unassisted performance, interface designs, and telemetry to estimate skill depreciation parameters and to value long-run trade-offs.
Systemic and welfare considerations
- Market failures: individual firms may not internalize systemic risk from widespread skill loss (e.g., reduced resilience in crises when AI is unavailable), implying grounds for collective action or regulation.
- Social welfare analysis should compare short-run efficiency gains to long-run losses in human capital, error-detection capacity, and resilience—especially important in safety-critical domains.
Practical recommendations for economists advising firms or policymakers
- Require monitoring of CAI*, D, and periodic measurement of HCDR in deployments that affect critical tasks.
- Incentivize interface designs and training that increase HRI and produce non-negative HCDR (subsidies, procurement preferences, or regulation).
- Evaluate AI adoption not only by immediate productivity metrics but by modeled impacts on future human-skill stocks and labor market outcomes.

Summary recommendation: Treat AI adoption decisions and policy design as dynamic trade-offs between immediate hybrid productivity (QHA) and the preservation of human cognitive capital (HCDR). The paper’s metrics (CAI*, D, HRI, HCDR) offer a compact, operational toolkit for economists to measure, model, and regulate these trade-offs empirically.

Assessment

Paper Typetheoretical Evidence Strengthn/a — The paper is a conceptual and mathematical framework without empirical testing or causal inference; it does not provide data-driven estimates or identification of causal effects. Methods Rigormedium — The paper offers a formalized set of operational metrics and a clear distinction between cognitive amplification and delegation, showing logical coherence; however, it lacks empirical calibration, robustness checks, and validation on real-world tasks or field settings, which limits assessment of practical applicability. SampleNo empirical sample or observational/experimental data; the work develops theoretical definitions (Cognitive Amplification Index CAI*, Dependency Ratio D, Human Reliance Index HRI, Human Cognitive Drift Rate HCDR) and analyzes their implications conceptually and mathematically (possible illustrative examples but no systematic empirical dataset). Themeshuman_ai_collab skills_training productivity org_design GeneralizabilityNo empirical validation — applicability to real-world tasks and domains is untested, Metric behavior depends on model assumptions and parameterization which may not hold across contexts, Does not address heterogeneity across users, tasks, industries, or skill levels, Ignores institutional and organizational constraints (incentives, regulation, training) that affect deployment, May not capture long-term dynamics outside modeled assumptions (learning, complementarities, multi-agent interactions)

Claims (6)

Claim	Direction	Confidence	Outcome	Details
Artificial intelligence embedded in human decision-making can either enhance human reasoning or induce excessive cognitive dependence. Decision Quality	mixed	high	human reasoning quality / cognitive dependence	0.06
The paper introduces a conceptual and mathematical framework to distinguish cognitive amplification (AI improves hybrid human-AI performance while preserving human expertise) from cognitive delegation (reasoning is progressively outsourced to AI). Automation Exposure	positive	high	mode of human-AI interaction (amplification vs delegation)	0.2
The paper defines a set of operational metrics: the Cognitive Amplification Index (CAI*), the Dependency Ratio (D), the Human Reliance Index (HRI), and the Human Cognitive Drift Rate (HCDR). Automation Exposure	positive	high	operational metrics for human-AI cognitive interaction (CAI*, D, HRI, HCDR)	0.2
Together, these quantities provide a low-dimensional metric space for evaluating whether human-AI systems achieve genuine synergistic performance and whether such performance is cognitively sustainable for the human component over time. Decision Quality	positive	high	hybrid human-AI performance and cognitive sustainability	0.12
There is a central design tension in human-AI systems: maximizing short-term hybrid capability does not necessarily preserve long-term human cognitive competence. Skill Obsolescence	negative	high	long-term human cognitive competence	0.06
Human-AI systems should be designed under a cognitive sustainability constraint so that gains in hybrid performance do not come at the cost of degradation in human expertise. Skill Obsolescence	positive	high	preservation of human expertise under human-AI design choices	0.02