The Commonplace
Home Dashboard Papers Evidence Syntheses Digests 🎲
← Papers

More AI can sometimes make workers less productive: when workers underinvest in skill or cannot detect AI mistakes, higher AI assistance lowers output. Over time, uneven ability to adapt to AI errors causes skill polarization, letting AI-literate workers pull ahead.

Human-AI Productivity Paradoxes: Modeling the Interplay of Skill, Effort, and AI Assistance
Ali Aouad, Thodoris Lykouris, Huiying Zhong · May 12, 2026
arxiv theoretical n/a evidence 7/10 relevance Source PDF
A theoretical model shows that increased AI assistance can paradoxically reduce productivity when skill formation or AI reliability are endogenous, and that heterogeneity in AI literacy can produce long-run skill polarization.

Generative Artificial Intelligence (AI) tools are rapidly adopted in the workplace and in education, yet the empirical evidence on AI's impact remains mixed. We propose a model of human-AI interaction to better understand and analyze several mechanisms by which AI affects productivity. In our setup, human agents with varying skill levels exert utility-maximizing effort to produce certain task outcomes with AI assistance. We find that incorporating either endogeneity in skill development or in AI unreliability can induce a productivity paradox: increased levels of AI assistance may degrade productivity, leading to potentially significant shortfalls. Moreover, we examine the long-term distributional effect of AI on skill, and demonstrate that skill polarization can emerge in steady state when accounting for heterogeneity in AI literacy -- the agent's capability to identify and adapt to inaccurate AI outputs. Our results elucidate several mechanisms that may explain the emergence of human-AI productivity paradoxes and skill polarization, and identify simple measures that characterize when they arise.

Summary

Main Finding

The paper develops a parsimonious theoretical framework of human–AI production showing that, beyond the naïve substitution effect, two plausible mechanisms—endogenous skill dynamics and AI unreliability—can each generate a human–AI productivity paradox: increasing AI assistance can reduce steady‑state productivity and can produce arbitrarily large shortfalls. When agents differ in AI literacy (their ability to detect/adapt to AI errors), these mechanisms can also produce long‑run skill polarization (a multimodal skill distribution). The paper characterizes simple, interpretable conditions (a “sensitivity gap” for learning‑driven paradoxes and a curvature/absolute‑risk‑aversion condition for unreliability‑driven paradoxes) that determine when these effects arise.

Key Points

  • Basic model setup

    • Agents have skill s ≥ 0, choose effort e ≥ 0, and face AI assistance level a ≥ 0.
    • Production uses x = s + e + a and output p(x) where p is increasing, concave.
    • Effort costs are linear: cost = γ e. Agents maximize p(s+e+a) − γ e myopically.
    • In the static baseline, AI and skill substitute for effort: for fixed s, higher a reduces effort and raises (or weakly raises) realized productivity.
  • Productivity paradox from endogenous skill (Section 3)

    • Skill evolves via a birth–death chain where current effort influences future skill transitions (learning‑by‑doing).
    • When AI reduces effort, long‑run skill accumulation can fall enough that steady‑state productivity decreases with higher AI assistance.
    • The condition that determines whether the paradox occurs is a sensitivity gap: the relative strength of (i) how much effort contributes to future skill versus (ii) how much AI assistance directly raises current productivity. If effort’s role in future skills is large relative to AI’s direct productivity effect, increased AI can harm steady‑state productivity.
    • Theorem 1: formal existence/characterization of the paradox; Proposition 3.4: potential for arbitrarily large productivity shortfalls.
  • Productivity paradox from AI unreliability (Section 4)

    • AI assistance is modeled as stochastic: it sometimes produces proficient output and sometimes fails (hallucination/unreliability).
    • Agents remain myopic and reduce effort in response to expected AI help; the expected productivity can decline when AI proficiency improves but reliability (variance or chance of failure) remains.
    • The decisive factor is the curvature of p(x): under sufficiently accelerating diminishing returns (captured via a notion akin to absolute risk aversion of the production function), the indirect loss from reduced effort outweighs the expected direct gain from AI, producing a paradox.
    • Theorem 2 and Proposition 4.2: formal statement and possibility of arbitrarily large shortfalls.
  • Skill polarization via heterogeneous AI literacy (Section 5)

    • AI literacy: heterogeneity in agents’ ability to detect and correct AI errors (i.e., respond appropriately to unreliable AI).
    • With endogenous skill evolution and heterogeneous literacy, effort as a function of skill can become non‑monotonic. Small early advantages and better AI literacy lead some agents to exert more effort and accumulate more skill (Matthew‑type amplification), producing a multimodal steady‑state skill distribution (Theorem 3).
    • This generates persistent inequality in skills even when AI is uniform across the population.
  • Robustness and modeling assumptions

    • Results rely on three key modeling assumptions: additive substitutability of skill, effort, and AI; linear effort cost; and myopic (short‑sighted) agents.
    • Authors discuss that some paradoxes persist under convex effort costs (weaker form) and that the mechanisms can be interpreted as micro‑founded interactions (e.g., unreliability as a negative interaction between AI and effort).

Data & Methods

  • Nature of the work

    • Purely theoretical / analytical—not an empirical estimation paper. The paper builds stylized mathematical models to generate testable qualitative predictions.
  • Core mathematical objects

    • Agent utility: u(e, s, a) = p(s + e + a) − γ e.
    • Optimal effort e(s, a) solves the myopic maximization; realized productivity p(s, a) = p(s + e*(s, a) + a).
    • Critical input level x* defined by the largest maximizer of p(x) − γ x (point where marginal benefit equals marginal cost γ).
  • Methods and derivations

    • Comparative statics of the baseline model to show substitution effects (Proposition 2.1, Corollary 2.2).
    • Birth–death Markov chain to model skill dynamics driven by effort choices (analytical steady‑state characterization).
    • Stochastic model of AI outputs (proficient vs. failed) to compute expected productivity and analyze how changes in AI proficiency/reliability affect agent incentives and aggregate outcomes.
    • Analytical theorems and propositions that identify sufficient/necessary conditions (sensitivity gap; curvature/absolute‑risk‑aversion style condition) for paradoxes; proofs demonstrate cases where shortfalls can be arbitrarily large.
    • Analysis of heterogeneous AI literacy by introducing variation in agents’ ability to detect AI errors and deriving the steady‑state skill distribution.
  • No empirical dataset or simulations are central to the paper; the work links to and interprets existing empirical findings (RCTs and field studies) in light of the theoretical mechanisms.

Implications for AI Economics

  • Policy and organizational design

    • Deploy AI with complementary knowledge‑preservation policies: remedial training, periodic assessments, or mandated oversight to prevent skill atrophy where human skill matters long term.
    • Introduce deliberate usage frictions or monitoring that incentivize human verification and effort (e.g., require explanations, introduce checkpoints, or limit overly passive delegation) to preserve learning‑by‑doing.
    • Track not only mean AI performance (proficiency) but also reliability/variance; governance metrics should incorporate risk measures (probability of critical failures, variance of output quality).
  • Measurement and evaluation

    • Short-term productivity gains from AI can mask long‑run human capital losses. Empirical studies and firm evaluations should measure both immediate output and subsequent skill retention/formation to correctly estimate net benefits.
    • Randomized trials and panel studies should account for dynamic skill feedbacks (myopia and learning) and heterogeneous AI literacy to avoid biased estimates of AI’s long‑term productivity impact.
  • Distributional effects

    • Heterogeneous AI literacy can amplify inequality: AI may compress raw skill gaps short‑term but generate polarization long‑term through differential learning responses. Interventions targeting AI literacy (training in critical evaluation of AI outputs) can be an important equalizer.
  • Research directions

    • Empirically test the sensitivity‑gap and curvature conditions the paper identifies: measure how effort translates into future skill (learning elasticities) and how p(x) curvature interacts with reduced effort to predict paradoxes.
    • Extend to multi‑task environments and general equilibrium settings (wages, labor reallocation) to assess macroeconomic consequences.
    • Design and evaluate policy interventions (usage frictions, literacy programs, reliability‑aware deployment) in field experiments to validate theoretical counterfactuals.

Limitations to keep in mind: results are model‑driven and hinge on substitutability, linear cost, and myopic behavior; complementarities, forward‑looking agents, or different cost structures may change quantitative and some qualitative conclusions.

Assessment

Paper Typetheoretical Evidence Strengthn/a — The paper is a formal theoretical model and does not provide empirical or causal identification from data; therefore empirical evidence strength is not applicable. Methods Rigormedium — The paper develops an analytic model that identifies clear mechanisms (endogenous skill formation, AI unreliability, heterogenous AI literacy) and derives steady-state implications, which is appropriate for theory work; however, without calibration, robustness checks across alternative functional forms, or empirical validation, the practical relevance and quantitative implications remain uncertain. SampleNo empirical sample — an analytical model of heterogeneous agents who choose effort and develop skill under varying levels of AI assistance and AI unreliability; heterogeneity in 'AI literacy' determines agents' ability to detect/adapt to incorrect AI outputs. Themesproductivity skills_training human_ai_collab inequality GeneralizabilityResults depend on the model's functional-form assumptions and parameter values (e.g., how effort maps to skill, AI error distributions)., Abstract agent representation omits institutional, organizational, and incentive structures present in real workplaces., No empirical calibration or validation to particular occupations, tasks, or real-world AI systems limits external validity., Simplified treatment of learning and skill formation may not capture multi-dimensional skills, complementarities, or team interactions., Assumes specified forms of AI unreliability and agent detection ability; different error types or feedback mechanisms could change outcomes.

Claims (5)

ClaimDirectionConfidenceOutcomeDetails
Incorporating endogeneity in skill development into the model can induce a productivity paradox where increased AI assistance reduces productivity. Developer Productivity negative high agent productivity (task output) as a function of AI assistance and endogenous skill development
0.02
Introducing AI unreliability (errors/noise in AI outputs) in the model can also generate a productivity paradox: greater AI assistance may lower productivity. Developer Productivity negative high agent productivity (task output) as influenced by AI assistance and AI unreliability
0.02
Increased levels of AI assistance may degrade productivity, leading to potentially significant shortfalls under the model's identified conditions. Developer Productivity negative high expected task output / productivity shortfalls associated with increased AI assistance
0.02
Accounting for heterogeneity in AI literacy (agents' ability to identify and adapt to inaccurate AI outputs) can produce skill polarization in the long-run steady state. Skill Acquisition mixed high distribution of agent skill levels (skill polarization across population)
0.02
The model identifies simple measures/conditions that characterize when productivity paradoxes and skill polarization arise. Organizational Efficiency positive high predictive conditions/thresholds for productivity paradoxes and skill polarization
0.02

Notes