The Commonplace
Home Dashboard Papers Evidence Syntheses Digests 🎲
← Papers

A formal impossibility: when combined human-AI autonomy exceeds a calculable threshold, responsibility cannot be fully and legitimately assigned to people without reducing system autonomy; simulations on 3,000 synthetic collectives confirm a sharp phase transition.

The Accountability Horizon: An Impossibility Theorem for Governing Human-Agent Collectives
Haileleol Tibebu · April 09, 2026
arxiv theoretical medium evidence 8/10 relevance Source PDF
The paper proves that beyond a computable autonomy threshold for human-AI collectives, no accountability framework can simultaneously meet four minimal legitimacy properties, implying inherent limits on assigning responsibility unless autonomy is curtailed.

Existing accountability frameworks for AI systems, legal, ethical, and regulatory, rest on a shared assumption: for any consequential outcome, at least one identifiable person had enough involvement and foresight to bear meaningful responsibility. This paper proves that agentic AI systems violate this assumption not as an engineering limitation but as a mathematical necessity once autonomy exceeds a computable threshold. We introduce Human-Agent Collectives, a formalisation of joint human-AI systems where agents are modelled as state-policy tuples within a shared structural causal model. Autonomy is characterised through a four-dimensional information-theoretic profile (epistemic, executive, evaluative, social); collective behaviour through interaction graphs and joint action spaces. We axiomatise legitimate accountability through four minimal properties: Attributability (responsibility requires causal contribution), Foreseeability Bound (responsibility cannot exceed predictive capacity), Non-Vacuity (at least one agent bears non-trivial responsibility), and Completeness (all responsibility must be fully allocated). Our central result, the Accountability Incompleteness Theorem, proves that for any collective whose compound autonomy exceeds the Accountability Horizon and whose interaction graph contains a human-AI feedback cycle, no framework can satisfy all four properties simultaneously. The impossibility is structural: transparency, audits, and oversight cannot resolve it without reducing autonomy. Below the threshold, legitimate frameworks exist, establishing a sharp phase transition. Experiments on 3,000 synthetic collectives confirm all predictions with zero violations. This is the first impossibility result in AI governance, establishing a formal boundary below which current paradigms remain valid and above which distributed accountability mechanisms become necessary.

Summary

Main Finding

The paper proves a formal impossibility: in a mathematically specified class of Human-Agent Collectives (HACs), no single-locus accountability framework can simultaneously satisfy four minimal legal/philosophical axioms (Attributability, Foreseeability Bound, Non‑Vacuity, Completeness) whenever the collective (i) contains a mixed feedback cycle involving both humans and artificial agents and (ii) its minimum compound autonomy exceeds a computable threshold (the "Accountability Horizon") ˆΛ* = 1 − 1/|C_min|. Below that threshold, legitimate single-locus accountability frameworks exist; above it, the impossibility is structural — transparency, audits, or oversight cannot restore the four axioms without reducing agent autonomy.

Key Points

  • Formalisation: The paper introduces Human-Agent Collectives (HACs) as an SCM (structural causal model) of joint human-AI systems. Agents (human or artificial) are state-policy tuples with explicit observation spaces; interactions are described by an interaction graph and joint action/outcome mapping.
  • Four-dimensional autonomy profile: Each artificial agent has a computable autonomy vector α = (αE, αX, αD, αS):
    • Epistemic (αE): informational independence of agent beliefs from supervisors;
    • Executive (αX): propensity to act outside human approval;
    • Evaluative (αD): divergence of agent utility from human utility;
    • Social (αS): degree of unsupervised inter-agent initiation. Aggregate autonomy A(a) = w⊤α(a) with positive weights w.
  • Mixture model assumption: Artificial agent policy is a convex mixture of a human-aligned component and an autonomous component; executive autonomy αX is the mixture weight.
  • Accountability axioms (single-locus accountability):
  • Attributability — responsibility must be causally grounded (NESS-style).
  • Foreseeability Bound — an agent’s responsibility cannot exceed its predictive/epistemic capacity.
  • Non‑Vacuity — at least one agent must bear non-trivial responsibility.
  • Completeness — total responsibility for an outcome must be exhaustively allocated across agents.
  • Accountability Incompleteness Theorem: If the minimum compound autonomy (product of executive and epistemic autonomy for agents in a minimal mixed feedback cycle) exceeds ˆΛ*, then no allocation mechanism can satisfy all four axioms simultaneously. The structural source is distributed, feedback-rich autonomy; shifting information or transparency does not eliminate the gap unless autonomy is reduced.
  • Phase transition: The theorem establishes a sharp boundary — below ˆΛ* feasible single-locus allocations exist; above it they do not.
  • Empirical validation: Computational experiments on 3,000 synthetic HACs matched theoretical predictions with zero violations under the model.
  • Robustness: Result tolerates approximate (TV-bounded) deviations from exact mixture structure; choices of positive weight vectors w do not affect the theorem; human agents are modelled in maximally favourable way (weakening human model only strengthens the result).

Data & Methods

  • Formal model:
    • Environment E = (Ω, O, T) with shared and exogenous observation components.
    • Agents a = (S, A, Ω_i, δ, ω) where S = internal states, A = actions, Ω_i = observations, δ = state transitions, ω = policy.
    • Collective SCM M = (U, V, F, P(U)) following Pearl: endogenous variables include agent states/actions and outcome O; exogenous include environment state and noise.
  • Autonomy metrics: Information‑theoretic definitions (mutual information and entropy) for epistemic autonomy; probability-based definition for executive autonomy; normalized utility distance for evaluative autonomy; communication counts for social autonomy. Compound autonomy used multiplicatively for executive × epistemic (αX·αE).
  • Structural assumption: Agent policies adhere (exactly or approximately) to a convex mixture of human-aligned policy π_h and autonomous policy π_g that uses private information.
  • Accountability axiomatization: Four axioms motivated by legal/philosophical literature (NESS causal tests, foreseeability in tort/moral responsibility, non-triviality, and exhaustiveness).
  • Mathematical results: Lemmas and the main Accountability Incompleteness Theorem proved in the SCM framework; corollaries characterize accountability residuals and existence of allocations below horizon.
  • Computational validation: 3,000 synthetic HACs sampled under the model; tests confirmed theoretical boundary and phase transition; reported zero empirical violations.
  • Supplementary results: Propositions on δ-robustness (total variation perturbations) and sensitivity of the Accountability Horizon to modelling noise.

Implications for AI Economics

  • Regulatory boundary and calibration:
    • The paper provides a computable criterion (Accountability Horizon ˆΛ*) that regulators can use to distinguish systems where traditional single-locus liability is structurally feasible from systems where it is impossible. This enables risk-tiering and more precisely targeted obligations (e.g., require reduced autonomy or distributed governance above the horizon).
  • Firm incentives and product design:
    • Firms choosing to deploy agentic, feedback-rich multi-agent systems face an autonomy–accountability trade-off with economic consequences: maintaining high autonomy may remove the possibility of assigning liability to a single actor, increasing legal uncertainty, reputational risk, and transaction costs (insurance, contract complexity). Firms will need to decide whether to reduce autonomy (incurring performance costs) or adopt new governance/contractual structures (coalitions, joint‑and‑several liability, pooled insurance).
  • Liability, insurance, and financial markets:
    • Traditional liability markets and insurers price risk assuming accountable counterparties. Above the Accountability Horizon, residual unallocatable responsibility implies either (i) higher premiums, (ii) new insurance products for collective/coalition liability, or (iii) market failure for some applications. Insurers, investors, and corporate risk managers must factor in whether deployed systems cross the horizon.
  • Market structure and competition:
    • Smaller suppliers or firms that deliberately constrain autonomy to remain below ˆΛ can credibly offer accountable products at potentially lower legal/regulatory cost. Conversely, firms that push autonomy above ˆΛ may capture performance gains but face higher governance costs and regulatory scrutiny, which can lead to concentration if incumbents better absorb such costs or create proprietary distributed-governance institutions.
  • Social cost of autonomy vs. innovation:
    • The theorem quantifies a structural externality: high-autonomy networks can create accountability voids that impose social costs (uncompensated harms, uncertainty). Policymakers can use this to weigh innovation benefits against governance externalities and design incentives (subsidies for accountable design, taxes on opaque autonomy).
  • Policy design recommendations (economic levers):
    • Mandate design limits or verifiable contracts that keep effective αX·αE below threshold for high‑risk applications.
    • Encourage polycentric/distributed accountability (coalitions, joint liability, statutory collective responsibility regimes) and standardize mechanisms for collective allocation of responsibility.
    • Develop new insurance primitives and disclosure requirements for collective risk (e.g., mandatory registries of interaction graphs, minimal observability standards).
    • Use the Accountability Horizon as a quantitative metric for regulatory classification (risk tiers) and as a compliance target businesses can audit.
  • Research & market implications:
    • Need for economic models of contracting in the presence of non‑allocable responsibility (how do firms contract, share revenue and liability when some harms cannot be singularly attributed?)
    • Potential growth of third-party governance markets: firms offering governance-as-a-service (distributed accountability solutions, coalition formation, monitoring contracts) to manage exposure above the horizon.
    • Implications for valuation and M&A: acquiring a firm whose systems sit above ˆΛ* carries additional governance liabilities; diligence should include autonomy-profile audits.

Limitations to bear in mind - The impossibility is proved inside a specific formal model and depends on modelling choices (SCM, autonomy metrics, mixture structure, completeness axiom). Stakeholders may dispute normative axioms (e.g., rejecting Completeness). - Threshold ˆΛ* depends on |C_min|, the minimal size of a mixed human–AI feedback cycle; practical application requires mapping real-system topology into the model and estimating autonomy metrics. - The human agents in the analysis are modelled favourably; relaxing that assumption generally strengthens the impossibility (i.e., makes governance harder), but real-world complexities (continuous time, learning dynamics, legal institutions) require applied calibration.

Bottom line The paper gives AI economics a formal, computable boundary — the Accountability Horizon — that separates regimes where traditional single‑actor accountability is feasible from regimes where it is provably impossible. This has direct consequences for regulation design, firm strategy, insurance markets, and the economics of product design and governance infrastructure.

Assessment

Paper Typetheoretical Evidence Strengthmedium — The central impossibility result is a rigorous mathematical proof, and the authors systematically confirm its predictions in large-scale simulations; however, applicability to real-world AI systems depends on how well the paper's formal definitions (autonomy profile, accountability horizon, agent modelling, interaction graphs) map onto actual technical, organisational, and behavioral realities—there is no empirical validation on deployed systems or field data. Methods Rigorhigh — The paper provides an explicit formalisation with clear axioms, a proved theorem (Accountability Incompleteness Theorem), and controlled simulation experiments that test predictions across thousands of synthetic instances, showing internal consistency and robustness; limitations stem from modelling choices rather than execution quality. Sample3,000 synthetic human-agent collectives generated according to the paper's formal model: agents represented as state-policy tuples inside a shared structural causal model, with varied four-dimensional autonomy profiles (epistemic, executive, evaluative, social), interaction graphs (including human-AI feedback cycles), and joint action spaces; no real-world organisational or empirical datasets used. Themesgovernance human_ai_collab org_design IdentificationA formal axiomatic/mathematical proof (theorem) within a constructed model of Human-Agent Collectives, supplemented by simulation experiments on 3,000 synthetic collectives to validate theoretical predictions; no causal identification using observational or experimental real-world data. GeneralizabilityRelies on formal axioms and specific definitions of autonomy and accountability which may not capture all real-world complexities of human decision-making and institutions, Validation limited to synthetic simulations; no evidence from deployed AI systems, organisations, or legal cases, Mapping of theoretical constructs (e.g., Accountability Horizon) to measurable quantities in practice is unspecified or model-dependent, Assumes agent behaviours and interaction structures that may differ from socio-technical realities (e.g., informal responsibility norms, organisational hierarchies), Does not account for legal, political, or institutional mechanisms that can allocate or absorb responsibility through collective, corporate, or regulatory forms

Claims (12)

ClaimDirectionConfidenceOutcomeDetails
Existing accountability frameworks for AI systems, legal, ethical, and regulatory, rest on a shared assumption: for any consequential outcome, at least one identifiable person had enough involvement and foresight to bear meaningful responsibility. Governance And Regulation positive high attributability of responsibility for consequential outcomes
0.12
Agentic AI systems violate the above shared accountability assumption not as an engineering limitation but as a mathematical necessity once autonomy exceeds a computable threshold. Governance And Regulation negative high possibility of assigning meaningful responsibility (attributability) under formalized conditions
0.2
We introduce Human-Agent Collectives, a formalisation of joint human-AI systems where agents are modelled as state-policy tuples within a shared structural causal model. Governance And Regulation positive high formal representation of joint human-AI systems
0.12
Autonomy is characterised through a four-dimensional information-theoretic profile (epistemic, executive, evaluative, social). Governance And Regulation neutral high measure/characterisation of agent autonomy
0.12
Collective behaviour is characterised through interaction graphs and joint action spaces. Governance And Regulation neutral high formal representation of collective behaviour
0.12
Legitimate accountability is axiomatized through four minimal properties: Attributability (responsibility requires causal contribution), Foreseeability Bound (responsibility cannot exceed predictive capacity), Non-Vacuity (at least one agent bears non-trivial responsibility), and Completeness (all responsibility must be fully allocated). Governance And Regulation neutral high formal criteria for legitimate accountability
0.12
Accountability Incompleteness Theorem: for any collective whose compound autonomy exceeds the Accountability Horizon and whose interaction graph contains a human-AI feedback cycle, no framework can satisfy all four accountability properties simultaneously. Governance And Regulation negative high existence of frameworks satisfying all four accountability properties
0.2
The impossibility is structural: transparency, audits, and oversight cannot resolve it without reducing autonomy. Governance And Regulation negative high effectiveness of transparency/audits/oversight in restoring accountability without autonomy reduction
0.2
Below the threshold (Accountability Horizon), legitimate frameworks exist, establishing a sharp phase transition between regimes where the four properties can and cannot be satisfied. Governance And Regulation positive high existence/non-existence of legitimate accountability frameworks as a function of compound autonomy
0.2
Experiments on 3,000 synthetic collectives confirm all predictions with zero violations. Governance And Regulation positive high number of violations of the theoretical predictions (violations of impossibility/conditions)
n=3000
zero violations
0.12
This is the first impossibility result in AI governance, establishing a formal boundary below which current paradigms remain valid and above which distributed accountability mechanisms become necessary. Governance And Regulation positive medium novelty (first impossibility result) and policy implication (necessity of distributed accountability mechanisms above threshold)
0.04
Above the Accountability Horizon, distributed accountability mechanisms become necessary. Governance And Regulation positive high necessity of distributed accountability mechanisms conditional on compound autonomy exceeding threshold
0.12

Notes