A per-action insurance layer can make autonomous agents economically accountable: the authors derive a time-consistent counterfactual risk toll and prove no-splitting within underwriting boundaries, identify a strictly positive irreversible-authority premium, and show how high-probability toll envelopes yield executed-action budget guarantees—providing the mathematical base for operational runtime underwriting.

Foundations of a Time-Consistent Counterfactual Actuarial Runtime for Autonomous AI Agents

Hao-Hsuan Chen · May 26, 2026

arxiv theoretical n/a evidence 8/10 relevance Source PDF

This paper builds a formal actuarial layer that prices time-consistent, counterfactual per-action risk tolls for autonomous agents and proves structural results (no-splitting, irreversible-authority premium, and a runtime budget guarantee) enabling pre-action insurance and runtime gating.

We propose a foundational runtime actuarial layer for autonomous AI agents in which every side-effect-bearing action carries a time-consistent, counterfactual risk toll computed against a contractually fixed safe default, inside an explicit underwriting boundary. The framework treats per-action insurance as the primary unit of analysis and replaces post-hoc annual liability cover with a pre-action transaction layer. The paper establishes four structural results: (i) a well-defined counterfactual toll under a chosen safe-default mapping and continuation policy, with explicit non-uniqueness; (ii) a no-splitting property within an underwriting boundary that telescopes path-decomposed actions into a boundary potential, with a corollary tying gaming-resistance to boundary design; (iii) an irreversible-authority premium, split into a strictly positive action-level component and an if-and-only-if characterisation of the set-level robust capital increase; and (iv) a conservative runtime gating theorem that translates high-probability toll envelopes into an executed-action budget guarantee. The result is the mathematical base layer for a broader program: an empirical companion instantiates the runtime through an Actuarial Action Interface and authority-frontier experiments; a mechanism-design companion studies strategic operator incentives and cross-boundary aggregation; and a dynamic-underwriting companion studies experience rating and audit-replay calibration. The present paper states the primitive contract, the toll identity, the within-boundary no-arbitrage result, and the budget guarantee on which those later layers depend.

Summary

Main Finding

The paper introduces a foundational actuarial runtime layer for autonomous AI agents that prices every side-effect-bearing action via a time-consistent counterfactual “toll” relative to a contractually fixed safe default. Four structural results are proved: (1) a well-defined time-consistent counterfactual toll under fixed primitives; (2) a no-splitting telescoping identity inside an underwriting boundary (so splitting actions within the boundary does not lower total toll); (3) an irreversible-authority premium showing irreversibility implies a strictly positive action-level price and characterizing when adding an action raises set-level robust capital; and (4) a conservative runtime gating theorem that converts high-probability upper envelopes on tolls into a budget guarantee for executed actions.

Key Points

Primitives and object of pricing
- The priced object is a side-effect-bearing action (external effects, irreversible within contract thresholds).
- Each priced action is compared counterfactually to a contract-specified minimal-authority safe default (chosen ex ante).
- Underwriting boundaries (units of aggregation like legal entity, time window, asset class) group actions and expose observable cumulative exposure EB_t.
Time-consistent counterfactual toll
- Toll ct(a | ht; π+, ρ, a0) = ρt(Ldo(a),π+) − ρt(Ldo(a0),π+), where ρ is a recursive, time-consistent dynamic risk mapping and π+ is a fixed continuation policy.
- Under mild technical assumptions (interventional well-posedness; σt mappings that are convex, monotone, local, translation-invariant), the toll is Ft-measurable and bounded.
- No uniqueness: different decompositions of overall risk are possible; the toll is well-defined only after fixing the primitives (a0, π+, ρ, B, model class M).
No-splitting inside an underwriting boundary
- Define a monotone boundary potential ΦB(·) and per-action boundary toll λt = ΦB(EBt + ΔEBt(at)) − ΦB(EBt).
- Telescoping identity: sum of λt over a sequence equals ΦB(EB_T) − ΦB(EB_0). Thus, splitting the same cumulative exposure inside the same boundary does not change total toll.
- This result relies on the conditional loss law depending on within-boundary history only through EBt and an Ft-measurable ξt (Assumption 10). Cross-boundary or path-dependent loopholes must be handled by boundary design.
Irreversible-authority premium
- Define action-level premium Δt(a+ | ht) as the worst-case (over model class M) positive incremental ρ-risk of a+ vs its safe default.
- If there exists a model witness where a+ creates an uncompensable irreversible tail gap (tail gap, irreversibility, and strict ρ-monotonicity), then Δt(a+ | ht) > 0.
- The set-level robust capital Kt(U | ht) increases when adding a+ iff supM ρM_t(Ldo(a+),π+) exceeds the former set-level supremum; hence marginal price and total capital effects are distinct.
Conservative runtime gating and budget guarantee
- Computing exact ct for every action may be infeasible; instead use a conservative nonnegative upper envelope ¯ct(at | ht) (from simulation, quantile models, or distributionally robust bounds).
- Under a high-probability validity assumption for the envelope (Assumption 16), a budgeted gate that admits actions only if ¯ct ≤ current budget B_t guarantees, with probability ≥1−δ, that total realized positive tolls on executed actions do not exceed initial budget B0.
- Implementation notes: split conformal methods provide envelopes under exchangeability; adaptive deployment needs online/adaptive conformal methods or audit-replay calibration.
Program architecture and companions
- This paper is the mathematical accounting layer (Paper A). Companion papers implement the Actuarial Action Interface and experiments (Paper B), study mechanism-design/incentives, and study dynamic underwriting and experience rating.

Data & Methods

Nature of the work: formal theoretical paper proving structural results for an actuarial runtime accounting system. No primary observational dataset; empirical implementation is deferred to a companion.
Model components and assumptions:
- Finite-horizon filtered probability setup; histories ht, actions at ∈ At(ht).
- Side-effect-bearing action defined by interventional change in external state sext over admissible models M and irreversibility criteria (cost/time thresholds).
- Interventional distributions Ldo(a),π+ obtained from sandbox simulators, causal world models, or off-policy estimators (Assumption 3: well-posedness).
- Dynamic risk: one-step conditional mappings σt : Lp(Ft+1) → Lp(Ft) satisfying normalization, monotonicity, locality, translation invariance, convexity (Assumption 4). Entropic risk mapping is used as a canonical example.
- Underwriting boundary B with observable cumulative exposure EB_t and increments ΔEB_t(at); boundary determinacy (Assumption 10) requires conditional loss given Ft to depend on within-boundary history only through EBt and ξt.
- Robustness via model class M and suprema over M for set-level capital and action premiums.
- Conservative envelope ¯ct calibrated with high-probability coverage (Assumption 16), with practical calibration via conformal prediction (split or online/adaptive variants depending on adaptivity).
Methods:
- Formal definitions and proofs (sketches in paper) using backward induction for time-consistency, telescoping sums for no-splitting identity, worst-case supremum arguments for irreversibility premium, and elementary budget recursion plus probabilistic coverage for the gating guarantee.
- Companion empirical methods (Paper B) include Actuarial Action Interface, sandbox replay, Postgres/LangChain stress tests, and an LLM underwriting panel (not re-reported here).

Implications for AI Economics

New pricing primitive: shifts liability/insurance from annual ex-post cover to per-action pre-execution tolls. This creates a commodity: per-action actuarial pricing of externalities.
Internalization of marginal/external risk:
- Irreversible and tail-exposure actions acquire explicit marginal prices relative to safe defaults, which can change operator choice of actions and reduce deployment of high-tail-risk actions when priced appropriately.
- The separation between action-level premium and set-level capital highlights how marginal and total capital requirements differ—crucial for product design and regulatory capital rules.
Market and product design consequences:
- Enables new runtime insurance/escrow products and an actuarial market around live agent actions (per-action underwriting, conditional reserves, rolling exposures).
- Underwriting boundary design becomes an important market and regulatory lever: contracts must define boundaries and ξt to prevent arbitrage (splitting, proxy agents, relabelling).
- Mechanism design and contract clauses (audit rules, aggregations, related-party rules) will be necessary to avoid cross-boundary gaming and to ensure incentive compatibility.
Operational and economic trade-offs:
- Transaction costs and latency: per-action tolling and gating will introduce cognitive or latency costs, escalation to humans, and possible operational friction—trade-offs between safety and throughput.
- Behavioral responses: agents or operators may favor safe defaults, escalate, or re-route actions to cheaper boundaries unless contracts close loopholes; careful design is needed to avoid socially undesirable refusal or avoidance.
- Capital allocation and reserve dynamics: dynamic underwriting companions are required to specify experience-rating, reserve updates, and credibility-weighted premiums; firms will need new reserve accounting for action-level exposures.
Regulatory and governance impacts:
- Regulators could mandate or standardize primitives (safe defaults, boundary definitions, admissible model classes) to ensure comparability, prevent regulatory arbitrage, and set minimum capital.
- Auditability and telemetry become key economic goods—accuracy of exposure observability and replayability affects pricing and capital.
Limitations and open issues (economically relevant):
- Dependence on contract-specified primitives (safe default, continuation policy, boundary) means pricing and outcomes are not unique; institutional choices matter and will shape market structure.
- Calibration and closed-loop validity: ensuring high-probability envelopes under adaptive deployment is nontrivial; miscalibration risks underpricing exposure.
- Cross-boundary aggregation and systemic effects: per-action pricing could shift risks across entities or create correlated failures; aggregation rules and systemic-capital considerations are not solved here.
- Computational and informational constraints: constructing interventional distributions and robust envelopes at scale is expensive; affects feasibility and frictions in market adoption.
Research and policy directions
- Empirical validation of behavioral and welfare effects (Paper B and beyond).
- Mechanism-design work to ensure incentive compatibility and prevent rent-seeking.
- Dynamic-underwriting research for reserve dynamics, learning of ΦB and envelope calibration.
- Possible regulatory standards for primitives (safe defaults, admissible model classes, boundary observability) to reduce heterogeneity and systemic arbitrage.

Summary take-away: The paper provides a rigorous mathematical base for pricing and gating side-effect-bearing actions in autonomous agents via time-consistent counterfactual tolls, shows how within-boundary aggregation prevents splitting arbitrage, identifies when irreversible authority carries a positive premium, and gives a practical conservative gating guarantee — all of which enable a new class of per-action actuarial products and governance tools, while leaving important empirical, incentive-design, and calibration challenges for companion work.

Assessment

Paper Typetheoretical Evidence Strengthn/a — The paper is purely theoretical and presents formal structural results and theorems rather than empirical tests; there is no causal identification or empirical evidence to rate. Methods Rigorhigh — The work presents explicit mathematical definitions, structural theorems (toll identity, no-splitting/no-arbitrage, irreversible-authority premium, and a runtime budget guarantee) and clearly-stated primitives, indicating formal rigor; however, rigor is limited to the chosen formal model and assumptions without empirical or computational validation. SampleNo empirical sample; the paper develops a formal mathematical model of autonomous agents, actions, underwriting boundaries, a contractually fixed safe-default mapping, and continuation policies to derive per-action tolls and related structural results. Themesgovernance org_design GeneralizabilityNo empirical validation or calibration to real-world agent behavior or losses, Relies on model primitives (safe-default mapping, continuation policy, underwriting boundary) that may be hard to specify or observe in practice, Computational tractability and measurement of per-action tolls in complex, high-dimensional environments is not demonstrated, Assumes contractual and institutional enforcement that may vary across legal and regulatory contexts, Extensions to multi-agent strategic settings or large-scale deployed systems are deferred to companion work

Claims (9)

Claim	Direction	Confidence	Outcome	Details
We propose a foundational runtime actuarial layer for autonomous AI agents in which every side-effect-bearing action carries a time-consistent, counterfactual risk toll computed against a contractually fixed safe default, inside an explicit underwriting boundary. Governance And Regulation	positive	high	existence of a runtime actuarial layer assigning counterfactual risk tolls per action	0.12
The framework treats per-action insurance as the primary unit of analysis and replaces post-hoc annual liability cover with a pre-action transaction layer. Governance And Regulation	positive	high	shift from annual liability models to per-action pre-action insurance (design/operational modality)	0.12
(i) There exists a well-defined counterfactual toll under a chosen safe-default mapping and continuation policy. Governance And Regulation	positive	high	well-definedness/existence of a counterfactual toll	0.12
(i, continued) The counterfactual toll has explicit non-uniqueness (i.e., non-uniqueness of the toll is demonstrated). Governance And Regulation	null_result	high	non-uniqueness property of the counterfactual toll	0.12
(ii) A no-splitting property holds within an underwriting boundary that telescopes path-decomposed actions into a boundary potential. Governance And Regulation	positive	high	no-splitting aggregation property (telescoping into boundary potential)	0.12
(ii, corollary) Gaming-resistance of the system is tied to the design of the underwriting boundary (i.e., a corollary linking gaming-resistance to boundary design). Governance And Regulation	positive	high	relationship between underwriting-boundary design and resistance to gaming/manipulation	0.12
(iii) An irreversible-authority premium is characterized and splits into a strictly positive action-level component plus an if-and-only-if characterization of the set-level robust capital increase. Governance And Regulation	positive	high	irreversible-authority premium decomposition and positivity of action-level component; iff characterization of robust capital increase	0.12
(iv) A conservative runtime gating theorem translates high-probability toll envelopes into an executed-action budget guarantee. Governance And Regulation	positive	high	budget guarantee on executed actions derived from probabilistic toll envelopes	0.12
The present paper states the primitive contract, the toll identity, the within-boundary no-arbitrage result, and the budget guarantee that the later empirical, mechanism-design, and dynamic-underwriting companion papers depend on. Governance And Regulation	positive	high	presence/statement of specific formal primitives and theorems (primitive contract, toll identity, no-arbitrage, budget guarantee) in the paper	0.12