A per-action insurance layer can make autonomous agents economically accountable. The authors derive a time-consistent counterfactual risk toll, prove no-splitting within underwriting boundaries, identify a strictly positive irreversible-authority premium, and show how high-probability toll envelopes yield executed-action budget guarantees, providing the mathematical base for operational runtime underwriting.
We propose a foundational runtime actuarial layer for autonomous AI agents in which every side-effect-bearing action carries a time-consistent, counterfactual risk toll computed against a contractually fixed safe default, inside an explicit underwriting boundary. The framework treats per-action insurance as the primary unit of analysis and replaces post-hoc annual liability cover with a pre-action transaction layer. The paper establishes four structural results: (i) a well-defined counterfactual toll under a chosen safe-default mapping and continuation policy, with explicit non-uniqueness; (ii) a no-splitting property within an underwriting boundary that telescopes path-decomposed actions into a boundary potential, with a corollary tying gaming-resistance to boundary design; (iii) an irreversible-authority premium, split into a strictly positive action-level component and an if-and-only-if characterisation of the set-level robust capital increase; and (iv) a conservative runtime gating theorem that translates high-probability toll envelopes into an executed-action budget guarantee. The result is the mathematical base layer for a broader program: an empirical companion instantiates the runtime through an Actuarial Action Interface and authority-frontier experiments; a mechanism-design companion studies strategic operator incentives and cross-boundary aggregation; and a dynamic-underwriting companion studies experience rating and audit-replay calibration. The present paper states the primitive contract, the toll identity, the within-boundary no-arbitrage result, and the budget guarantee on which those later layers depend.
Summary
Main Finding
The paper introduces a foundational actuarial runtime layer for autonomous AI agents that prices every side-effect-bearing action via a time-consistent counterfactual “toll” relative to a contractually fixed safe default. Four structural results are proved: (1) a well-defined time-consistent counterfactual toll under fixed primitives; (2) a no-splitting telescoping identity inside an underwriting boundary (so splitting actions within the boundary does not lower total toll); (3) an irreversible-authority premium showing irreversibility implies a strictly positive action-level price and characterizing when adding an action raises set-level robust capital; and (4) a conservative runtime gating theorem that converts high-probability upper envelopes on tolls into a budget guarantee for executed actions.
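As a rough illustration of the counterfactual toll described above, the sketch below computes a one-step toll as the risk of an action minus the risk of its safe default. It assumes the entropic risk mapping, which the paper uses as its canonical example, applied to a static discrete loss law. The function names, the parameter `theta`, and the two loss laws are hypothetical and chosen only for illustration; the paper's toll is defined recursively over time and is not reproduced here.

```python
import math


def entropic_risk(losses, probs, theta=1.0):
    """Entropic risk (1/theta) * log E[exp(theta * L)] of a discrete loss law."""
    # Shift by the largest exponent (log-sum-exp) for numerical stability.
    shift = max(theta * loss for loss in losses)
    total = sum(p * math.exp(theta * loss - shift) for loss, p in zip(losses, probs))
    return (shift + math.log(total)) / theta


def counterfactual_toll(law_action, law_default, theta=1.0):
    """Risk under the action minus risk under the safe default (one-step sketch)."""
    return entropic_risk(*law_action, theta) - entropic_risk(*law_default, theta)


# Hypothetical loss laws, written as (losses, probabilities).
law_action = ([0.0, 10.0], [0.9, 0.1])   # action with a heavier tail
law_default = ([0.0, 1.0], [0.9, 0.1])   # safe default with a mild tail

toll = counterfactual_toll(law_action, law_default, theta=0.5)
print(f"toll = {toll:.4f}")
```

Because the entropic mapping is translation-invariant, adding the same constant loss to both laws leaves the toll unchanged, and comparing the safe default with itself gives a toll of zero.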
Key Points
- Primitives and object of pricing
  - The priced object is a side-effect-bearing action (external effects, irreversible within contract thresholds).
  - Each priced action is compared counterfactually to a contract-specified minimal-authority safe default (chosen ex ante).
  - Underwriting boundaries (units of aggregation such as legal entity, time window, or asset class) group actions and expose an observable cumulative exposure $E^B_t$.
- Time-consistent counterfactual toll
  - Toll: $c_t(a \mid h_t; \pi^+, \rho, a_0) = \rho_t(L^{\mathrm{do}(a),\pi^+}) - \rho_t(L^{\mathrm{do}(a_0),\pi^+})$, where $\rho$ is a recursive, time-consistent dynamic risk mapping and $\pi^+$ is a fixed continuation policy.
  - Under mild technical assumptions (interventional well-posedness; one-step mappings $\sigma_t$ that are convex, monotone, local, and translation-invariant), the toll is $\mathcal{F}_t$-measurable and bounded.
  - Non-uniqueness: different decompositions of overall risk are possible; the toll is well-defined only after fixing the primitives ($a_0$, $\pi^+$, $\rho$, $B$, and the model class $\mathcal{M}$).
- No-splitting inside an underwriting boundary
  - Define a monotone boundary potential $\Phi_B(\cdot)$ and the per-action boundary toll $\lambda_t = \Phi_B(E^B_t + \Delta E^B_t(a_t)) - \Phi_B(E^B_t)$.
  - Telescoping identity: $\sum_t \lambda_t = \Phi_B(E^B_T) - \Phi_B(E^B_0)$ over a sequence of actions. Thus, splitting the same cumulative exposure inside the same boundary does not change the total toll.
  - This result relies on the conditional loss law depending on within-boundary history only through $E^B_t$ and an $\mathcal{F}_t$-measurable $\xi_t$ (Assumption 10). Cross-boundary or path-dependent loopholes must be handled by boundary design.
- Irreversible-authority premium
  - Define the action-level premium $\Delta_t(a^+ \mid h_t)$ as the worst-case (over the model class $\mathcal{M}$) positive incremental $\rho$-risk of $a^+$ versus its safe default.
  - If there exists a model witness where $a^+$ creates an uncompensable irreversible tail gap (tail gap, irreversibility, and strict $\rho$-monotonicity), then $\Delta_t(a^+ \mid h_t) > 0$.
  - The set-level robust capital $K_t(U \mid h_t)$ increases when adding $a^+$ iff $\sup_{M \in \mathcal{M}} \rho^M_t(L^{\mathrm{do}(a^+),\pi^+})$ exceeds the former set-level supremum; hence marginal price and total capital effects are distinct.
- Conservative runtime gating and budget guarantee
  - Computing the exact $c_t$ for every action may be infeasible; instead use a conservative nonnegative upper envelope $\bar{c}_t(a_t \mid h_t)$ (from simulation, quantile models, or distributionally robust bounds).
  - Under a high-probability validity assumption for the envelope (Assumption 16), a budgeted gate that admits actions only if $\bar{c}_t \le B_t$, the current budget, guarantees with probability at least $1-\delta$ that the total realized positive tolls on executed actions do not exceed the initial budget $B_0$.
  - Implementation notes: split conformal methods provide envelopes under exchangeability; adaptive deployment needs online/adaptive conformal methods or audit-replay calibration.
- Program architecture and companions
  - This paper is the mathematical accounting layer (Paper A). Companion papers implement the Actuarial Action Interface and experiments (Paper B), study mechanism design and incentives, and study dynamic underwriting and experience rating.
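The no-splitting identity listed among the key points can be checked numerically with a small sketch. The quadratic `boundary_potential` below is a hypothetical stand-in for a monotone boundary potential (the paper does not specify a functional form), and the exposure increments are made-up numbers; exposure is assumed to accumulate additively and to stay nonnegative.

```python
def boundary_potential(exposure):
    """Hypothetical monotone potential on nonnegative exposure (illustration only)."""
    return 0.01 * exposure ** 2 + 0.1 * exposure


def total_boundary_toll(increments, initial_exposure=0.0):
    """Sum per-action tolls, each the potential after minus before that action."""
    exposure = initial_exposure
    total = 0.0
    for delta in increments:
        total += boundary_potential(exposure + delta) - boundary_potential(exposure)
        exposure += delta  # cumulative exposure inside the boundary
    return total, exposure


# The same cumulative exposure of 100, taken in one action or split into three.
one_shot_toll, one_shot_exposure = total_boundary_toll([100.0])
split_toll, split_exposure = total_boundary_toll([25.0, 25.0, 50.0])

print(one_shot_toll, split_toll)
```

Both paths end at the same cumulative exposure, so the summed tolls agree up to floating-point rounding. The sketch says nothing about splitting across different boundaries, which the summary notes must be handled by boundary design.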
Data & Methods
- Nature of the work: formal theoretical paper proving structural results for an actuarial runtime accounting system. No primary observational dataset; empirical implementation is deferred to a companion.
- Model components and assumptions:
  - Finite-horizon filtered probability setup; histories $h_t$, actions $a_t \in A_t(h_t)$.
  - A side-effect-bearing action is defined by an interventional change in the external state $s_{\mathrm{ext}}$ over admissible models $\mathcal{M}$, together with irreversibility criteria (cost/time thresholds).
  - Interventional distributions $L^{\mathrm{do}(a),\pi^+}$ are obtained from sandbox simulators, causal world models, or off-policy estimators (Assumption 3: well-posedness).
  - Dynamic risk: one-step conditional mappings $\sigma_t : L^p(\mathcal{F}_{t+1}) \to L^p(\mathcal{F}_t)$ satisfying normalization, monotonicity, locality, translation invariance, and convexity (Assumption 4). The entropic risk mapping is used as a canonical example.
  - Underwriting boundary $B$ with observable cumulative exposure $E^B_t$ and increments $\Delta E^B_t(a_t)$; boundary determinacy (Assumption 10) requires the conditional loss given $\mathcal{F}_t$ to depend on within-boundary history only through $E^B_t$ and $\xi_t$.
  - Robustness via the model class $\mathcal{M}$ and suprema over $\mathcal{M}$ for set-level capital and action premiums.
  - Conservative envelope $\bar{c}_t$ calibrated with high-probability coverage (Assumption 16), with practical calibration via conformal prediction (split or online/adaptive variants depending on adaptivity).
- Methods:
  - Formal definitions and proofs (sketches in the paper) using backward induction for time-consistency, telescoping sums for the no-splitting identity, worst-case supremum arguments for the irreversibility premium, and an elementary budget recursion plus probabilistic coverage for the gating guarantee.
  - Companion empirical methods (Paper B) include the Actuarial Action Interface, sandbox replay, Postgres/LangChain stress tests, and an LLM underwriting panel (not re-reported here).
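The budget recursion behind the gating guarantee can be sketched as follows. This is a minimal illustration, not the paper's runtime: the function name, the `(action_id, envelope)` pair format, and the rule that a rejected action falls back to an uncharged safe default are all assumptions made here. The gate charges the envelope rather than the unknown true toll, so the envelopes of executed actions can never sum to more than the initial budget.

```python
def run_budgeted_gate(proposals, initial_budget):
    """Admit an action only if its upper envelope fits the remaining budget.

    proposals: iterable of (action_id, envelope) pairs with envelope >= 0.
    Returns (list of executed action ids, remaining budget).
    """
    budget = initial_budget
    executed = []
    for action_id, envelope in proposals:
        if envelope < 0:
            raise ValueError("envelope must be nonnegative")
        if envelope <= budget:
            executed.append(action_id)
            budget -= envelope  # charge the envelope, not the unknown true toll
        # Otherwise the action is rejected; assumed fallback is the safe default.
    return executed, budget


# Hypothetical proposals with made-up envelope values.
proposals = [("a", 3.0), ("b", 6.0), ("c", 2.0), ("d", 1.0)]
executed, remaining = run_budgeted_gate(proposals, initial_budget=6.0)
print(executed, remaining)
```

If every envelope upper-bounds the corresponding realized positive toll, which the validity assumption grants with high probability, then the realized tolls on executed actions are bounded by the same initial budget. Producing valid envelopes (for example by conformal calibration) is outside this sketch.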
Implications for AI Economics
- New pricing primitive: shifts liability/insurance from annual ex-post cover to per-action pre-execution tolls. This creates a commodity: per-action actuarial pricing of externalities.
- Internalization of marginal/external risk:
  - Irreversible and tail-exposure actions acquire explicit marginal prices relative to safe defaults, which can change operators' choice of actions and reduce deployment of high-tail-risk actions when priced appropriately.
  - The separation between action-level premium and set-level capital highlights how marginal and total capital requirements differ, which is crucial for product design and regulatory capital rules.
- Market and product design consequences:
  - Enables new runtime insurance/escrow products and an actuarial market around live agent actions (per-action underwriting, conditional reserves, rolling exposures).
  - Underwriting boundary design becomes an important market and regulatory lever: contracts must define boundaries and $\xi_t$ to prevent arbitrage (splitting, proxy agents, relabelling).
  - Mechanism design and contract clauses (audit rules, aggregations, related-party rules) will be necessary to avoid cross-boundary gaming and to ensure incentive compatibility.
- Operational and economic trade-offs:
  - Transaction costs and latency: per-action tolling and gating will introduce cognitive or latency costs, escalation to humans, and possible operational friction, so safety trades off against throughput.
  - Behavioral responses: agents or operators may favor safe defaults, escalate, or re-route actions to cheaper boundaries unless contracts close loopholes; careful design is needed to avoid socially undesirable refusal or avoidance.
  - Capital allocation and reserve dynamics: dynamic underwriting companions are required to specify experience rating, reserve updates, and credibility-weighted premiums; firms will need new reserve accounting for action-level exposures.
- Regulatory and governance impacts:
  - Regulators could mandate or standardize primitives (safe defaults, boundary definitions, admissible model classes) to ensure comparability, prevent regulatory arbitrage, and set minimum capital.
  - Auditability and telemetry become key economic goods: the accuracy of exposure observability and replayability affects pricing and capital.
- Limitations and open issues (economically relevant):
  - Dependence on contract-specified primitives (safe default, continuation policy, boundary) means pricing and outcomes are not unique; institutional choices matter and will shape market structure.
  - Calibration and closed-loop validity: ensuring high-probability envelopes under adaptive deployment is nontrivial; miscalibration risks underpricing exposure.
  - Cross-boundary aggregation and systemic effects: per-action pricing could shift risks across entities or create correlated failures; aggregation rules and systemic-capital considerations are not solved here.
  - Computational and informational constraints: constructing interventional distributions and robust envelopes at scale is expensive, which affects feasibility and adds friction to market adoption.
- Research and policy directions:
  - Empirical validation of behavioral and welfare effects (Paper B and beyond).
  - Mechanism-design work to ensure incentive compatibility and prevent rent-seeking.
  - Dynamic-underwriting research on reserve dynamics, learning of $\Phi_B$, and envelope calibration.
  - Possible regulatory standards for primitives (safe defaults, admissible model classes, boundary observability) to reduce heterogeneity and systemic arbitrage.
Summary take-away: The paper provides a rigorous mathematical base for pricing and gating side-effect-bearing actions in autonomous agents via time-consistent counterfactual tolls. It shows how within-boundary aggregation prevents splitting arbitrage, identifies when irreversible authority carries a positive premium, and gives a practical conservative gating guarantee. Together these enable a new class of per-action actuarial products and governance tools, while leaving important empirical, incentive-design, and calibration challenges for companion work.
Assessment
Claims (9)
| Claim | Direction | Confidence | Outcome | Details |
|---|---|---|---|---|
| We propose a foundational runtime actuarial layer for autonomous AI agents in which every side-effect-bearing action carries a time-consistent, counterfactual risk toll computed against a contractually fixed safe default, inside an explicit underwriting boundary. [Governance And Regulation] | positive | high | existence of a runtime actuarial layer assigning counterfactual risk tolls per action | 0.12 |
| The framework treats per-action insurance as the primary unit of analysis and replaces post-hoc annual liability cover with a pre-action transaction layer. [Governance And Regulation] | positive | high | shift from annual liability models to per-action pre-action insurance (design/operational modality) | 0.12 |
| (i) There exists a well-defined counterfactual toll under a chosen safe-default mapping and continuation policy. [Governance And Regulation] | positive | high | well-definedness/existence of a counterfactual toll | 0.12 |
| (i, continued) The counterfactual toll has explicit non-uniqueness (i.e., non-uniqueness of the toll is demonstrated). [Governance And Regulation] | null_result | high | non-uniqueness property of the counterfactual toll | 0.12 |
| (ii) A no-splitting property holds within an underwriting boundary that telescopes path-decomposed actions into a boundary potential. [Governance And Regulation] | positive | high | no-splitting aggregation property (telescoping into boundary potential) | 0.12 |
| (ii, corollary) Gaming-resistance of the system is tied to the design of the underwriting boundary (i.e., a corollary linking gaming-resistance to boundary design). [Governance And Regulation] | positive | high | relationship between underwriting-boundary design and resistance to gaming/manipulation | 0.12 |
| (iii) An irreversible-authority premium is characterized and splits into a strictly positive action-level component plus an if-and-only-if characterization of the set-level robust capital increase. [Governance And Regulation] | positive | high | irreversible-authority premium decomposition and positivity of action-level component; iff characterization of robust capital increase | 0.12 |
| (iv) A conservative runtime gating theorem translates high-probability toll envelopes into an executed-action budget guarantee. [Governance And Regulation] | positive | high | budget guarantee on executed actions derived from probabilistic toll envelopes | 0.12 |
| The present paper states the primitive contract, the toll identity, the within-boundary no-arbitrage result, and the budget guarantee that the later empirical, mechanism-design, and dynamic-underwriting companion papers depend on. [Governance And Regulation] | positive | high | presence/statement of specific formal primitives and theorems (primitive contract, toll identity, no-arbitrage, budget guarantee) in the paper | 0.12 |