The Commonplace
Home Dashboard Papers Evidence Syntheses Digests 🎲
← Papers

Insurers cannot settle many AI-driven losses by 'event' logs alone; they need claim-grade reconstruction of the AI's evolving state. The CER framework — Control boundary, Evidence reconstruction, Insurance response — clarifies when retained artifacts and enforceable operating envelopes suffice to transfer residual AI risk to insurance markets.

From Control Boundary to Insurance Claim: Reconstructing AI-Mediated Losses Through the CER Framework
Alex Leung, Rex Zhang, Kentaroh Toyoda, SiewMei Loh · June 02, 2026
arxiv theoretical n/a evidence 7/10 relevance Source PDF
The paper introduces CER — Control boundary, Evidence reconstruction, Insurance response — a practical diagnostic for whether losses caused by generative or agentic AI can be reconstructed to support insurance claims, illustrated with public incidents and a legal case.

AI losses that arise through an insured organization's generative or agentic AI system require state reconstruction, not merely event reconstruction, because the relevant state changes as the system reasons, retrieves, calls tools, and acts. The relevant question is not only what loss occurred, but what the system was allowed to do, what it actually did, and whether that reconstructed loss can support insurance claim recovery. This paper addresses losses in which the insured's AI system is in the causal chain, including externally triggered failures such as prompt injection, retrieval-augmented generation (RAG) poisoning, malicious tool output, credential misuse, and data poisoning. Specifically, this paper introduces CER, a use-case-level diagnostic for AI residual risk transfer. C (control boundary) asks whether the system had an enforceable operating envelope. E (evidence reconstruction) asks whether the system state and causal chain can be reconstructed from retained artifacts. R (insurance response) asks whether the reconstructed loss is insured: whether insurance coverage is available in the market and placed for the insured, together with the proof needed to support insurance claim recovery. The paper makes three contributions: it defines the AI-specific reconstruction problem, operationalizes that problem through CER, and specifies claim-grade evidence for AI reconstruction. Public examples include the reported PocketOS and Replit agentic database-deletion incidents and Moffatt v. Air Canada as an adjudicated output/reliance case. Keywords: AI systems; CER framework; residual risk transfer; agentic AI; generative AI; AI insurance; evidence reconstruction.

Summary

Main Finding

The paper introduces CER, a use-case-level diagnostic that determines whether losses involving an insured organization’s generative or agentic AI system can realistically be transferred to insurance. CER requires linking three evidentiary steps: C — an enforceable control boundary that defines what the system was permitted to do; E — claim-grade evidence that reconstructs the AI system’s actual state and causal chain; and R — an insurance response showing the reconstructed loss maps to placed, applicable coverage. A loss is considered transfer-aligned only when all three links are present and demonstrably connected.

Key Points

  • The core conceptual advance is distinguishing state reconstruction from event reconstruction. For AI-mediated losses you must reconstruct the system state (prompts, retrieved context, model version, memory, tool calls, credentials, approvals, outputs, actions) that generated the harmful outcome, not just the downstream event.
  • CER = Control boundary (C), Evidence reconstruction (E), Insurance response (R). Each dimension is scored 0–3; the framework defines how scores combine into diagnostic states and when risk is effectively transferable.
  • C (Control): separates mere policy/prompt instructions from technically enforceable boundaries. Technical enforcement (IAM scoping, runtime guards, approval gates) is necessary for high C scores; policy-only or human-review-only defenses are weak, especially for agentic systems (confused-deputy risks).
  • E (Evidence): requires retained, correlated artifacts sufficient for audit/claims (seven artifact families described; claim-grade evidence template provided). Fidelity and provenance are distinct — one can reconstruct an artifact that is itself poisoned or inauthentic.
  • R (Insurance): asks whether the reconstructed loss triggers placed coverage and meets proof/causation requirements. Even with strong C and E, the loss may map to silent, excluded, or unpurchased coverage.
  • Scope: applies when the insured’s operated AI system materially contributes to loss. External AI-enabled attacks are in scope only when they act through the insured’s system (e.g., RAG poisoning, prompt injection); pure third-party deepfakes that trick human employees without using the insured’s AI are out of scope.
  • Illustrative incidents: PocketOS / Cursor agent deletion (agentic deletion via broad infra token), Replit agentic database-deletion incident, and Moffatt v. Air Canada (adjudicated output/reliance case) are used to show the framework’s application.
  • CER is complementary to — not a replacement for — existing governance, observability, forensics, and insurance-coverage analyses. It links governance outputs, telemetry/artifacts, and coverage evaluation into a use-case-level reconstruction-to-transfer chain.

Data & Methods

  • This is a conceptual, operational paper rather than an empirical study. Methods include:
    • Literature and regulatory review: synthesis of AI-risk frameworks (OWASP LLM/Agentic Top 10, MITRE ATLAS, NIST AI RMF), insurance market studies (coverage demand, silent-AI exposure), legal cases and regulatory instruments (EU AI Act, GDPR Article 22).
    • Formal definitions: precise definitions for AI-mediated loss, material contribution, permitted operating envelope, actual reconstructed state, coverage-responsive loss, and claim-grade evidence.
    • Operationalization: a practical diagnostic (CER) with a 0–3 scoring rubric per dimension, a mapping from scores to diagnostic outcomes, and an explicit practitioner template for claim-grade artifacts (seven artifact families described in appendix).
    • Case illustrations: applied CER to public incidents (PocketOS, Replit) and a court case (Moffatt v. Air Canada) to demonstrate how the framework would assess transferability.
  • No primary quantitative datasets or econometric analysis are presented. The framework relies on qualitative mapping of artifacts and governance/technical controls to insurance evidentiary needs.

Implications for AI Economics

  • Underwriting & Pricing: CER makes insurers’ evidentiary requirements explicit, which will affect underwriting granularity. Insurers may price AI-exposed risks based on (a) whether enforceable technical controls exist (C), and (b) the quality and persistence of forensic/evidence logging (E). Firms with weak C/E are likely to face higher premiums, stricter retentions, or narrow coverage.
  • Market segmentation & product innovation: Expect new insurance products and endorsements tailored to agentic and generative-AI exposures (e.g., coverage conditional on minimum C/E standards), plus risk-reduction-as-a-service (logging/forensic providers) bundled with insurance. Specialty markets may arise for high-control deployments (financial infra) vs. low-control consumer-facing agents.
  • Moral hazard & contractual design: Clearer proof requirements reduce asymmetric information and moral hazard by making claims contingent on demonstrable technical controls and retained artifacts. However, firms might underinvest in C/E if coverage remains available for poorly documented incidents — insurers will respond by conditioning coverage on CER-like requirements.
  • Adverse selection & selection effects: Without standardized CER-like evidentiary baselines, insurers may face adverse selection: firms that cannot demonstrate C/E will either be uninsured or mispriced. Standardization of logging and minimum enforceable controls reduces adverse selection and supports broader capacity in the market.
  • Operational investment & compliance costs: Firms will internalize costs to achieve transferability (implement runtime guards, least-privilege identities, secure artifact retention, provenance tracking). These are recurring costs that change the marginal economics of deploying agentic capabilities — especially for smaller firms.
  • Systemic risk & concentration: Agentic AI can produce rapid, large-scale actions (credential misuse, mass deletions). If many firms rely on similar toolchains and insurers adopt similar exclusion/coverage conditions, systemic vulnerabilities may emerge (e.g., correlated uninsured losses). CER highlights the need for industry-wide logging/standards to mitigate systemic tail risk.
  • Regulatory alignment & market signals: CER complements regulatory expectations (EU AI Act logging obligations, product liability reforms). Harmonized legal and insurance evidentiary standards could accelerate market development and reduce transaction costs for claims.
  • Incentives for observability & third-party services: High demand for claim-grade evidence will create markets for tamper-evident logs, secure memory provenance, immutable audit trails, and independent reconstruction services. Those markets alter the value chain around AI deployment and shift some economic surplus to forensic/monitoring providers.
  • Broader welfare trade-offs: Better ability to transfer AI risk (through CER-compliant controls and evidence) expands capacity to deploy agentic AI but at higher deployment cost. Society benefits from safer deployment and clearer liability allocation, but small or resource-constrained organizations may be priced out of deployable agentic uses or face higher retained risk.

Limitations and open questions for economists: - CER is conceptual and procedural; it does not provide quantitative estimates of how much premiums or coverage terms will change. - The paper does not model equilibrium outcomes (e.g., how insurers and firms will co-evolve controls, evidence practices, and contract terms). - Future empirical work could quantify the value of improved C/E on loss frequency/severity, insurer capacity, premium spreads, and broader adoption rates of agentic AI.

Overall, CER provides a practical, actionable bridge between AI governance/forensics and insurance transfer — a necessary structure for credible market pricing and for aligning incentives across deployers, insurers, and regulators.

Assessment

Paper Typetheoretical Evidence Strengthn/a — The paper is a conceptual and normative framework that does not present empirical causal estimation or inferential evidence; it uses illustrative public incidents and a legal case rather than systematic data analysis. Methods Rigormedium — The framework is clearly structured (CER) and grounded in illustrative, real-world examples and legal precedent, but it lacks systematic empirical testing, formal modeling, or validation across a representative set of incidents or insurers. SampleNo original quantitative dataset; relies on public incident reports and media accounts (e.g., PocketOS, Replit agentic database-deletion incidents) and the adjudicated legal case Moffatt v. Air Canada to illustrate the reconstruction problem and required claim-grade evidence. Themesgovernance org_design innovation GeneralizabilityRelies on a small set of public incidents and one legal case; may not generalize to the full distribution of AI failures., Legal and regulatory outcomes vary by jurisdiction, limiting transferability of the insurance implications., Assumes availability/retention of system artifacts and logging — not applicable to black-box or ephemeral deployments., Focuses on insured organizations and commercial insurance markets; findings may not apply to uninsured actors or public-sector deployments., Does not empirically test insurer behavior, market pricing, or claim adjudication at scale., May be less applicable to narrow non-agentic AI deployments where state evolution is minimal.

Claims (6)

ClaimDirectionConfidenceOutcomeDetails
AI losses that arise through an insured organization's generative or agentic AI system require state reconstruction, not merely event reconstruction, because the relevant state changes as the system reasons, retrieves, calls tools, and acts. Governance And Regulation positive high need for state reconstruction (vs. event-only reconstruction) to support insurance claim recovery after AI-caused losses
0.02
The relevant question for such losses is not only what loss occurred, but what the system was allowed to do, what it actually did, and whether that reconstructed loss can support insurance claim recovery. Governance And Regulation positive high completeness of reconstruction (allowed actions, actual actions) needed to establish insurance claim recovery
0.02
The paper addresses losses in which the insured's AI system is in the causal chain, including externally triggered failures such as prompt injection, retrieval-augmented generation (RAG) poisoning, malicious tool output, credential misuse, and data poisoning. Governance And Regulation positive high coverage of AI-caused loss modes (identification of failure types relevant to reconstruction and insurance)
0.2
The paper introduces CER, a use-case-level diagnostic for AI residual risk transfer: C (control boundary) asks whether the system had an enforceable operating envelope; E (evidence reconstruction) asks whether the system state and causal chain can be reconstructed from retained artifacts; R (insurance response) asks whether the reconstructed loss is insured (coverage available and placed, and proof needed to support claim recovery). Governance And Regulation positive high diagnostic ability to evaluate residual risk transfer via control boundaries, evidence reconstruction, and insurance response
0.2
The paper makes three contributions: it defines the AI-specific reconstruction problem, operationalizes that problem through CER, and specifies claim-grade evidence for AI reconstruction. Governance And Regulation positive high conceptual/operational contributions delivered by the paper (definition, operationalization, evidence specification)
0.2
Public examples referenced include the reported PocketOS and Replit agentic database-deletion incidents and Moffatt v. Air Canada as an adjudicated output/reliance case. Governance And Regulation positive high use of real-world examples and adjudicated case to illustrate AI reconstruction and insurance issues
0.2

Notes