The Commonplace
Home Dashboard Papers Evidence Syntheses Digests 🎲
← Papers

AI experts see multiple pathways to catastrophe within five years: in a Delphi of 272 specialists, 18 of 24 assessed risks had more than a 10% chance of catastrophic outcomes under business‑as‑usual by 2030, and even with pragmatic mitigations five risks retained >10% catastrophe probability, with general‑purpose AI developers and governance actors judged most responsible for mitigation.

Prioritization of Risks from Artificial Intelligence: A Delphi Study of 272 International Experts
Alexander K. Saeri, Jess Graham, Michael Noetel, Peter Slattery, Dennis Ah-king, Edla Aittokallio, Ibitola Akindehin, Abbas Al Mahdi, Elie Alhajjar, Rafael Andersson Lipcsey, Gary Ang, Catherine M. Azam, Amos Azaria, Rishal Balkissoon, Isabel Barberá, Claudio Bareato, Jonathan Barry, Michael Basehart, Andrew M. Bean, Danny Belitz, Samantha Augusta Bennett, Kayla Blomquist, Damian Borstel, Ben Bucknall, Tomas Bueno Momcilovic, Aurelie Bugeau, Nicholas Caputo, Stephen Casper, Gulam Chagani, Ze Shen Chin, Jiyeon Cho, Jay Chooi, Joel N. Christoph, Dmytro Chumachenko, Kieran Conboy, Elizabeth M. Daly, Tom David, Paul de Font-Reaulx, Antonio De Santis, Fabrizio Degni, Christopher W. DiCarlo, Yawen Duan, Janet Egan, Ian W. Eisenberg, Sherif M. Elsafty, Adam Ennamli, Mark Esposito, Nicola Fabiano, Gallo Fall, Neil R. Fernandes, Pip Foweraker, Chiara Gallese, Sandra Galletti, Andrew Gamino-Cheong, Rokas Gipiškis, Gwyn Glasser, Delaram Golpayegani, Jeff Grayson, Hans Gundlach, Josiah Hagen, Alexander Hagenah, Amelia S. Haines, The Anh Han, Yixiong Hao, Kasii Harris, Tianxing He, Koen Holtman, Giorgos Iacovides, Kenneth L. Ingham, Krystal Jackson, Adam Jones, Himanshu Joshi, Brian Judge, Arturs Kanepajs, Shreya Kapoor, Win Myat Nwe Khine, Aidan Kierans, Aleksandra Korolova, Markus Krebsz, Nicholas Kruus, Joe Kwon, Valeria Lazzaroli, Ray X. Lee, Evelina Leivada, Stephan Lewandowsky, Michael B. Li, Xiaojian Li, Geunsik Lim, Henrique Lisakowski, Fabio Lonardoni, Todd C. Lowe, Jackson G. Lu, Alexander Lyzhov, Nada Madkour, Parv Mahajan, David Manheim, Kareem Mathias, Claudio Mayrink Verdun, Sean McGregor, Scott McLean, Matthew J. McMahon, Minas Megalokonomos, Nicolas Moës, Fernando Mourao, Yaroslav Mukhin, Malcolm Murray, Simon Mylius, Neeraj Nagpal, Koichi Nakada, Anna Neumann, Jessica Newman, Kwan Yee Ng, Minh N. Nguyen, Quynh Phuong Nguyen, Seán S. Ó hÉigeartaigh, Daria Onitiu, Kelly Onu, Oscar Oviedo-Trespalacios, Ugur Ozer, Chanwoo Park, M. Alejandra Parra-Orlandoni, Patricia Paskov, Anna M. Pastwa, Burak Piskin, Jacob Pratt, Claudiu A. Predincea, Marjana Prifti Skenduli, Kenneth Priore, Mukunda Madhab Pujari, Zhenting Qi, Preethi Raghunathan, Robi Rahman, Deepika Raman, Max Reddel, Jyoti Ruparel, Emma B. Ruttkamp-Bloem, Tiffany Saade, Greg Sadler, Said Saillant, Paul M. Salmon, Ayrton San Joaquin, Lama Saouma, Maziya Sarangpurwala, Supheakmungkol Sarin, Daniel S. Schiff, Anna D. Schilling, Chris Schmitz, Reva Schwartz, Abeer Sharma, Tianhao Shen, Kehan Sheng, Maury D. Shenk, Eli Sherman, Chandler Smith, Julie M. Smith, Estevenson Solano, Oliver Sourbut, Madhulika Srikumar, Ryan Stendall, Jakob Stenseke, Michael Stern, Joshua Sternfeld, Nikko Stevens, Ilia Sucholutsky, Yuanyuan Sun, Mariami Tkeshelashvili, Cristian Trout, Brian Tse, Nikolaos Tsinganos, Michelle Vaccaro, Anthony R. Valiaveedu, Ramakrishnan Veeramony, Jeremy Verdo, Pulkit Verma, Andrea Luigi Vitali, Jinge Wang, JR Washebek, Yonah Welker, George F. Westerman, James Williams, Tristan Williams, Rongwu Xu, Mick Yang, Xuemeng Yang, Sander Zeijlemaker, Jingyu Zhang, Marta Ziosi, Neil Thompson · June 03, 2026
arxiv descriptive low evidence 7/10 relevance Source PDF
A three‑round Delphi of 272 AI experts judged most of 24 evaluated AI risks plausibly catastrophic within 2025–2030—under business as usual 18 risks had >10% chance of catastrophe and even with mitigations five risks retained >10% probability—identifying dangerous capabilities, competitive dynamics, weapons/cyberattacks, power centralization, and misinformation as the most severe.

Artificial intelligence poses many risks, ranging from familiar present-day harms to unprecedented and potentially catastrophic ones. Effective risk management requires prioritization: we must understand which risks are most severe, who is most vulnerable, and who is most responsible for addressing them. We report results from a three-round Delphi study conducted late 2025 with 272 international AI experts. Experts rated 24 AI risks on harm probability and severity, sector and actor vulnerability, actor responsibility, and overall concern. Experts estimated the five most severe harms in the next 5 years were likely to come from dangerous capabilities, competitive dynamics, weapons & cyberattacks (including CBRNE), power centralization, and false information. In a business-as-usual scenario, experts judged 18 of 24 risks as having a more than 10% probability of catastrophic outcomes (e.g., more than 1 million deaths or more than USD 100B in financial loss) in the next 5 years (2025-2030). In a scenario where pragmatic mitigations are implemented, experts still judged five risks as having a more than 10% probability of catastrophic outcomes: dangerous capabilities, weapons & cyberattacks, environmental harm, inequality & unemployment, and power centralization. All 24 risks were judged as being more than 5% likely to cause catastrophic outcomes. AI users and the general public were judged the most vulnerable to these risks, but experts assigned the highest responsibility for addressing them to general-purpose AI developers and governance actors (including governments, regulators, and standards bodies). Across most risks, experts identified information, finance, and national security as the most vulnerable sectors. These findings can guide AI risk prioritization and clarify expert expectations about who should bear responsibility for mitigation.

Summary

Main Finding

A three-round Delphi of 272 international AI experts (late 2025) found that many AI risk domains carry non-negligible near-term tail risk. Under a business-as-usual (BAU) trajectory (2025–2030), experts judged 18 of 24 AI risk domains each had >10% probability of a catastrophic outcome (catastrophic defined as >1 million deaths, >USD 100B loss, or civilization-scale intangible harms). Even with pragmatic mitigations, five domains retained >10% catastrophic probability (dangerous capabilities; weapons & cyberattacks; environmental harm; inequality & unemployment; power centralization), and all 24 domains had >5% probability of catastrophic outcomes.

Key Points

  • Sample & method: 272 international AI experts; three-round Delphi elicitation; anonymous, iterative ratings and qualitative rationales.
  • Risks assessed: 24 AI risk subdomains (drawn from prior taxonomy); experts rated harm probability/severity, vulnerability by actor and sector, and actor responsibility.
  • Top-rated severity risks (mean severity out of 5; mean catastrophic probability under BAU):
    • Dangerous capabilities — mean severity 3.49; catastrophic probability ≈21.5%
    • Weapons & cyberattacks — mean severity 3.49; catastrophic probability ≈21.0%
    • Competitive dynamics — mean severity 3.49; catastrophic probability ≈16.6%
    • Power centralization — mean severity 3.47; catastrophic probability ≈18.0%
    • False/misleading information — mean severity 3.44; catastrophic probability ≈12.8%
  • BAU scenario: 18/24 domains judged >10% chance of catastrophic outcomes over 5 years (2025–2030).
  • Pragmatic mitigations: reduce risks overall, but five domains remain >10% catastrophic risk; all domains >5% catastrophic risk even after mitigations.
  • Vulnerability: AI users and affected stakeholders (the public, downstream users) judged most vulnerable; sectors most exposed across risks were information, finance, and national security.
  • Responsibility: experts assign highest responsibility to general-purpose AI developers and governance actors (governments, regulators, standards bodies).
  • Noted asymmetry: those most responsible are not those most vulnerable → misaligned incentives and potential moral hazard.
  • Data availability: data and code publicly available (OSF link in paper).

Data & Methods

  • Method: Delphi method (three rounds) to build and test expert consensus while eliciting subjective probability distributions and qualitative rationales.
  • Participants: 272 experts with international representation across academia, industry, policy, and civil society; many contributors also appear among paper authors (declared).
  • Elicitation details:
    • Time horizon: 5 years (late 2025–late 2030).
    • Severity rubric: 1 (negligible) to 5 (catastrophic), with quantitative anchors for catastrophic (e.g., >1M deaths, >USD 100B loss) plus qualitative anchors for civilizational/intangible harms.
    • Two scenarios: Business-as-Usual (no additional AI-specific mitigations) and Pragmatic Mitigations (realistic mitigation steps implemented).
    • Outcomes asked: probability distributions over severity, vulnerability by actor and sector, and responsibility attribution across actors.
  • Limitations noted by authors:
    • Subjective expert probabilities reflect beliefs under the specific rubric and framing; sensitive to anchors and horizon.
    • Difficulty translating harms to non-human AI welfare and complex/intangible harms.
    • Potential selection/representation biases in expert panel; some experts had organizational roles with stakes in AI (declared).
    • Elicited probabilities are not calibrated real-world frequencies but expert judgments to inform prioritization.

Implications for AI Economics

  • Incorporate tail risks into economic forecasting and cost–benefit analysis:
    • Expert-elicited >10% catastrophic probabilities over a five-year horizon imply substantial expected-value losses for several domains; macroeconomic and sectoral models should include fat-tail risk scenarios, not only central estimates.
    • Insurance, reinsurance, and systemically important financial institutions should model AI-driven tail events (cyber/market shocks, operational breakdowns) explicitly in stress tests and capital buffers.
  • Regulatory and incentive design:
    • Experts place responsibility on general-purpose AI developers and public governance actors, while vulnerability falls on users and the public → need for regulatory frameworks that align incentives (liability rules, mandatory safety standards, third‑party auditing, and governance oversight).
    • Antitrust and market-structure policies are economically relevant given power centralization risk (market concentration, innovation incentives, rent extraction).
  • Sectoral policy priorities:
    • Information sector: investments in misinformation resilience, platform liability rules, demand-side media literacy, and content-safety economics.
    • Finance: systemic risk controls, algorithmic governance, model risk capital requirements, and rapid-response capabilities for AI-enabled market manipulation.
    • National security and cyber: prioritize defensive R&D, red-teaming, and public-private coordination; consider economic trade-offs of offensive capabilities and escalation.
  • Labor markets and distributional policy:
    • Inequality & unemployment remain a persistent catastrophic-risk concern even with mitigations — justify active labor-market policies (retraining, income support), and economic models of automation should weight near-term disruption risk higher.
  • Mitigation investment rationale:
    • Pragmatic mitigations reduce but do not eliminate tail risk; marginal returns exist but residual risk persists. Economic policy should weigh upfront mitigation costs against expected reduction in tail-losses (use expert priors to parameterize expected-loss reduction).
  • Research and measurement:
    • Need for better empirical measurement of AI externalities, markets for safety (liability, insurance), and quantification of mitigation efficacy to refine economic models and policy prescriptions.
  • Governance economics:
    • Consider mechanisms to internalize externalities (e.g., taxes, mandated insurance, certification) and to correct the responsibility–vulnerability mismatch via subsidies, mandates, or liability reforms.

Recommended next steps for economic researchers and policymakers: - Use the Delphi priors to stress-test macro and sector models for 2025–2030, and to price potential systemic exposures. - Prioritize regulatory design targeting general-purpose AI developers and governance institutions, coupled with protections for vulnerable actors and sectors. - Model intervention cost-effectiveness under the elicited tail-risk probabilities to inform resource allocation between mitigation types (technical safety, cybersecurity, labor-market policies, antitrust).

Assessment

Paper Typedescriptive Evidence Strengthlow — Findings are based on expert elicitation (a three‑round Delphi) and reflect aggregated subjective judgments and forecasts rather than observational or experimental data that would permit causal inference or objective validation of probabilities. Methods Rigormedium — The study uses a standard three‑round Delphi with a relatively large panel (272 international AI experts), which supports iterative calibration and diversity of opinion; however, the public summary lacks detail about expert selection, response rates, weighting, question phrasing, and calibration/validation procedures, leaving room for selection, anchoring, and aggregation biases. Sample272 international AI experts participated in a three‑round Delphi conducted in late 2025; participants rated 24 predefined AI risks on harm probability and severity, sector and actor vulnerability, actor responsibility, and overall concern for 5‑year horizons under business‑as‑usual and mitigation scenarios; further demographic and sampling details not provided in the summary. Themesgovernance inequality labor_markets productivity GeneralizabilitySubjective expert judgments may not correspond to realized outcomes; forecasts can be biased (over/underconfidence, anchoring)., Panel composition and selection procedures are unclear—may overrepresent particular geographies, disciplines, or perspectives., Findings are time‑bound (late‑2025 panel; 2025–2030 horizon) and may become quickly outdated as AI capabilities and governance evolve., Risks were limited to 24 predefined categories, possibly omitting other relevant harms or interactions., Expert views do not substitute for empirical causal estimates of economic outcomes (e.g., measured impacts on productivity, wages, or employment).

Claims (10)

ClaimDirectionConfidenceOutcomeDetails
We conducted a three-round Delphi study conducted late 2025 with 272 international AI experts. Other null_result high study_participation / sample characterization
n=272
0.3
Experts rated 24 AI risks on harm probability and severity, sector and actor vulnerability, actor responsibility, and overall concern. Other null_result high risk ratings across multiple dimensions (probability, severity, vulnerability, responsibility, concern)
n=272
0.3
Experts estimated the five most severe harms in the next 5 years were likely to come from dangerous capabilities, competitive dynamics, weapons & cyberattacks (including CBRNE), power centralization, and false information. Consumer Welfare negative high ranked severity of AI-related harms over next 5 years
n=272
0.18
In a business-as-usual scenario, experts judged 18 of 24 risks as having a more than 10% probability of catastrophic outcomes (e.g., more than 1 million deaths or more than USD 100B in financial loss) in the next 5 years (2025-2030). Consumer Welfare negative high judged probability of catastrophic outcomes (>1M deaths or >$100B loss) under BAU scenario
n=272
18 of 24 risks >10% probability
0.18
In a scenario where pragmatic mitigations are implemented, experts still judged five risks as having a more than 10% probability of catastrophic outcomes: dangerous capabilities, weapons & cyberattacks, environmental harm, inequality & unemployment, and power centralization. Consumer Welfare negative high judged probability of catastrophic outcomes (>1M deaths or >$100B loss) under pragmatic mitigations scenario
n=272
5 risks >10% probability
0.18
All 24 risks were judged as being more than 5% likely to cause catastrophic outcomes. Consumer Welfare negative high judged probability of catastrophic outcomes (>1M deaths or >$100B loss) for each risk
n=272
24 of 24 risks >5% probability
0.18
AI users and the general public were judged the most vulnerable to these risks. Social Protection negative high actor vulnerability ratings
n=272
0.18
Experts assigned the highest responsibility for addressing these risks to general-purpose AI developers and governance actors (including governments, regulators, and standards bodies). Governance And Regulation positive high actor responsibility attribution
n=272
0.18
Across most risks, experts identified information, finance, and national security as the most vulnerable sectors. Market Structure negative high sector vulnerability across listed risks
n=272
0.18
These findings can guide AI risk prioritization and clarify expert expectations about who should bear responsibility for mitigation. Governance And Regulation positive medium utility of study findings for risk prioritization and responsibility assignment
n=272
0.02

Notes