Cheap AI judgment upends science's monopoly on authoritative judgment; the key bottleneck is institutions' capacity to integrate and certify AI outputs—not computational speed.

AI-Augmented Science and the New Institutional Scarcities

Lauri Lovén · May 04, 2026

arxiv theoretical n/a evidence 8/10 relevance Source PDF

The paper argues that as AI makes competent judgment cheap, scientific institutions must compete to supply four scarce complements—verified signal, legitimacy, authentic provenance, and integration capacity—with integration capacity being the most binding constraint on AI-augmented science.

Competent-looking judgment, including selecting, ranking, attributing, and certifying, is now produced at scale at marginal cost approaching zero, inverting the dominant economics-of-AI reading that treats judgment as the scarce complement to cheap prediction. Scientific institutions, distinctively, manufacture legitimate judgment, so they do not merely adapt to AI; they compete with it for the same functional role. Four complements then become scarce and load-bearing for AI-augmented science: verified signal, legitimacy, authentic provenance, and integration capacity (the community's tolerance for delegated cognition). Of these four, integration capacity is the least developed for scientific institutions and the most binding: no improvement in AI tooling can buy it. The frontier for AI-augmented science is not acceleration; it is the redesign of the certifying infrastructure around these new scarcities.

Summary

Main Finding

Frontier AI is making competent-looking judgment effectively abundant and cheap, which reverses the classic economics-of-AI story (cheap prediction + scarce judgment). For scientific institutions—whose core product is certified judgment—this creates new binding scarcities: verified signal, legitimacy, authentic provenance, and integration capacity (the community’s tolerance for delegated cognition). The survival value shifts from producing judgment to producing trustworthy, verifiable certification; the institutional challenge is redesigning certification/verification infrastructure, not just better tooling.

Key Points

Diagnostic frame: technological revolutions are best read as shifts in what is scarce. AI shifts scarcity away from judgment toward four new scarcities that become load-bearing for science.
Four new scarcities:
Verified signal — reproducible, attested results (not merely “reviewed”) as the valuable editorial output.
Legitimacy — a depletable commons (brand/credibility) that erodes with visible failures of certification.
Authentic provenance — machine-readable, composable records of model, data, prompts, and human decisions for each scientific artefact.
Integration capacity — the community’s ceiling on how much delegated AI-judgment it can absorb without losing trust; not purchasable by tooling.
Institutional failure modes from abundant judgment:
- Peer review volume and quality collapse into a cat-and-mouse loop unless verification is rethought.
- Authorship/attribution breaks when human-AI co-production is not auditable.
- Reproducibility becomes relatively more expensive than producing plausible-but-fake claims.
- Grant evaluation and funding processes become gamed at rhetorical surface level by AI.
Proposed verification-first agenda (concrete moves):
- Reproducibility-first editorial tracks: acceptances include verified computational claims.
- Tiered certification: publish the certification tier (claim/result/framework) alongside papers.
- Provenance by default: machine-readable provenance records (model versions, data sources, prompt traces, audit logs).
- Verification-as-a-commons: shared reproducibility/attestation infrastructure with polycentric governance to resist capture.
Distributional concerns: these costs fall disproportionately on under-resourced institutions and disciplines; partial, proprietary solutions risk exacerbating inequalities.

Data & Methods

Method: conceptual and institutional analysis drawing on economic theory of scarcity, knowledge-commons literature (Ostrom, Hess), and examples from AI/ML practice.
Evidence base: synthesis of prior literatures and recent field signals:
- Reproducibility programs and checklists (NeurIPS reproducibility programs, MLSys artefact evaluations).
- Emerging norms and failures in ML benchmarks (test-set contamination, irreproducible SOTA claims).
- Tools and standards that exist but are incomplete (model cards, datasheets, W3C-PROV, C2PA).
- Regulatory context (EU AI Act Article 50) and market-concentration trends for foundation models.
No new empirical dataset or formal model is presented; the paper is a diagnosis and policy/architectural prescription grounded in observed trends and institutional theory.
Limitations: forward-looking and normative; specific operational and quantitative thresholds (e.g., how much integration capacity is tolerable) are not empirically estimated here.

Implications for AI Economics

Reframe of scarce complements: Economic models should stop treating judgment as the persistent scarce complement to AI prediction. Instead, they should model markets and institutions where verification, provenance, legitimacy, and integration capacity are the scarce inputs that determine value.
Market structure and supply of verification:
- Verification has public-good and commons characteristics; left to markets, it risks concentration (capture by big-platform verifiers) or underprovision.
- Two feasible equilibria: verification-as-a-product (centralized, monetized, liable to capture) vs verification-as-a-commons (shared infrastructure, polycentric governance).
- Incentive problems: producers benefit from presenting competent-looking claims; verifiers bear costly attestation work with weak private returns → classic market failure and need for subsidization or coordinated governance.
Returns and rents:
- Commoditization of judgment compresses rents for individual experts while increasing rents for entities controlling trusted verification/provenance infrastructure.
- Incumbency advantage for fields (like AI/ML) that build verification tooling early; global inequality risk as under-resourced communities bear disproportionate verification costs.
Policy and regulation:
- Regulation (e.g., EU AI Act transparency rules) can mandate disclosure but does not substitute for community-led scientific certification; economic policy must consider complementarities between regulation and community commons.
- Funders and learned societies are natural candidates for subsidizing verification commons to correct externalities.
Labor and credentialing markets:
- Demand shifts from routine evaluative labor (triage, first-pass reviewing) to verification labor, governance, and curation skills. This implies retraining and reallocation of human capital.
- Credential markets (journals, conferences, funders) will compete on credibility rather than throughput; business models linked to throughput/volume may become less valuable.
Measurement and research agenda for AI economics:
- Empirically quantify the four scarcities and proxies: reproducibility rates, provenance completeness, measures of legitimacy erosion (dissent rates, opt-out uptake, reputation decay), and integration-capacity metrics (uptake of human-only review tracks, downstream trust indicators).
- Model the dynamics of verification supply (public, private, hybrid), pricing, and platform competition, incorporating externalities from unreliability.
- Evaluate redistribution policies and funding mechanisms to underwrite shared verification infrastructure and avoid concentration-related pathologies.
- Study multi-stakeholder governance designs (polycentric, funder coalitions, learned-society convening) and their comparative welfare implications.
Normative takeaway for economists: policy and research should pivot from optimizing predictive performance to designing institutions that price, produce, and govern verification, provenance, and legitimacy in the presence of abundant AI-generated judgment.

Assessment

Paper Typetheoretical Evidence Strengthn/a — The paper is a conceptual/theoretical argument without empirical tests or causal identification; claims are plausible but not supported by data or causal inference. Methods Rigorn/a — No empirical methods are applied; rigor depends on logical coherence and normative/institutional reasoning rather than reproducible empirical procedures. SampleNo empirical sample or dataset; the paper advances a conceptual framework and normative argument about scientific institutions and AI-augmented judgment. Themesgovernance innovation org_design human_ai_collab adoption GeneralizabilityArgument is conceptual and not empirically validated, so real-world applicability is uncertain, Focuses on scientific institutions; implications may differ for private firms, regulators, or non-research domains, Assumes continued progress in AI judgment capabilities; results may change with different AI limitations, Institutional, cultural, and regulatory differences across countries may limit transferability, Neglects heterogeneity across scientific fields (e.g., experimental vs. theoretical disciplines)

Claims (5)

Claim	Direction	Confidence	Outcome	Details
Competent-looking judgment, including selecting, ranking, attributing, and certifying, is now produced at scale at marginal cost approaching zero, inverting the dominant economics-of-AI reading that treats judgment as the scarce complement to cheap prediction. Ai Safety And Ethics	positive	high	production of competent-looking judgment (selecting, ranking, attributing, certifying) at scale and low marginal cost	0.02
Scientific institutions, distinctively, manufacture legitimate judgment, so they do not merely adapt to AI; they compete with it for the same functional role. Governance And Regulation	mixed	high	competition between scientific institutions and AI for the functional role of producing legitimate judgment	0.02
Four complements then become scarce and load-bearing for AI-augmented science: verified signal, legitimacy, authentic provenance, and integration capacity (the community's tolerance for delegated cognition). Governance And Regulation	negative	high	scarcity of verified signal, legitimacy, authentic provenance, and integration capacity in AI-augmented science	0.02
Of these four, integration capacity is the least developed for scientific institutions and the most binding: no improvement in AI tooling can buy it. Organizational Efficiency	negative	high	relative development of integration capacity in scientific institutions and its resistance to improvement via AI tooling	0.02
The frontier for AI-augmented science is not acceleration; it is the redesign of the certifying infrastructure around these new scarcities. Governance And Regulation	positive	high	prioritization of redesigning certifying infrastructure versus accelerating scientific workflows	0.02