Evidence (7198 claims)
Search and filter individual claims pulled from the papers. Looking for a specific finding ("what's the effect on wages?"), you're in the right place. Want to compare whole outcome categories against each other instead? Use the Evidence Explorer.
The board below groups claims two ways: by broad theme (nine paper-level topics) and by outcome category (the 34 claim-level outcomes that the Explorer and Syntheses also use).
Browse by theme
Nine broad, paper-level topics. Click one to filter the claims below.
Adoption
8921 claims
Filter claims →
Productivity
8002 claims
Filter claims →
Governance
7198 claims
Filtered →
Human-AI Collaboration
6864 claims
Filter claims →
Org Design
4398 claims
Filter claims →
Innovation
4286 claims
Filter claims →
Labor Markets
3629 claims
Filter claims →
Skills & Training
3001 claims
Filter claims →
Inequality
2141 claims
Filter claims →
Claims by outcome category
Counts by direction of finding. These are the same 34 outcome categories the Explorer compares and the Syntheses are written for. A linked row has a published synthesis.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 790 | 208 | 103 | 950 | 2117 |
| Governance & Regulation | 869 | 411 | 195 | 126 | 1630 |
| Organizational Efficiency | 817 | 202 | 126 | 87 | 1243 |
| Technology Adoption Rate | 675 | 258 | 128 | 106 | 1178 |
| Research Productivity | 462 | 138 | 64 | 347 | 1023 |
| Output Quality | 501 | 193 | 61 | 52 | 807 |
| Decision Quality | 346 | 180 | 84 | 51 | 668 |
| AI Safety & Ethics | 235 | 285 | 70 | 34 | 630 |
| Firm Productivity | 452 | 58 | 91 | 20 | 627 |
| Market Structure | 184 | 171 | 123 | 24 | 507 |
| Task Allocation | 221 | 65 | 76 | 34 | 401 |
| Skill Acquisition | 176 | 62 | 62 | 17 | 317 |
| Innovation Output | 207 | 28 | 48 | 18 | 303 |
| Fiscal & Macroeconomic | 135 | 72 | 44 | 26 | 284 |
| Employment Level | 105 | 56 | 108 | 13 | 284 |
| Consumer Welfare | 121 | 67 | 45 | 11 | 244 |
| Firm Revenue | 160 | 50 | 28 | 4 | 242 |
| Task Completion Time | 182 | 33 | 10 | 13 | 239 |
| Inequality Measures | 45 | 126 | 50 | 6 | 227 |
| Worker Satisfaction | 94 | 73 | 23 | 12 | 202 |
| Error Rate | 76 | 98 | 11 | 4 | 189 |
| Regulatory Compliance | 81 | 73 | 17 | 7 | 178 |
| Automation Exposure | 61 | 59 | 26 | 14 | 163 |
| Training Effectiveness | 97 | 21 | 14 | 19 | 153 |
| Wages & Compensation | 78 | 37 | 25 | 6 | 146 |
| Developer Productivity | 105 | 18 | 14 | 6 | 144 |
| Team Performance | 87 | 17 | 28 | 10 | 143 |
| Job Displacement | 12 | 83 | 21 | 1 | 117 |
| Hiring & Recruitment | 52 | 8 | 8 | 3 | 71 |
| Social Protection | 39 | 17 | 8 | 2 | 66 |
| Creative Output | 32 | 20 | 8 | 3 | 64 |
| Skill Obsolescence | 5 | 49 | 6 | 1 | 61 |
| Labor Share of Income | 17 | 19 | 17 | — | 53 |
| Worker Turnover | 15 | 14 | — | 3 | 32 |
| Industry | — | — | — | 1 | 1 |
Governance
Remove filter
Multiple trial runs show low variance across scenarios, demonstrating high reproducibility with 95% confidence intervals.
Reported statistical characterization from repeated trials in the paper (statement of low variance and 95% confidence intervals across scenarios).
Security mechanisms impose low latency overhead (19.6ms average).
Performance measurement reported in the paper's experiments (average latency overhead reported as 19.6ms).
Security mechanisms achieve 100% block rate for both replay attacks and invalid tokens.
Experimental security evaluation reported in the paper (block rate reported at 100% for replay attacks and invalid tokens).
The system uses FastAPI, SQLite, and Python standard libraries, making it transparent, inspectable, and reproducible.
Implementation stack specified in the paper and availability of reference implementation; asserted reproducibility.
APEX implements a challenge–settle–consume lifecycle with HMAC-signed short-lived tokens, idempotent settlement handling, and policy-aware payment approval.
Implementation details described in the methods/architecture section and supported by the provided reference implementation.
We present APEX, an implementation-complete research system that adapts HTTP 402-style payment gating to UPI-like fiat workflows while preserving policy-governed spend control, tokenized access verification, and replay resistance.
System design and implementation presented in the paper (codebase built using FastAPI, SQLite, Python; demonstration/implementation claimed).
API providers need request-level monetization with programmatic spend governance.
Normative recommendation in the paper (argumentation rather than empirical evidence).
Autonomous agents are moving beyond simple retrieval tasks to become economic actors that invoke APIs, sequence workflows, and make real-time decisions.
Framing statement / literature-motivated claim in the paper's introduction (qualitative argumentation, no experimental sample reported).
AI innovation achieves corporate low-carbon development by reorienting investment toward green assets.
Mechanism analysis reported in the paper (mediation/path analysis) using the same 21,428 firm-year observations; investment reorientation toward green assets identified as a mediation path.
AI innovation achieves corporate low-carbon development by upgrading emission-reducing production processes.
Mechanism analysis reported in the paper (mediation/path analysis) on the 21,428 firm-year sample; production-process upgrades identified as a mediation path.
AI innovation achieves corporate low-carbon development by optimizing low-carbon organizational governance.
Mechanism analysis reported in the paper (mediation/path analysis) using the same sample of 21,428 firm-year observations; paper identifies organizational governance optimization as one of three mediation paths.
Only interventions that reshape risk allocation can plausibly shift stable system-level behaviour.
Argument based on the paper's game-theoretic reasoning and stylised example (theoretical claim; no empirical testing reported in the abstract).
Artificial intelligence (AI) is widely promoted as a promising technological response to healthcare capacity and productivity pressures.
Author assertion in the paper's introduction/abstract, based on literature/policy discourse (no empirical sample or quantitative analysis reported in the abstract).
Voluntary safety commitments can sustain cooperative (higher-quality) outcomes when they are observable and credible.
Theoretical analysis of an equilibrium with voluntary, observable commitments: when commitments are binding/credible and observable, firms can coordinate to avoid preemption and achieve cooperative outcomes.
Minimum quality standards can implement the first-best outcome.
Theoretical policy analysis within the model: imposing a minimum quality threshold for release is shown to align private incentives with the social optimum, implementing the first-best.
The system is in production, serving 21 industry verticals with 650+ agents.
Deployment claim reported in paper (production system metrics: number of verticals and agents).
We propose a framework for output-side ontological validation (response validation, reasoning verification, compliance checking).
Proposed framework described in paper (conceptual/procedural proposal; not described as empirically validated in abstract).
We introduce ontology-constrained tool discovery via SQL-pushdown scoring.
Methodological/implementation contribution described in the paper (technical mechanism introduced).
Improvements from ontology coupling are greatest where LLM parametric knowledge is weakest—particularly in Vietnam-localized domains.
Observed pattern reported from the controlled experiment across the five industries, with stronger improvements in Vietnam-localized domains (no per-industry sample sizes reported in abstract).
Ontology-coupled agents significantly outperform ungrounded agents on Role Consistency (p < .001, W = .614).
Controlled experiment with 600 runs; statistical test reported (p-value and W statistic provided in abstract).
Ontology-coupled agents significantly outperform ungrounded agents on Regulatory Compliance (p = .003, W = .318).
Controlled experiment with 600 runs; statistical test reported (p-value and W statistic provided in abstract).
Ontology-coupled agents significantly outperform ungrounded agents on Metric Accuracy (p < .001, W = .460).
Controlled experiment with 600 runs; statistical test reported (p-value and W statistic provided in abstract).
We formalize the concept of asymmetric neurosymbolic coupling, wherein symbolic ontological knowledge constrains agent inputs (context assembly, tool discovery, governance thresholds) while proposing mechanisms for extending this coupling to constrain agent outputs (response validation, reasoning verification, compliance checking).
Theoretical/formalization contribution described in the paper (conceptual and methodological development).
Our approach introduces a three-layer ontological framework--Role, Domain, and Interaction ontologies--that provides formal semantic grounding for LLM-based enterprise agents.
Design contribution described in the paper (formal model specification).
We present a neurosymbolic architecture implemented within the Foundation AgenticOS (FAOS) platform that addresses these limitations through ontology-constrained neural reasoning.
System design and implementation claim: description of architecture and its implementation in the FAOS platform (technical/design evidence reported in paper).
HEWU is designed to become the cited standard before better-resourced players define competing frameworks, establishing measurement infrastructure for the cognitive industrial revolution the way GAAP established it for capital markets.
Aspirational/strategic claim made by the authors about intended role and adoption of HEWU (no empirical support provided).
In that deployment the framework measured approximately $378,000 in annual labor value of machine-equivalent work.
Same empirical manufacturing deployment reported in the paper (single case/example).
In a representative manufacturing deployment, the framework measured 8.4 FTE of machine-equivalent labor.
Empirical example reported in the paper described as a 'representative manufacturing deployment' (appears to be a single deployment/case).
The paper introduces the Machine Labor Index (HEWU-PSI), a time-series economic indicator designed to track aggregate machine labor output at company, sector, and national level, analogous in function to the Purchasing Managers' Index.
Methodological contribution described in the paper (proposal of an index and its intended scope; no empirical time-series dataset reported).
The paper introduces AILU (AI Labor Units) as a software-specific subset metric.
Methodological contribution described in the paper (definition of a software-specific metric subset).
The paper presents the conceptual foundation, mathematical model (HEWU = MO ÷ HB × CF × QF), calibration framework, Baseline Library architecture, and auditability mechanisms underlying the standard.
Paper's methodological content (explicit model formula and supporting frameworks described).
This paper introduces the Human-Equivalent Work Unit (HEWU), a standardized metric that converts AI and automation system output into human labor equivalents, expressed as full-time employee (FTE) equivalents and annual labor value ($).
Methodological contribution described in the paper (definition and proposal of a new metric; no empirical validation sample reported).
Artificial intelligence systems are autonomous agents performing economically meaningful labor at scale across customer service, software engineering, logistics, manufacturing, and knowledge work.
Author's conceptual/empirical assertion in the paper (no specific sample, presented as general observation).
This research contributes to debates about the future of work, power asymmetries in platform economies, and the development of worker-protective regulatory frameworks, engaging perspectives from feminist economics, institutional theory, and surveillance capitalism studies.
Stated contribution in the abstract based on theoretical engagement and literature synthesis (conceptual claim; no empirical citation in abstract).
Theoretical frameworks developed in the paper require future empirical validation via case studies, quantitative analysis, and ethnographic research.
Methodological statement within the abstract describing the paper's limitations and next steps (self-report about the paper's status).
The study proposes institutional frameworks for realizing labor value and for worker-protective regulatory frameworks applicable to digital/platform economies.
Normative/theoretical proposals derived from conceptual analysis and engagement with feminist economics, institutional theory, and surveillance capitalism literature (no empirical testing reported).
The paper identifies key characteristics of value formation specific to platform economies.
Theoretical framework and literature synthesis presented in the study (conceptual; no empirical cases reported in abstract).
Living labor remains the sole source of new value; the core insights of the labor theory of value remain essential for critiquing contemporary digital capitalism.
Argumentative/theoretical development grounded in Marxist political economy and literature synthesis (conceptual paper, no empirical testing reported).
AI should be classified as constant capital rather than as labor.
Theoretical analysis and critical literature synthesis in a conceptual study (no empirical sample reported).
Results may be applied in the development of financial institution strategies, regulatory frameworks, risk management systems and professional training programmes.
Applied implications drawn from the literature synthesis and comparative analysis; presented as potential uses rather than empirically validated interventions.
Significant changes in human resource needs are occurring, with growing demand for analysts and specialists combining financial and technological competencies.
Conclusion from literature review and synthesis of international studies on labour demand in finance under Big Data/AI adoption; no original labour-market survey included.
Big Data and AI technologies significantly improve efficiency, risk assessment accuracy, fraud detection and financial inclusion.
The paper reports results from a qualitative analysis of recent academic literature, comparative analysis of sector-specific applications, and synthesis of empirical findings from international studies; no primary sample size reported.
Overall, findings highlight that AI serves as a revolutionary (transformative) tool rather than merely a replacement tool for employment—changing the nature of human work rather than simply disengaging it.
Synthesis conclusion in the paper drawing on the literature review and the authors' empirical results indicating task reallocation and changing job content.
The paper argues for equal technology governance as a necessary policy response to AI's labor market effects.
Policy recommendations discussed in the paper that call for equitable governance of AI; based on literature synthesis and empirical findings.
The analysis raises policy implications emphasizing reskilling and education to address AI-driven changes in the labor market.
Policy discussion section summarized in the paper; draws on empirical findings and literature to recommend reskilling/education.
Moderate AI usage is associated with employment growth.
Part of the U-shaped relationship reported in the paper's empirical results; described qualitatively in the abstract/summary.
Strong governance and advanced digital infrastructure are critical for realizing AI’s potential as a sustainable technology—governance-driven digital transformation is important for achieving sustainable growth.
Interpretation and policy implication drawn from the empirical findings that GQI and DII mitigate the AI→CO2 relationship in the 104-country panel analysis (2000–2023) employing GMM and 2SLS.
The environmental impact of AI is stronger in energy-inefficient and AI-advanced contexts.
Heterogeneity analysis in which the AI→CO2 effect is reported as larger for energy-inefficient countries and for countries in more advanced stages of AI diffusion (same 104-country panel, 2000–2023).
Adoption of AI currently contributes to higher CO2 emissions.
Empirical panel analysis of 104 countries over 2000–2023 using two-step system GMM and two-stage least squares (2SLS) estimations; AI adoption variable positively associated with country-level CO2 emissions in the reported regressions.
To optimize agentic AI integration and ensure responsible innovation across financial services, interdisciplinary, longitudinal research and robust governance frameworks are needed.
Authors' conclusions and recommendations based on the identified findings and gaps in the reviewed literature.