Evidence (5586 claims)
Adoption
5586 claims
Productivity
4857 claims
Governance
4381 claims
Human-AI Collaboration
3417 claims
Labor Markets
2685 claims
Innovation
2581 claims
Org Design
2499 claims
Skills & Training
2031 claims
Inequality
1382 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 417 | 113 | 67 | 480 | 1091 |
| Governance & Regulation | 419 | 202 | 124 | 64 | 823 |
| Research Productivity | 261 | 100 | 34 | 303 | 703 |
| Organizational Efficiency | 406 | 96 | 71 | 40 | 616 |
| Technology Adoption Rate | 323 | 128 | 74 | 38 | 568 |
| Firm Productivity | 307 | 38 | 70 | 12 | 432 |
| Output Quality | 260 | 71 | 27 | 29 | 387 |
| AI Safety & Ethics | 118 | 179 | 45 | 24 | 368 |
| Market Structure | 107 | 128 | 85 | 14 | 339 |
| Decision Quality | 177 | 75 | 37 | 19 | 312 |
| Fiscal & Macroeconomic | 89 | 58 | 33 | 22 | 209 |
| Employment Level | 74 | 34 | 78 | 9 | 197 |
| Skill Acquisition | 98 | 36 | 40 | 9 | 183 |
| Innovation Output | 121 | 12 | 24 | 13 | 171 |
| Firm Revenue | 98 | 35 | 24 | — | 157 |
| Consumer Welfare | 73 | 31 | 37 | 7 | 148 |
| Task Allocation | 87 | 16 | 34 | 7 | 144 |
| Inequality Measures | 25 | 76 | 32 | 5 | 138 |
| Regulatory Compliance | 54 | 61 | 13 | 3 | 131 |
| Task Completion Time | 89 | 7 | 4 | 3 | 103 |
| Error Rate | 44 | 51 | 6 | — | 101 |
| Training Effectiveness | 58 | 12 | 12 | 16 | 99 |
| Worker Satisfaction | 47 | 33 | 11 | 7 | 98 |
| Wages & Compensation | 54 | 15 | 20 | 5 | 94 |
| Team Performance | 47 | 12 | 15 | 7 | 82 |
| Automation Exposure | 27 | 26 | 10 | 6 | 72 |
| Job Displacement | 6 | 39 | 13 | — | 58 |
| Hiring & Recruitment | 40 | 4 | 6 | 3 | 53 |
| Developer Productivity | 34 | 4 | 3 | 1 | 42 |
| Social Protection | 22 | 11 | 6 | 2 | 41 |
| Creative Output | 16 | 7 | 5 | 1 | 29 |
| Labor Share of Income | 12 | 6 | 9 | — | 27 |
| Skill Obsolescence | 3 | 20 | 2 | — | 25 |
| Worker Turnover | 10 | 12 | — | 3 | 25 |
Adoption
Remove filter
Coordinated digital green development strategies are important to promote a more balanced and inclusive transition toward China’s dual-carbon goals.
Policy implication drawn from the study's empirical findings (AI reduces inequality while green innovation has not diffused), recommending coordinated digital and green development to achieve balanced outcomes.
Autonomous coding agents, able to create branches, open pull requests, and perform code reviews, now actively contribute to real-world projects.
Empirical observations reported in the dataset and study showing agent-originated branches, PRs, and review actions in open-source projects (paper asserts these actions occurred in real projects).
Workplace organization (W) materially modifies the augmentation function so that two firms with identical technology investments can realize 'radically different' augmentation outcomes.
Conceptual claim supported by the paper's theoretical model (phi(D,W)) and cited empirical illustration (Colombia EDIT survey interaction result).
AI enhances innovation and productivity, even though it currently contributes to higher CO2 emissions.
Statement in the study linking AI adoption to improvements in innovation and productivity alongside the empirical finding of higher CO2 emissions (based on the same cross-country panel analysis over 2000–2023).
The revealed preference approach is a powerful mechanism for communicating human preferences to AI agents, but its success depends on careful implementation.
Overall findings from the online experiment showing higher predictive accuracy from revealed preferences combined with contextual results about subjects' choices and AI alignment; authors' synthesis and recommendation.
Because other AI systems exhibit similar scaling-law economics, the mechanisms identified extend beyond computer vision, reinforcing that partial automation is often the economically rational long-run outcome, not merely a transitional phase.
Theoretical argument generalized from scaling-law evidence in the paper; no additional cross-domain empirical evidence reported in the summary.
These findings support the practical value of structured intent representation as a robust, protocol-like communication layer for human-AI interaction.
Aggregate interpretation of the experimental results (cross-language variance reduction, model compensation pattern, equivalence of structured frameworks, and user-study improvements).
Intelligent manufacturing policies can generate economically meaningful benefits by improving firms’ sustainability performance and the credibility of ESG information, which are central to capital allocation and the effectiveness of green governance.
Synthesis/implication drawn from the empirical findings reported in the paper (positive effects on ESG ratings, reduced greenwashing, and lower ESG uncertainty).
The growth of digital platforms contributes to the decentralization of job creation.
Paper cites contemporary data on the growth of digital platforms as part of its analysis (no specific platform-level datasets or sample sizes cited in the abstract).
AI-enabled ESG ratings, green innovation, ethical AI, RegTech, and explainable AI in finance are becoming highly influential in international financial markets.
Paper identifies these themes as emerging and influential based on trends in the reviewed literature and topical focus areas; no quantitative adoption metrics or sample sizes are provided in the excerpt.
Public Model Context Protocol (MCP) server repositories are the current predominant standard for agent tools.
Paper asserts MCP servers are the predominant standard and uses these repositories as the primary monitoring source.
Drawing on analysis of agentic investment firm operational models demonstrating 50-70% cost reductions while maintaining fiduciary standards.
Internal analysis/modeling of agentic investment firm operational models reported by the authors; paper states the 50–70% cost reduction result but provides no sample size or detailed empirical validation in the provided text.
Using machine learning applied to news streams constitutes a practical method to augment existing fiscal surveillance tools.
Paper asserts practical applicability of ML + news for surveillance; presented as recommendation/claim rather than documented large-sample trial in the provided excerpt.
Incorporating news-based signals into machine-learning models can enhance regulatory practice by improving detection of potential fiscal instabilities.
Paper claims an empirical analysis and synthesizes findings linking news-derived signals and ML methods to improved regulatory monitoring; specific datasets, evaluation metrics, and sample sizes are not provided in the excerpt.
The framework offers a replicable model for governments and institutions seeking to proactively support high-potential innovations across sectors.
Paper asserts replicability and applicability to governments/institutions based on the described methods and outputs; no deployment case studies or empirical replication evidence reported in text provided.
A data-driven, foresight-based approach to policy design significantly enhances responsiveness, precision, and resource efficiency in science and technology governance.
Paper concludes this benefit based on its integrated framework, triangulation, Delphi/AHP validation and illustrative mapping; no quantified comparative metrics or experimental evaluation reported in text provided.
Fostering digital transformation alongside workforce reskilling and innovation-ecosystem development is essential for sustainable industrial growth and strengthening Kazakhstan’s global economic position.
Policy and strategic recommendations based on the study's empirical results, case studies, and macro-level index comparisons.
Digital transformation combined with workforce retraining optimizes labor costs and enhances productivity.
Synthesis of enterprise-level case examples and aggregated regression/correlation findings at industry and national levels that link digitalization and retraining programs to labor-cost and productivity indicators.
Overall, the DRL framework enhances traffic capacity and fuel efficiency without compromising safety.
Aggregate interpretation of simulation results comparing DRL-based AV control to IDM across capacity, fuel efficiency, and safety metrics within the simulated scenarios. Specific safety metrics and sample sizes are not described in the claim text.
These findings provide an early empirical baseline and point toward competitive plurality rather than winner-take-all consolidation among engaged users.
Interpretation synthesized from survey results (multi-platform usage, indistinguishable satisfaction among top platforms, differing adoption reasons); overall sample N=388.
Switching costs between platforms are negligible (users treat these tools as interchangeable utilities rather than sticky ecosystems).
Survey responses indicating platform-switching behavior and perceived costs; inference based on reported multi-platform usage and responses about platform loyalty/switching (overall N=388).
This work demonstrates the technical feasibility of scalable, AI-augmented quality assessment for early childhood education and lays a foundation for continuous, inclusive AI-assisted evaluation enabling systemic improvement and equitable growth.
Overall results of dataset release, Interaction2Eval performance (agreement), and deployment efficiency reported in the paper; used by the authors to argue broader feasibility and potential systemic impact.
AI-assisted monitoring could shift assessment practice from annual expert audits to monthly AI-assisted monitoring with targeted human oversight.
Authors' synthesis combining dataset-scale results, Interaction2Eval performance (agreement), and deployment efficiency gains to argue feasibility of more frequent monitoring.
Digital transformation enhances the relational embeddedness among cities, and this enhanced relational embeddedness facilitates improved outcomes in collaborative innovation (mediating mechanism).
Mediation analysis / network metric analysis using city-level relational embeddedness measures computed from patent collaboration networks and digital transformation indicators from A-share listed companies (2011–2021).
Robust arbitrage strategies remain profitable even when generalized across different domains (claim reiteration emphasizing cross-domain profitability and robustness).
Repeated/strengthened claim in the paper referencing multiple experiments and robustness checks across domains.
An arbitrageur can efficiently allocate inference budget across providers to undercut the market, creating a competitive offering with no model-development risk.
Methodological description and empirical demonstration in the paper showing arbitrageur strategies that allocate inference budget across multiple providers to create a competitive service without incurring model-development risk.
Arbitrage reduces market segmentation and facilitates market entry for smaller model providers by enabling earlier revenue capture.
Reported analysis and/or experiments suggesting arbitrage homogenizes offerings (reduces segmentation) and allows smaller providers to capture revenue earlier through arbitrage-enabled routes.
Robust arbitrage strategies that generalize across different domains remain profitable.
Reported experiments indicating that arbitrage strategies generalized beyond the primary SWE-bench domain and still yielded profit (authors state robust strategies remain profitable across domains).
Arbitrage is viable in AI model markets (we empirically demonstrate the viability of arbitrage and illustrate its economic consequences).
Empirical experiments and analyses presented in the paper (case study on SWE-bench and additional experiments on arbitrage strategies).
The ACT represents the first open-source effort to consolidate data on Africa's evolving HPC landscape, aiming to encourage more transparency from local AI stakeholders and facilitate broader access for AI developers.
Authors' characterization of ACT as a novel, open-source consolidation; assertion based on literature/tools review performed by the authors and on the tool's stated goals.
This systematic framework can help predict at a detailed level where today's AI systems can and cannot be used and how future AI capabilities may change this.
Interpretive/utility claim: authors argue that the ontology plus classification results serve as rough predictive tools for AI applicability across work activities.
EnterpriseLab provides enterprises a practical path to deploying capable, privacy-preserving agents without compromising operational capability.
Conclusion drawn by the authors based on the platform design and the reported empirical results (performance parity with GPT-4o, cost reductions, benchmark robustness). The abstract offers this as a high-level takeaway rather than a quantified empirical claim.
This pattern suggests that AI search may make hotel discovery less exclusively controlled by commission-based intermediaries (OTAs).
Interpretation/inference from the observed higher non-OTA citation shares for experiential queries in the audited Google Gemini sample; not a direct measurement of market outcomes such as bookings or commissions.
The results contribute to literature arguing that cloud-based GenAI is a source of enterprise value creation rather than merely an experimental technology.
Paper's stated addition to the existing literature based on the combined empirical and theoretical findings.
Our results substantiate the potential of large language models as a foundational pillar for high-fidelity, scalable decision simulation and latter analysis in the real economy based on foundational database.
High-level conclusion drawn from the paper's experiments and methodological contributions; generalization claim asserting LLMs' potential as foundational tools for scalable, high-fidelity decision simulation.
Experiments demonstrate that our framework achieves improved simulation stability compared to existing economic and financial LLM simulation baselines.
Empirical claim: experiments vs. baselines showing improved simulation stability (paper statement that framework improved simulation stability, without quantitative details in the excerpt).
Experiments demonstrate that our framework achieves significant improvements in purchase quantity prediction compared to existing economic and financial LLM simulation baselines.
Empirical claim: experiments comparing MALLES against existing baselines; paper reports 'significant improvements' in purchase quantity prediction (no numerical values provided in the excerpt).
Experiments demonstrate that our framework achieves significant improvements in product selection accuracy compared to existing economic and financial LLM simulation baselines.
Empirical claim: experiments comparing MALLES against existing economic and financial LLM simulation baselines; paper reports 'significant improvements' in product selection accuracy (no numerical values provided in the excerpt).
This preference-learning approach enables the models to internalize and transfer latent consumer preference patterns, thereby mitigating the data sparsity issues prevalent in individual categories.
Claim based on the paper's reported approach: cross-category post-training and transfer of latent preferences; supported by experiments (paper states mitigation of data sparsity).
Orchestrated systems of smaller, domain-adapted models can mathematically outperform frontier generalist models in most institutional deployment environments.
Formal conditions and comparative analysis derived in the paper plus referenced/claimed empirical support across several domains (frontier lab dynamics, alignment evolution, sovereign AI pressures).
There are ethical imperatives of fairness and transparency in automated wealth management, and the paper proposes a roadmap toward sustainable and interpretable financial AI.
Normative analysis and proposed roadmap described in the paper; the excerpt does not provide operationalized fairness metrics, interpretability methods, or evaluation results.
In environments characterized by high-frequency data, non-linear dependencies, and stochastic market regimes, autonomous DRL agents can learn optimal sequential decision-making policies that offer a compelling alternative to static or rule-based allocation strategies.
Argument based on theoretical suitability of DRL for sequential decision problems and the paper's system-level investigation; excerpt does not report specific experimental datasets, sample sizes, benchmarks, or performance metrics.
The integration of Deep Reinforcement Learning (DRL) into portfolio management represents a significant evolution from classical Mean-Variance Optimization and modern econometric frameworks.
Conceptual comparison and synthesis presented in the paper; no empirical sample size or experimental results are provided in the excerpt to quantify the degree of improvement.
Regulatory sandboxes offer a flexible and innovation-friendly governance model compared to traditional command-and-control mechanisms.
Normative and comparative analysis within a law & economics framework; no empirical performance data reported in the abstract.
Comparative insights from FinTech identify the institutional design features necessary to ensure the effectiveness and resilience of regulatory sandboxes.
Comparative case-based reasoning drawing on FinTech regulatory sandbox experience (abstract does not report number or selection of cases).
AI regulatory sandboxes may correct specific government failures, including regulatory capture, rent-seeking, and knowledge gaps.
Analytical claims supported by comparative reasoning (FinTech examples) and economic analysis of government failure; no empirical testing or sample size reported in the abstract.
AI regulatory sandboxes facilitate iterative regulatory learning while promoting responsible AI innovation.
Theoretical argument using experimentalist governance concepts and law & economics reasoning; comparative insights referenced but no empirical sample detailed in the abstract.
AI regulatory sandboxes can reduce negative externalities associated with AI deployment.
Conceptual and economic analysis in the paper (no empirical quantification or sample size reported in the abstract).
AI regulatory sandboxes can mitigate information asymmetries between regulators and firms.
Analytical application of an economic analysis of law framework; theoretical argumentation rather than reported empirical measurement in the abstract.
JobMatchAI provides factor-wise explanations through resume-driven search workflows.
Paper states that the system gives factor-wise explanations and ties them to resume-driven workflows; the excerpt references interpretable reranking and demo artifacts but does not include user study or explanation-faithfulness metrics.