Evidence (5586 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	417	113	67	480	1091
Governance & Regulation	419	202	124	64	823
Research Productivity	261	100	34	303	703
Organizational Efficiency	406	96	71	40	616
Technology Adoption Rate	323	128	74	38	568
Firm Productivity	307	38	70	12	432
Output Quality	260	71	27	29	387
AI Safety & Ethics	118	179	45	24	368
Market Structure	107	128	85	14	339
Decision Quality	177	75	37	19	312
Fiscal & Macroeconomic	89	58	33	22	209
Employment Level	74	34	78	9	197
Skill Acquisition	98	36	40	9	183
Innovation Output	121	12	24	13	171
Firm Revenue	98	35	24	—	157
Consumer Welfare	73	31	37	7	148
Task Allocation	87	16	34	7	144
Inequality Measures	25	76	32	5	138
Regulatory Compliance	54	61	13	3	131
Task Completion Time	89	7	4	3	103
Error Rate	44	51	6	—	101
Training Effectiveness	58	12	12	16	99
Worker Satisfaction	47	33	11	7	98
Wages & Compensation	54	15	20	5	94
Team Performance	47	12	15	7	82
Automation Exposure	27	26	10	6	72
Job Displacement	6	39	13	—	58
Hiring & Recruitment	40	4	6	3	53
Developer Productivity	34	4	3	1	42
Social Protection	22	11	6	2	41
Creative Output	16	7	5	1	29
Labor Share of Income	12	6	9	—	27
Skill Obsolescence	3	20	2	—	25
Worker Turnover	10	12	—	3	25

Adoption Remove filter

Coordinated digital green development strategies are important to promote a more balanced and inclusive transition toward China’s dual-carbon goals.

Policy implication drawn from the study's empirical findings (AI reduces inequality while green innovation has not diffused), recommending coordinated digital and green development to achieve balanced outcomes.

medium positive Artificial intelligence, green innovation, and regional carb... balanced and inclusive transition to carbon peak and neutrality goals

Autonomous coding agents, able to create branches, open pull requests, and perform code reviews, now actively contribute to real-world projects.

Empirical observations reported in the dataset and study showing agent-originated branches, PRs, and review actions in open-source projects (paper asserts these actions occurred in real projects).

medium positive Investigating Autonomous Agent Contributions in the Wild: Ac... presence of agent-originated development activities (branches, PRs, reviews)

Workplace organization (W) materially modifies the augmentation function so that two firms with identical technology investments can realize 'radically different' augmentation outcomes.

Conceptual claim supported by the paper's theoretical model (phi(D,W)) and cited empirical illustration (Colombia EDIT survey interaction result).

medium positive From Automation to Augmentation: A Framework for Designing H... augmentation outcomes / returns to technology

AI enhances innovation and productivity, even though it currently contributes to higher CO2 emissions.

Statement in the study linking AI adoption to improvements in innovation and productivity alongside the empirical finding of higher CO2 emissions (based on the same cross-country panel analysis over 2000–2023).

medium positive Artificial Intelligence: A Blessing or a Curse for Climate A... innovation and productivity

The revealed preference approach is a powerful mechanism for communicating human preferences to AI agents, but its success depends on careful implementation.

Overall findings from the online experiment showing higher predictive accuracy from revealed preferences combined with contextual results about subjects' choices and AI alignment; authors' synthesis and recommendation.

medium positive Should I State or Should I Show? Aligning AI with Human Pref... effectiveness of revealed-preference communication for aligning AI with human pr...

Because other AI systems exhibit similar scaling-law economics, the mechanisms identified extend beyond computer vision, reinforcing that partial automation is often the economically rational long-run outcome, not merely a transitional phase.

Theoretical argument generalized from scaling-law evidence in the paper; no additional cross-domain empirical evidence reported in the summary.

medium positive Economics of Human and AI Collaboration: When is Partial Aut... prevalence of partial automation across AI application domains

These findings support the practical value of structured intent representation as a robust, protocol-like communication layer for human-AI interaction.

Aggregate interpretation of the experimental results (cross-language variance reduction, model compensation pattern, equivalence of structured frameworks, and user-study improvements).

medium positive Structured Intent as a Protocol-Like Communication Layer: Cr... practical utility / robustness of structured intent representations

Intelligent manufacturing policies can generate economically meaningful benefits by improving firms’ sustainability performance and the credibility of ESG information, which are central to capital allocation and the effectiveness of green governance.

Synthesis/implication drawn from the empirical findings reported in the paper (positive effects on ESG ratings, reduced greenwashing, and lower ESG uncertainty).

medium positive Intelligent Manufacturing Demonstration Projects Driving Cor... sustainability performance and credibility of ESG information

The growth of digital platforms contributes to the decentralization of job creation.

Paper cites contemporary data on the growth of digital platforms as part of its analysis (no specific platform-level datasets or sample sizes cited in the abstract).

medium positive AI Civilization and the Transformation of Work role of digital platforms in job creation / decentralization

AI-enabled ESG ratings, green innovation, ethical AI, RegTech, and explainable AI in finance are becoming highly influential in international financial markets.

Paper identifies these themes as emerging and influential based on trends in the reviewed literature and topical focus areas; no quantitative adoption metrics or sample sizes are provided in the excerpt.

medium positive Artificial intelligence in sustainable finance and Environme... influence/adoption of specific AI-related ESG themes in financial markets

Public Model Context Protocol (MCP) server repositories are the current predominant standard for agent tools.

Paper asserts MCP servers are the predominant standard and uses these repositories as the primary monitoring source.

medium positive How are AI agents used? Evidence from 177,000 MCP tools predominance of MCP servers as a standard for agent tools

Drawing on analysis of agentic investment firm operational models demonstrating 50-70% cost reductions while maintaining fiduciary standards.

Internal analysis/modeling of agentic investment firm operational models reported by the authors; paper states the 50–70% cost reduction result but provides no sample size or detailed empirical validation in the provided text.

medium positive STRENGTHENING FINANCIAL WORKFORCE COMPETITIVENESS: A CURRICU... operational costs of investment firms (cost reduction)

Using machine learning applied to news streams constitutes a practical method to augment existing fiscal surveillance tools.

Paper asserts practical applicability of ML + news for surveillance; presented as recommendation/claim rather than documented large-sample trial in the provided excerpt.

medium positive Research on the Construction of an AI-Driven Financial Regul... surveillance capability of fiscal monitoring systems

Incorporating news-based signals into machine-learning models can enhance regulatory practice by improving detection of potential fiscal instabilities.

Paper claims an empirical analysis and synthesizes findings linking news-derived signals and ML methods to improved regulatory monitoring; specific datasets, evaluation metrics, and sample sizes are not provided in the excerpt.

medium positive Research on the Construction of an AI-Driven Financial Regul... detection accuracy and timeliness of identifying fiscal instabilities

The framework offers a replicable model for governments and institutions seeking to proactively support high-potential innovations across sectors.

Paper asserts replicability and applicability to governments/institutions based on the described methods and outputs; no deployment case studies or empirical replication evidence reported in text provided.

medium positive Emerging Technologies Based on Large AI Models and the Desig... replicability and applicability of the framework for proactive policy support

A data-driven, foresight-based approach to policy design significantly enhances responsiveness, precision, and resource efficiency in science and technology governance.

Paper concludes this benefit based on its integrated framework, triangulation, Delphi/AHP validation and illustrative mapping; no quantified comparative metrics or experimental evaluation reported in text provided.

medium positive Emerging Technologies Based on Large AI Models and the Desig... effectiveness of data-driven, foresight-based policy design (responsiveness, pre...

Fostering digital transformation alongside workforce reskilling and innovation-ecosystem development is essential for sustainable industrial growth and strengthening Kazakhstan’s global economic position.

Policy and strategic recommendations based on the study's empirical results, case studies, and macro-level index comparisons.

medium positive Digitalization and labor costs: efficiency of industrial ent... sustainable industrial growth / global economic position

Digital transformation combined with workforce retraining optimizes labor costs and enhances productivity.

Synthesis of enterprise-level case examples and aggregated regression/correlation findings at industry and national levels that link digitalization and retraining programs to labor-cost and productivity indicators.

medium positive Digitalization and labor costs: efficiency of industrial ent... labor costs per unit of production

Overall, the DRL framework enhances traffic capacity and fuel efficiency without compromising safety.

Aggregate interpretation of simulation results comparing DRL-based AV control to IDM across capacity, fuel efficiency, and safety metrics within the simulated scenarios. Specific safety metrics and sample sizes are not described in the claim text.

medium positive Macroscopic Characteristics of Mixed Traffic Flow with Deep ... traffic capacity, fuel efficiency, and safety

These findings provide an early empirical baseline and point toward competitive plurality rather than winner-take-all consolidation among engaged users.

Interpretation synthesized from survey results (multi-platform usage, indistinguishable satisfaction among top platforms, differing adoption reasons); overall sample N=388.

medium positive Beyond Benchmarks: How Users Evaluate AI Chat Assistants market structure (likelihood of plurality vs winner-take-all)

Switching costs between platforms are negligible (users treat these tools as interchangeable utilities rather than sticky ecosystems).

Survey responses indicating platform-switching behavior and perceived costs; inference based on reported multi-platform usage and responses about platform loyalty/switching (overall N=388).

medium positive Beyond Benchmarks: How Users Evaluate AI Chat Assistants perceived switching costs / platform stickiness

This work demonstrates the technical feasibility of scalable, AI-augmented quality assessment for early childhood education and lays a foundation for continuous, inclusive AI-assisted evaluation enabling systemic improvement and equitable growth.

Overall results of dataset release, Interaction2Eval performance (agreement), and deployment efficiency reported in the paper; used by the authors to argue broader feasibility and potential systemic impact.

medium positive When AI Meets Early Childhood Education: Large Language Mode... feasibility and systemic impact of AI-augmented assessment

AI-assisted monitoring could shift assessment practice from annual expert audits to monthly AI-assisted monitoring with targeted human oversight.

Authors' synthesis combining dataset-scale results, Interaction2Eval performance (agreement), and deployment efficiency gains to argue feasibility of more frequent monitoring.

medium positive When AI Meets Early Childhood Education: Large Language Mode... frequency of quality monitoring (audit cadence)

Digital transformation enhances the relational embeddedness among cities, and this enhanced relational embeddedness facilitates improved outcomes in collaborative innovation (mediating mechanism).

Mediation analysis / network metric analysis using city-level relational embeddedness measures computed from patent collaboration networks and digital transformation indicators from A-share listed companies (2011–2021).

medium positive How Does Digital Transformation Affect Cross-Regional Collab... relational embeddedness among cities and its mediating effect on collaborative i...

Robust arbitrage strategies remain profitable even when generalized across different domains (claim reiteration emphasizing cross-domain profitability and robustness).

Repeated/strengthened claim in the paper referencing multiple experiments and robustness checks across domains.

medium positive Computational Arbitrage in AI Model Markets cross-domain profitability of arbitrage strategies

An arbitrageur can efficiently allocate inference budget across providers to undercut the market, creating a competitive offering with no model-development risk.

Methodological description and empirical demonstration in the paper showing arbitrageur strategies that allocate inference budget across multiple providers to create a competitive service without incurring model-development risk.

medium positive Computational Arbitrage in AI Model Markets ability to undercut market prices and create competitive offering without model ...

Arbitrage reduces market segmentation and facilitates market entry for smaller model providers by enabling earlier revenue capture.

Reported analysis and/or experiments suggesting arbitrage homogenizes offerings (reduces segmentation) and allows smaller providers to capture revenue earlier through arbitrage-enabled routes.

medium positive Computational Arbitrage in AI Model Markets market segmentation and ease of market entry for smaller model providers

Robust arbitrage strategies that generalize across different domains remain profitable.

Reported experiments indicating that arbitrage strategies generalized beyond the primary SWE-bench domain and still yielded profit (authors state robust strategies remain profitable across domains).

medium positive Computational Arbitrage in AI Model Markets profitability of arbitrage strategies across multiple domains

Arbitrage is viable in AI model markets (we empirically demonstrate the viability of arbitrage and illustrate its economic consequences).

Empirical experiments and analyses presented in the paper (case study on SWE-bench and additional experiments on arbitrage strategies).

medium positive Computational Arbitrage in AI Model Markets viability/profitability and economic impact of arbitrage strategies

The ACT represents the first open-source effort to consolidate data on Africa's evolving HPC landscape, aiming to encourage more transparency from local AI stakeholders and facilitate broader access for AI developers.

Authors' characterization of ACT as a novel, open-source consolidation; assertion based on literature/tools review performed by the authors and on the tool's stated goals.

medium positive Take the Train: Africa at the Crossroad of Modern AI transparency and access to HPC resources for AI developers

This systematic framework can help predict at a detailed level where today's AI systems can and cannot be used and how future AI capabilities may change this.

Interpretive/utility claim: authors argue that the ontology plus classification results serve as rough predictive tools for AI applicability across work activities.

medium positive Where can AI be used? Insights from a deep ontology of work ... predictive usefulness of the ontology for AI applicability across tasks

EnterpriseLab provides enterprises a practical path to deploying capable, privacy-preserving agents without compromising operational capability.

Conclusion drawn by the authors based on the platform design and the reported empirical results (performance parity with GPT-4o, cost reductions, benchmark robustness). The abstract offers this as a high-level takeaway rather than a quantified empirical claim.

medium positive EnterpriseLab: A Full-Stack Platform for developing and depl... practicality of enterprise deployment balancing capability, privacy, and operati...

This pattern suggests that AI search may make hotel discovery less exclusively controlled by commission-based intermediaries (OTAs).

Interpretation/inference from the observed higher non-OTA citation shares for experiential queries in the audited Google Gemini sample; not a direct measurement of market outcomes such as bookings or commissions.

medium positive The End of Rented Discovery: How AI Search Redistributes Pow... degree of intermediary (OTA) control over hotel discovery

The results contribute to literature arguing that cloud-based GenAI is a source of enterprise value creation rather than merely an experimental technology.

Paper's stated addition to the existing literature based on the combined empirical and theoretical findings.

medium positive Measuring Business ROI of Generative AI Adoption on Azure Cl... enterprise value creation via GenAI

Our results substantiate the potential of large language models as a foundational pillar for high-fidelity, scalable decision simulation and latter analysis in the real economy based on foundational database.

High-level conclusion drawn from the paper's experiments and methodological contributions; generalization claim asserting LLMs' potential as foundational tools for scalable, high-fidelity decision simulation.

medium positive MALLES: A Multi-agent LLMs-based Economic Sandbox with Consu... potential of LLMs for high-fidelity, scalable decision simulation

Experiments demonstrate that our framework achieves improved simulation stability compared to existing economic and financial LLM simulation baselines.

Empirical claim: experiments vs. baselines showing improved simulation stability (paper statement that framework improved simulation stability, without quantitative details in the excerpt).

medium positive MALLES: A Multi-agent LLMs-based Economic Sandbox with Consu... simulation stability

Experiments demonstrate that our framework achieves significant improvements in purchase quantity prediction compared to existing economic and financial LLM simulation baselines.

Empirical claim: experiments comparing MALLES against existing baselines; paper reports 'significant improvements' in purchase quantity prediction (no numerical values provided in the excerpt).

medium positive MALLES: A Multi-agent LLMs-based Economic Sandbox with Consu... purchase quantity prediction accuracy

Experiments demonstrate that our framework achieves significant improvements in product selection accuracy compared to existing economic and financial LLM simulation baselines.

Empirical claim: experiments comparing MALLES against existing economic and financial LLM simulation baselines; paper reports 'significant improvements' in product selection accuracy (no numerical values provided in the excerpt).

medium positive MALLES: A Multi-agent LLMs-based Economic Sandbox with Consu... product selection accuracy

This preference-learning approach enables the models to internalize and transfer latent consumer preference patterns, thereby mitigating the data sparsity issues prevalent in individual categories.

Claim based on the paper's reported approach: cross-category post-training and transfer of latent preferences; supported by experiments (paper states mitigation of data sparsity).

medium positive MALLES: A Multi-agent LLMs-based Economic Sandbox with Consu... mitigation of data sparsity through cross-category preference transfer

Orchestrated systems of smaller, domain-adapted models can mathematically outperform frontier generalist models in most institutional deployment environments.

Formal conditions and comparative analysis derived in the paper plus referenced/claimed empirical support across several domains (frontier lab dynamics, alignment evolution, sovereign AI pressures).

medium positive Punctuated Equilibria in Artificial Intelligence: The Instit... relative institutional performance (smaller domain models vs. frontier generalis...

There are ethical imperatives of fairness and transparency in automated wealth management, and the paper proposes a roadmap toward sustainable and interpretable financial AI.

Normative analysis and proposed roadmap described in the paper; the excerpt does not provide operationalized fairness metrics, interpretability methods, or evaluation results.

medium positive Deep Reinforcement Learning for Dynamic Portfolio Optimizati... ethical compliance measures (fairness, transparency, interpretability) for autom...

In environments characterized by high-frequency data, non-linear dependencies, and stochastic market regimes, autonomous DRL agents can learn optimal sequential decision-making policies that offer a compelling alternative to static or rule-based allocation strategies.

Argument based on theoretical suitability of DRL for sequential decision problems and the paper's system-level investigation; excerpt does not report specific experimental datasets, sample sizes, benchmarks, or performance metrics.

medium positive Deep Reinforcement Learning for Dynamic Portfolio Optimizati... policy optimality / portfolio performance in complex market environments (implie...

The integration of Deep Reinforcement Learning (DRL) into portfolio management represents a significant evolution from classical Mean-Variance Optimization and modern econometric frameworks.

Conceptual comparison and synthesis presented in the paper; no empirical sample size or experimental results are provided in the excerpt to quantify the degree of improvement.

medium positive Deep Reinforcement Learning for Dynamic Portfolio Optimizati... methodological advancement in portfolio management (shift from static optimizati...

Regulatory sandboxes offer a flexible and innovation-friendly governance model compared to traditional command-and-control mechanisms.

Normative and comparative analysis within a law & economics framework; no empirical performance data reported in the abstract.

medium positive Experimentalism beyond ex ante regulation: A law and economi... flexibility of governance and degree of innovation-friendliness

Comparative insights from FinTech identify the institutional design features necessary to ensure the effectiveness and resilience of regulatory sandboxes.

Comparative case-based reasoning drawing on FinTech regulatory sandbox experience (abstract does not report number or selection of cases).

medium positive Experimentalism beyond ex ante regulation: A law and economi... presence and performance of institutional design features (effectiveness/resilie...

AI regulatory sandboxes may correct specific government failures, including regulatory capture, rent-seeking, and knowledge gaps.

Analytical claims supported by comparative reasoning (FinTech examples) and economic analysis of government failure; no empirical testing or sample size reported in the abstract.

medium positive Experimentalism beyond ex ante regulation: A law and economi... incidence/severity of government failures such as regulatory capture, rent-seeki...

AI regulatory sandboxes facilitate iterative regulatory learning while promoting responsible AI innovation.

Theoretical argument using experimentalist governance concepts and law & economics reasoning; comparative insights referenced but no empirical sample detailed in the abstract.

medium positive Experimentalism beyond ex ante regulation: A law and economi... degree of regulatory learning and indicators of responsible AI innovation

AI regulatory sandboxes can reduce negative externalities associated with AI deployment.

Conceptual and economic analysis in the paper (no empirical quantification or sample size reported in the abstract).

medium positive Experimentalism beyond ex ante regulation: A law and economi... magnitude/frequency of negative externalities (e.g., harms from AI systems)

AI regulatory sandboxes can mitigate information asymmetries between regulators and firms.

Analytical application of an economic analysis of law framework; theoretical argumentation rather than reported empirical measurement in the abstract.

medium positive Experimentalism beyond ex ante regulation: A law and economi... level of information asymmetry between regulators and AI firms

JobMatchAI provides factor-wise explanations through resume-driven search workflows.

Paper states that the system gives factor-wise explanations and ties them to resume-driven workflows; the excerpt references interpretable reranking and demo artifacts but does not include user study or explanation-faithfulness metrics.

medium positive JobMatchAI An Intelligent Job Matching Platform Using Knowle... explainability: factor-wise explanations presented to users within resume-driven...

« Prev 1 2 3 … 71 72 73 … 111 112 Next »