Evidence (7953 claims)
Adoption
5539 claims
Productivity
4793 claims
Governance
4333 claims
Human-AI Collaboration
3326 claims
Labor Markets
2657 claims
Innovation
2510 claims
Org Design
2469 claims
Skills & Training
2017 claims
Inequality
1378 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 402 | 112 | 67 | 480 | 1076 |
| Governance & Regulation | 402 | 192 | 122 | 62 | 790 |
| Research Productivity | 249 | 98 | 34 | 311 | 697 |
| Organizational Efficiency | 395 | 95 | 70 | 40 | 603 |
| Technology Adoption Rate | 321 | 126 | 73 | 39 | 564 |
| Firm Productivity | 306 | 39 | 70 | 12 | 432 |
| Output Quality | 256 | 66 | 25 | 28 | 375 |
| AI Safety & Ethics | 116 | 177 | 44 | 24 | 363 |
| Market Structure | 107 | 128 | 85 | 14 | 339 |
| Decision Quality | 177 | 76 | 38 | 20 | 315 |
| Fiscal & Macroeconomic | 89 | 58 | 33 | 22 | 209 |
| Employment Level | 77 | 34 | 80 | 9 | 202 |
| Skill Acquisition | 92 | 33 | 40 | 9 | 174 |
| Innovation Output | 120 | 12 | 23 | 12 | 168 |
| Firm Revenue | 98 | 34 | 22 | — | 154 |
| Consumer Welfare | 73 | 31 | 37 | 7 | 148 |
| Task Allocation | 84 | 16 | 33 | 7 | 140 |
| Inequality Measures | 25 | 77 | 32 | 5 | 139 |
| Regulatory Compliance | 54 | 63 | 13 | 3 | 133 |
| Error Rate | 44 | 51 | 6 | — | 101 |
| Task Completion Time | 88 | 5 | 4 | 3 | 100 |
| Training Effectiveness | 58 | 12 | 12 | 16 | 99 |
| Worker Satisfaction | 47 | 32 | 11 | 7 | 97 |
| Wages & Compensation | 53 | 15 | 20 | 5 | 93 |
| Team Performance | 47 | 12 | 15 | 7 | 82 |
| Automation Exposure | 24 | 22 | 9 | 6 | 62 |
| Job Displacement | 6 | 38 | 13 | — | 57 |
| Hiring & Recruitment | 41 | 4 | 6 | 3 | 54 |
| Developer Productivity | 34 | 4 | 3 | 1 | 42 |
| Social Protection | 22 | 10 | 6 | 2 | 40 |
| Creative Output | 16 | 7 | 5 | 1 | 29 |
| Labor Share of Income | 12 | 5 | 9 | — | 26 |
| Skill Obsolescence | 3 | 20 | 2 | — | 25 |
| Worker Turnover | 10 | 12 | — | 3 | 25 |
There is recent and increasing interest in forming teams of LLMs (LLM teams).
Claim made in the paper asserting increased interest and deployment at scale; supported in the paper by literature/contextual citations and reported deployments (specific numbers or studies not provided in the excerpt).
The study contributes a conceptual architecture for next-generation accounting automation that bridges traditional compliance models and modern financial infrastructure (enabling real-time validation, automation, and transparency).
Presentation of a proposed conceptual architecture in the paper, supported by empirical evaluation and stakeholder feedback; claimed as a primary contribution. (The summary does not include architecture diagrams, implementation details, or performance benchmarks beyond the reported metrics.)
Integrating ML and blockchain represents a transformative shift that addresses limitations of traditional financial governance (static ledgers, manual reconciliation, retrospective audits).
High-level argument supported by the study's empirical improvements (fraud detection, reconciliation time, transaction accuracy) and conceptual analysis mapping system capabilities to shortcomings of traditional models. (This is a synthesis/interpretation rather than a single measured outcome.)
Stakeholder validation confirms the system's operational feasibility with 95% approval.
Stakeholder validation (presumably via survey or consultation) reporting 95% approval for operational feasibility. (The summary does not specify the number of stakeholders, selection criteria, or survey instrument.)
The study validates theoretical frameworks such as triple-entry accounting (Grigg, 2024) and X-Accounting (Faccia et al., 2020).
Conceptual/theoretical alignment demonstrated by mapping the hybrid ML-blockchain architecture and empirical findings to the premises of the cited frameworks. (Summary does not specify formal validation method or criteria.)
The system maintains 99.8% transaction accuracy.
Reported transaction accuracy measured on the same empirical datasets (public-sector financial records and private-sector supply chains) used to evaluate the hybrid system. (The summary does not provide sample size, timeframe, or definition of 'transaction accuracy'.)
The hybrid system produces a 60% reduction in reconciliation time.
Empirical measurement of reconciliation time on datasets from public-sector financial records and private-sector supply chains comparing hybrid ML-blockchain workflows to traditional reconciliation processes. (No sample size or absolute times provided in the summary.)
A hybrid ML-blockchain system achieves a 9.8% improvement in fraud detection accuracy (F1-score).
Quantitative evaluation using empirical data drawn from public-sector financial records and private-sector supply chains; improvement reported as change in F1-score between the hybrid system and baseline (traditional) oversight approaches. (Paper does not report sample sizes or exact baseline metrics in the summary.)
Both stable individual differences and moment-to-moment fluctuations in perspective-taking influence AI response quality.
Analyses reported in the paper linking both trait-level (stable) and state-level (moment-to-moment) measures of perspective-taking to variation in AI response quality across the benchmark dataset; assessed via the Bayesian IRT model and supplementary within-subject analyses.
Theory of Mind (the capacity to infer and adapt to others' mental states) emerges as a key predictor of synergy.
Statistical association reported between participants' Theory of Mind measures and the estimated synergy (improvement in performance with AI), based on analysis of the benchmark dataset (n = 667) within the Bayesian IRT framework.
These AI formulation models reduced experimental workload by 30–50%.
Reported in the review as estimated reductions in experimental workload when using AI-driven formulation optimization. The excerpt lacks details on how workload was measured, which experiments were replaced or reduced, and sample sizes.
In formulation optimization, artificial neural networks, neuro-fuzzy systems, and hybrid model-based AI models have been able to predict dissolution profiles and critical quality attributes with accuracy rates of over 90%.
Reported model performance in formulation optimization studies summarized by the review. The excerpt does not include which specific studies, datasets, cross-validation protocols, or sample sizes produced >90% accuracy.
AI has reduced clinical trial duration by up to 59%.
Reported in the review as an observed maximum reduction in trial duration associated with AI-driven approaches. The excerpt omits details on which trials, therapeutic areas, trial phases, or sample sizes produced this figure.
AI has sped up compound screening by 1–2 years.
Presented in the review as a comparative reduction in time-to-screening attributed to AI methods. The excerpt does not provide the underlying studies, screening scope, or sample sizes.
AI-enabled platforms have cut the drug discovery pipeline timelines (compared with the traditional 4–6 years) down to 46 days.
Reported as an outcome of AI-enabled platforms in the review. The excerpt does not list the specific platform(s), individual study design(s), or sample sizes underlying the 46-day figure.
Artificial intelligence (AI) is transforming pharmaceutical research and development (R and D), and making measurable improvements in efficiency, precision, and cost-effectiveness in drug research and development.
Stated as a summary conclusion in the review based on cross-domain literature synthesis. Specific studies or quantitative meta-analytic methods and sample sizes are not provided in the excerpt.
Experiments on simulated and real-world data show that humans assisted by the adaptive AI ensemble achieve significantly higher performance than humans assisted by single AI models trained either for independent AI performance or for human-AI team performance.
Empirical experiments reported in the paper on both simulated datasets and real-world data; the abstract states results are statistically significant but does not provide sample sizes, datasets, or statistical details in the excerpt.
An adaptive AI ensemble that toggles between two specialist models (an aligned model and a complementary model) using a Rational Routing Shortcut mechanism overcomes the complementarity–alignment limitation of single-model approaches.
Methodological contribution described in the paper; includes the design of the ensemble and the Rational Routing Shortcut; theoretical guarantees of near-optimality are claimed in the paper (proofs referenced but not shown in the excerpt).
EASP offers a practical tradeoff between reasoning quality and latency by avoiding iterative LLM tool-calls at inference time while still producing grounded plans.
Methodological claim in the paper: Probe-then-Plan uses a lightweight probe to avoid heavy iterative LLM tool calls during serving; supported by design rationale and performance-focused evaluations (offline and online).
EASP has been successfully deployed in JD.com's AI-Search system.
Statement in the paper that EASP was deployed in JD.com's AI-Search system; presumably validated by internal deployment logs and online A/B testing reported.
Online A/B testing on JD.com demonstrates that EASP achieves substantial lifts in UCVR (user conversion rate) and GMV (gross merchandise volume).
Reported results from online A/B testing on JD.com referenced in the paper indicating lifts in UCVR and GMV (no numerical magnitudes provided in the abstract).
Extensive offline evaluations and online A/B testing on JD.com show that EASP significantly improves relevant recall.
Empirical claims in the paper citing extensive offline evaluations and online A/B testing on JD.com as the basis for observed improvements in relevant recall (specific datasets/sizes not reported in the abstract).
Environment-Aware Search Planning (EASP) resolves the blindness-latency dilemma in LLM-based e-commerce search by grounding planning in the real retrieval environment via a Probe-then-Plan mechanism.
Conceptual design and empirical evaluation described in the paper: introduces a lightweight Retrieval Probe to expose a retrieval snapshot and a Planner that diagnoses execution gaps and generates grounded search plans; supported by offline evaluations and online A/B testing on JD.com (section describing method and experiments).
The findings provide valuable insights for entrepreneurs, policymakers, and academic institutions to implement adaptive strategies for sustainable and inclusive entrepreneurial growth in the era of artificial intelligence.
Authors' implications/conclusions based on the study results (n=350; statistical analyses) recommending adaptive strategies targeted at stakeholders.
AI functions as a strategic enabler that reshapes entrepreneurial practices, labour dynamics, and innovation strategies.
Conclusion drawn from the study's quantitative findings (survey of 350, regression/SEM results) that linked AI adoption to changes in opportunity recognition, labour substitution, and innovation processes.
AI-driven innovation processes accelerated product development, improved operational efficiency, and supported experimentation, thereby strengthening entrepreneurial performance.
Survey data from 350 AI-adopting SMEs analyzed with regression and SEM showing positive associations between AI adoption and measures of product development speed, operational efficiency, experimentation, and overall entrepreneurial performance.
AI facilitated labour substitution by automating repetitive tasks, allowing human resources to focus on creative and analytical roles.
Responses from the same sample (n=350) of AI-adopting SME entrepreneurs/managers; descriptive statistics and inferential analyses (regression/SEM) linking AI adoption to increased automation and role reallocation.
AI adoption significantly enhanced opportunity recognition by enabling entrepreneurs to identify emerging market trends, assess risks, and make informed strategic decisions.
Quantitative survey of 350 entrepreneurs and managers of SMEs who had adopted AI; relationships tested using regression analysis and structural equation modelling (SEM) reported a significant positive effect of AI adoption on opportunity recognition.
Sustainable human capital development requires coordinated interaction between education systems, employers, and public institutions.
Normative recommendation derived from the paper's systemic analysis and comparative review of institutional responses; no empirical policy evaluation or quantified cross-country causal analysis reported.
Alignment of educational strategies with labor market dynamics is necessary to support effective reskilling and upskilling.
Supported by comparative assessment of international practices and systemic analysis linking education strategies to labor market requirements; evidence is analytical rather than experimental or longitudinally quantified in the paper.
Effective reskilling and upskilling depend on the development of continuous learning ecosystems.
Analytical conclusion drawn from organizational learning models and international practice comparison; no controlled trials or quantitative evaluation of specific ecosystems reported.
As technological change accelerates, the ability of individuals and organizations to adapt becomes a central condition of economic resilience and long-term competitiveness.
Analytical generalization from organizational learning models and systemic analysis of labor-market dynamics; supported by comparative observations but not by a reported empirical causal study.
AI-based ESG systems are increasingly applied to extract deeper sustainability signals from corporate disclosures, reports and external data sources.
Descriptive claim supported by cited literature and examples of AI applications in ESG analytics within the paper's background (references to recent AI/ESG studies). The summary does not quantify the rate of adoption.
Regression analysis revealed that AI-derived ESG scores were more strongly associated with excess returns than traditional ESG metrics.
Regression models estimating the association between ESG scores (AI-derived vs traditional) and excess returns. The summary does not specify the regression specification, control variables, sample size, time horizon, or statistical significance measures.
AI-driven high-ESG portfolios demonstrated lower downside-risk exposure and smaller maximum drawdowns during market stress, indicating stronger resilience.
Downside-risk and maximum drawdown metrics computed for AI-driven high-ESG portfolios versus comparator portfolios during periods of market stress (portfolio-level analysis). Specific stress period(s), sample size and statistical tests are not provided in the summary.
AI-enhanced high-ESG portfolios achieved higher mean returns and superior Sharpe ratios than both AI-based low-ESG portfolios and traditionally rated ESG portfolios.
Portfolio-level performance comparison reported in the study (mean returns and Sharpe ratios calculated for portfolios constructed using AI-driven ESG indicators versus portfolios using conventional ESG ratings). The summary does not report sample size, time period, market coverage, rebalancing frequency, or statistical significance levels.
The study recommends multi-stakeholder collaborations (policymakers, financial institutions, entrepreneurs) to design inclusive AI solutions, bridge the digital skills gap, and foster an environment for equitable entrepreneurial growth.
Policy and practice recommendations drawn in the paper's conclusion based on empirical findings and interpretation of barriers.
Firms with high AI adoption reported superior decision-making quality compared to low adopters.
Survey comparisons of decision-making quality measures between AI adoption groups in the questionnaire data (N=400), reported as superior for high adopters.
Firms with high AI adoption reported significantly higher financial literacy scores compared to low adopters.
Comparison of financial literacy scores between high and low AI adoption groups derived from the structured questionnaire responses (sample N=400); described as 'significantly higher' in the paper.
There is a positive correlation between the level of AI adoption and key business outcomes.
Survey-based correlational analysis reported in the paper linking self-reported AI adoption level to business outcome measures across the sample of 400 respondents.
Upstream foundation model providers offering fine-tuning and inference services to downstream firms creates a co-creation dynamic that enhances model quality when downstream firms fine-tune models with proprietary data.
Conceptual claim and theoretical framing in the paper: description of an AI supply-chain interaction where providers supply compute/inference and downstream firms fine-tune with proprietary data; the paper posits this co-creation improves model quality as part of the motivating narrative.
Under pro-price-competitive policies or compute subsidies, the provider and downstream firms can achieve higher profits along with greater consumer surplus (a win-win-win outcome).
Equilibrium profit comparisons in the game-theoretic model showing that, in the parameter regions where these policies raise consumer surplus, both the upstream provider's profit and downstream firms' profits also increase relative to the baseline.
Policies that promote quality competition in downstream markets always improve consumer surplus.
Model outcomes: comparative-static and equilibrium results show that strengthening downstream quality competition monotonically increases consumer surplus across the parameter space considered in the paper.
Pro-price-competitive policies and compute subsidies are complementary: each is effective in different cost regimes and together can cover more cases.
Analytical results from the game-theoretic model showing complementary effectiveness across varying compute/preprocessing cost parameters (comparative statics demonstrating non-overlapping regions of effectiveness).
New employment opportunities are emerging in AI-complementary occupations.
Findings from job-posting analyses and other empirical studies summarized in the paper that identify growth in AI-complementary job listings and roles (specific metrics not provided in excerpt).
Generative AI (GenAI), particularly tools such as ChatGPT and Gemini, has rapidly transformed the global technological landscape.
Qualitative/observational statement in paper citing the rapid public adoption of GenAI tools since late 2022; no specific empirical sample sizes reported in the text provided.
Holistic AI integration across supply chain functions yields greater performance benefits than isolated technological implementations.
Comparative analysis using survey and statistical methods (correlation/regression) on data from supply chain professionals; the summary reports superior outcomes for integrated (ecosystem-level) AI adoption versus isolated implementations, but does not provide the comparative metrics or sample breakdown.
AI-enabled performance management plays a mediating role that strengthens the linkage between strategic planning and operational outcomes.
Mediation analysis conducted on survey data from supply chain professionals (manufacturing and service sectors); the summary indicates a mediating effect of performance management but provides no mediation statistics (indirect effect size, confidence intervals) or sample size.
AI-enabled execution emerged as the strongest direct predictor of supply chain performance.
Regression analysis from the quantitative survey of supply chain professionals comparing AI-enabled planning, execution, and performance management as predictors of supply chain performance; specific coefficients, significance levels, and sample size are not reported in the excerpt.
AI integration significantly improved overall supply chain performance.
Quantitative study using data collected from supply chain professionals and analyzed with reliability testing, correlation, and regression methods; the provided text does not include sample size, p-values, or effect magnitudes.