Evidence (13661 claims)
Adoption
8339 claims
Productivity
7479 claims
Governance
6715 claims
Human-AI Collaboration
6267 claims
Org Design
4098 claims
Innovation
3987 claims
Labor Markets
3488 claims
Skills & Training
2888 claims
Inequality
2016 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 740 | 192 | 95 | 871 | 1945 |
| Governance & Regulation | 796 | 388 | 185 | 119 | 1512 |
| Organizational Efficiency | 765 | 186 | 123 | 82 | 1166 |
| Technology Adoption Rate | 610 | 227 | 121 | 95 | 1061 |
| Research Productivity | 409 | 121 | 56 | 331 | 928 |
| Output Quality | 464 | 174 | 58 | 47 | 743 |
| Decision Quality | 318 | 173 | 75 | 42 | 615 |
| Firm Productivity | 432 | 55 | 88 | 20 | 601 |
| AI Safety & Ethics | 214 | 273 | 65 | 33 | 589 |
| Market Structure | 175 | 165 | 120 | 24 | 489 |
| Task Allocation | 206 | 64 | 70 | 31 | 376 |
| Skill Acquisition | 161 | 57 | 57 | 16 | 291 |
| Innovation Output | 201 | 27 | 41 | 18 | 288 |
| Fiscal & Macroeconomic | 130 | 69 | 43 | 26 | 275 |
| Employment Level | 104 | 50 | 105 | 13 | 274 |
| Consumer Welfare | 116 | 62 | 42 | 11 | 231 |
| Firm Revenue | 149 | 45 | 26 | 3 | 223 |
| Inequality Measures | 43 | 120 | 49 | 6 | 218 |
| Task Completion Time | 164 | 29 | 8 | 12 | 214 |
| Worker Satisfaction | 89 | 60 | 20 | 12 | 181 |
| Error Rate | 69 | 89 | 9 | 2 | 169 |
| Regulatory Compliance | 74 | 67 | 14 | 4 | 159 |
| Training Effectiveness | 91 | 19 | 13 | 19 | 144 |
| Wages & Compensation | 77 | 33 | 25 | 6 | 141 |
| Team Performance | 86 | 17 | 27 | 9 | 140 |
| Automation Exposure | 49 | 50 | 22 | 12 | 136 |
| Developer Productivity | 91 | 17 | 14 | 5 | 128 |
| Job Displacement | 12 | 80 | 19 | 1 | 112 |
| Hiring & Recruitment | 51 | 7 | 8 | 3 | 69 |
| Creative Output | 31 | 16 | 7 | 2 | 57 |
| Skill Obsolescence | 5 | 43 | 6 | 1 | 55 |
| Social Protection | 27 | 16 | 8 | 2 | 53 |
| Labor Share of Income | 17 | 17 | 17 | — | 51 |
| Worker Turnover | 11 | 12 | — | 3 | 26 |
| Industry | — | — | — | 1 | 1 |
AEL outperforms five published self-improving methods and all non-LLM baselines while maintaining the lowest variance among all LLM-based approaches on the benchmark.
Comparative empirical evaluation on the same sequential portfolio benchmark, comparing AEL to five published self-improving methods and multiple non-LLM and LLM baselines (reported relative ranking and variance).
On a sequential portfolio benchmark (10 sector-diverse tickers, 208 episodes, 5 random seeds), AEL achieves a Sharpe ratio of 2.13 ± 0.47.
Empirical experiment on the sequential portfolio benchmark with 10 tickers, 208 episodes, evaluated across 5 random seeds (reported Sharpe ratio and standard deviation).
We introduce Agent Evolving Learning (AEL), a two-timescale framework in which a Thompson Sampling bandit at the fast timescale learns which memory retrieval policy to apply each episode, while LLM-driven reflection at the slow timescale diagnoses failure patterns and injects causal insights into the agent's decision prompt.
Methodological description and proposed algorithmic design in the paper (no additional experimental sample size—design/algorithmic claim).
The sustainability of the algorithmic state rests on a movement from technocratic secrecy to value-based transparency to ensure AI- and human collaboration is founded on institutional accountability and algorithmic justice.
Authorial conclusion from the systematic review synthesis (2018-2026) advocating a policy/practice shift; presented as normative policy recommendation rather than quantified empirical finding.
Empirical evidence shows great gains in efficiency in fiscal forecasting.
Empirical studies included in the PRISMA-guided review (2018-2026) reporting improved fiscal forecasting outcomes; no quantitative effect sizes provided in abstract.
Empirical evidence shows great gains in efficiency at routinised administrative tasks.
Empirical studies reported in the systematic review (2018-2026); the abstract claims empirical evidence of efficiency gains but does not report specific study counts, sample sizes, or effect magnitudes.
Digital infrastructure is a primary determinant of both the pace of AI diffusion and its resulting economic returns.
Synthesis of descriptive patterns, difference-in-differences causal estimates, and instrumental-variable results using Turkish administrative and survey data (2021-2024).
Infrastructure-driven AI adoption shifts labor composition toward ICT-related roles.
Instrumental-variable estimates showing changes in occupational composition (increase in ICT-related roles) associated with infrastructure-driven AI adoption; based on administrative employment data and enterprise survey (Turkey, 2021-2024).
Infrastructure-driven AI adoption raises export intensity.
Instrumental-variable estimates linking infrastructure-driven adoption to firm export intensity using administrative and survey data (Turkey, 2021-2024).
Infrastructure-driven AI adoption raises labor productivity.
Instrumental-variable estimates where infrastructure-driven adoption is instrumented (IV) and linked to firm-level labor productivity measures; data from administrative records and enterprise survey in Turkey (2021-2024).
Improved connectivity (due to pipeline-driven fiber deployment) significantly increases AI adoption, particularly for software-intensive technologies and among small and medium-sized enterprises.
Causal inference using difference-in-differences estimates exploiting staggered pipeline expansion as variation in connectivity; sample drawn from administrative records and nationally representative enterprise survey (Turkey, 2021-2024).
AI adoption is concentrated among large firms and in regions with high-speed broadband and proximity to data centers, particularly for software-intensive and cloud-based applications.
Descriptive analysis using administrative data and a nationally representative enterprise survey from Turkey (2021-2024).
This survey provides scholars and practitioners with a structured understanding of how agentic AI is reshaping financial markets and identifies critical research directions to ensure these systems enhance both operational efficiency and market resilience.
Statement of contribution in the paper; based on the paper's literature review, taxonomy, and identified research agenda.
Agentic AI offers substantial potential for enhanced market efficiency, liquidity provision, and risk management.
Survey synthesis of foundational research, market applications, and technical architectures suggesting potential benefits; no original empirical evaluation reported.
The emergence of agentic AI represents a fundamental transformation in financial markets, characterized by autonomous systems capable of reasoning, planning, and adaptive decision-making with minimal human intervention.
Conceptual claim stated in the survey's introduction and synthesis of recent advances; based on literature review and theoretical framing rather than new empirical data.
Countries around the world are rushing to encourage greater investment and growth in their domestic AI industries.
Statement/observation presented in the paper's introduction; based on the paper's descriptive overview of global policy activity (literature review / policy survey implied). No sample size reported.
Results indicate that AI-assisted text-to-model methods can substantially lower the cost of constructing structured procedural representations, making course-wide deployment of structured AI coaching systems practically feasible.
Conclusion drawn from reported results (e.g., time reductions and modeled outputs); the paper claims that these results imply lower costs and practical feasibility for course-wide deployment.
AI-assisted authoring reduced expert modeling time by 50–70% while producing structurally valid and highly reproducible models under fixed-input conditions.
Quantitative claim reported in the paper comparing expert modeling time with AI assistance and reporting structural validity and reproducibility under fixed-input conditions; exact experimental setup and sample size not stated in the abstract.
We apply the pipeline to instructional materials from a graduate-level online AI course, constructing 23 procedural skill models.
Empirical application reported in the paper: the pipeline was run on course materials and produced 23 models (number explicitly stated).
The approach automates structural scaffolding while preserving expert oversight for validating causal transitions and failure conditions.
Claim about system design and human-in-the-loop workflow reported in the paper; implies human validation steps are maintained alongside automated generation.
We present a human-in-the-loop text-to-model pipeline that uses large language models to transform instructional materials into schema-complete Task-Method-Knowledge models via ontology-constrained prompting and template-based generation.
Methodological contribution described in the paper; pipeline design and implementation reported (no separate quantitative validation in this sentence).
When unfairness is driven by uncertainty (rather than incidental noise), accounting for uncertainty is essential to achieving fair and effective decision-making.
Synthesis/argument based on formalization and simulation experiments showing cases where uncertainty causes unfair outcomes and methods that account for uncertainty mitigate those outcomes.
The proposed framework can help practitioners diagnose, audit, and govern fairness risks in socio-technical decision systems.
Authors propose a diagnostic/audit/governance framework (conceptual contribution) and illustrate its use through examples and simulations; no field deployment evidence provided in the abstract.
Algorithmic examples in the paper demonstrate it is possible to reduce outcome variance for disadvantaged groups while preserving institutional objectives such as expected utility.
Algorithmic examples and simulation experiments reported in the paper demonstrating reductions in outcome variance for disadvantaged groups together with preserved expected utility (results from synthetic/simulated data and model runs).
The authors formalize model and feedback uncertainty using counterfactual logic and reinforcement learning.
Paper describes formalization/mathematical definitions linking counterfactual logic and reinforcement learning to model and feedback uncertainty (theoretical/methodological contribution).
This paper introduces a taxonomy of uncertainty in sequential decision-making consisting of three types: model uncertainty, feedback uncertainty, and prediction uncertainty.
Paper presents a conceptual taxonomy and names the three uncertainty types in the text/abstract; theoretical exposition in the methods/definitions sections (no external empirical sample required).
The emergence of 'Industry 4.0 Inc.' is likely to induce further collaboration among participating incumbents.
Authors' inference based on observed interconnections and overlapping investments in the M&A-based mapping (predictive/interpretive claim; no quantified projection provided in the excerpt).
One consequence of increased M&A activity and overlapping investments is the emergence of interconnections that have given rise to a new structure the authors term 'Industry 4.0 Inc.'
Network mapping of corporate linkages and overlapping investments derived from the M&A deal analysis spanning more than two decades (method: empirical mapping of inter-corporate ties); exact counts not provided in the excerpt.
Mergers and acquisitions are one of the principal tools industrial firms use to overcome this dual challenge.
Authors' argumentation supported by an empirical analysis of more than two decades of M&A deals (method: M&A deal analysis); exact sample size not stated in provided text.
HAF-DS provides a scalable and adaptable solution for modern textile and PPE supply chains.
Author claim in conclusions indicating scalability and adaptability as properties of the proposed framework; supported implicitly by application to multiple datasets.
Coupling predictive forecasting with prescriptive optimization enhances both accuracy and efficiency in textile and PPE supply chains.
Summary conclusion drawn from the reported experimental improvements in forecast errors and operational metrics on textile and PPE datasets.
Service level rose from 95.5% to 97.8%.
Reported experimental operational metric (service level) improvement values under HAF-DS versus baseline (95.5% -> 97.8%).
Stockouts decreased by 27.5%.
Reported experimental operational metric indicating a 27.5% reduction in stockouts under HAF-DS compared to baseline.
Inventory cost decreased by 5.4%.
Reported experimental operational metric (inventory cost) showing 5.4% reduction under HAF-DS relative to baseline.
On the combined dataset, HAF-DS reduced Mean Absolute Percentage Error (MAPE) from 9.5% to 8.1%.
Reported experimental result on the combined dataset comparing MAPE of HAF-DS vs baseline (values given: 9.5% -> 8.1%).
On the combined dataset, HAF-DS reduced Root Mean Squared Error (RMSE) from 19.53 to 17.11 (12.4%).
Reported experimental result on the combined dataset comparing RMSE of HAF-DS vs baseline (values given: 19.53 -> 17.11, with percent reduction 12.4%).
On the combined dataset, HAF-DS reduced Mean Absolute Error (MAE) from 15.04 to 12.83 (14.7%).
Reported experimental result on the combined dataset comparing MAE of HAF-DS vs baseline (values given: 15.04 -> 12.83, with percent reduction 14.7%).
Experiments on textile sales and supply chain datasets show significant performance gains over statistical and deep learning baselines.
Empirical evaluation reported on textile sales and supply chain datasets with comparisons to statistical and deep learning baseline models (datasets described broadly; no sample sizes given).
The framework jointly minimizes forecasting error and operational cost through embedding-based feature representation and recurrent neural architectures.
Paper text describing joint objective (minimize forecasting error and operational cost) and the use of embedding-based features plus recurrent networks to accomplish this.
The optimization layer prescribes cost-efficient replenishment and allocation decisions (MILP).
Method description stating the use of a MILP optimization layer to produce replenishment/allocation decisions aimed at cost efficiency.
The LSTM captures temporal and contextual demand dependencies.
Methodological description asserting LSTM's role in modeling temporal and contextual dependencies within the forecasting module.
The paper proposes a Hybrid AI Framework for Demand-Supply Forecasting and Optimization (HAF-DS), which integrates a Long Short-Term Memory (LSTM)-based demand forecasting module with a mixed integer linear programming (MILP) optimization layer.
Paper description of the proposed framework (design/architecture). Reports integration of LSTM forecasting module and MILP optimization layer as the core contribution.
Dynamic combinations of AI and organizational structure can help managers overcome traditional trade-offs between scale and scope, opening pathways for scalable, cross-market expansion.
Managerial implication drawn from the paper's longitudinal case study of ByteDance; qualitative inference from observed organizational practices and AI deployment patterns.
AI transforms the scale–scope nexus from being a trade-off into a source of strategic advantage.
Synthesis and theoretical claim derived from longitudinal case study of ByteDance showing simultaneous scaling and diversification enabled by AI and organizational design.
AI reverses the conventional logic of the resource-based view: rather than valuable resources enabling diversification, diversification amplifies the value of resources.
Theoretical argument supported by the ByteDance case study; paper presents this as a theorized inversion based on observed patterns in the single-case study.
The value of AI learning transfer across domains is contingent on access to structurally related data that allow learning to transfer across domains.
Claim derived from the ByteDance longitudinal case study showing conditions for successful cross-domain AI transfer (qualitative evidence emphasizing data structure/relatedness).
AI evolves and improves through self-learning and cross-fertilization across domains, becoming increasingly valuable as learning accumulates.
Theoretical claim supported by longitudinal observations from the ByteDance case study (qualitative evidence from repeated AI deployments over time).
ByteDance leveraged AI and adaptive organizational design to scale rapidly and diversify across industries and markets without incurring rising costs or coordination complexity.
Longitudinal single-case (qualitative) study of ByteDance described in the paper; method reported as a longitudinal case study of one firm.
Humble leadership indirectly alleviates the negative indirect effect of HAI-C task complexity on work engagement by enhancing employees' AI self-efficacy.
Reported moderated mediation/conditional process findings from hierarchical regression and bootstrapping on the three-wave matched sample of 497 employees.
AI self-efficacy mitigates (buffers) the negative indirect impact of HAI-C task complexity on employees' work engagement.
Moderated mediation analysis conducted on longitudinal survey data (n=497) using hierarchical regression and bootstrapping; reported in Results that AI self-efficacy weakens the negative indirect effect.