Evidence (7156 claims)
Adoption
5126 claims
Productivity
4409 claims
Governance
4049 claims
Human-AI Collaboration
2954 claims
Labor Markets
2432 claims
Org Design
2273 claims
Innovation
2215 claims
Skills & Training
1902 claims
Inequality
1286 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 369 | 105 | 58 | 432 | 972 |
| Governance & Regulation | 365 | 171 | 113 | 54 | 713 |
| Research Productivity | 229 | 95 | 33 | 294 | 655 |
| Organizational Efficiency | 354 | 82 | 58 | 34 | 531 |
| Technology Adoption Rate | 277 | 115 | 63 | 27 | 486 |
| Firm Productivity | 273 | 33 | 68 | 10 | 389 |
| AI Safety & Ethics | 112 | 177 | 43 | 24 | 358 |
| Output Quality | 228 | 61 | 23 | 25 | 337 |
| Market Structure | 105 | 118 | 81 | 14 | 323 |
| Decision Quality | 154 | 68 | 33 | 17 | 275 |
| Employment Level | 68 | 32 | 74 | 8 | 184 |
| Fiscal & Macroeconomic | 74 | 52 | 32 | 21 | 183 |
| Skill Acquisition | 85 | 31 | 38 | 9 | 163 |
| Firm Revenue | 96 | 30 | 22 | — | 148 |
| Innovation Output | 100 | 11 | 20 | 11 | 143 |
| Consumer Welfare | 66 | 29 | 35 | 7 | 137 |
| Regulatory Compliance | 51 | 61 | 13 | 3 | 128 |
| Inequality Measures | 24 | 66 | 31 | 4 | 125 |
| Task Allocation | 64 | 6 | 28 | 6 | 104 |
| Error Rate | 42 | 47 | 6 | — | 95 |
| Training Effectiveness | 55 | 12 | 10 | 16 | 93 |
| Worker Satisfaction | 42 | 32 | 11 | 6 | 91 |
| Task Completion Time | 71 | 5 | 3 | 1 | 80 |
| Wages & Compensation | 38 | 13 | 19 | 4 | 74 |
| Team Performance | 41 | 8 | 15 | 7 | 72 |
| Hiring & Recruitment | 39 | 4 | 6 | 3 | 52 |
| Automation Exposure | 17 | 15 | 9 | 5 | 46 |
| Job Displacement | 5 | 28 | 12 | — | 45 |
| Social Protection | 18 | 8 | 6 | 1 | 33 |
| Developer Productivity | 25 | 1 | 2 | 1 | 29 |
| Worker Turnover | 10 | 12 | — | 3 | 25 |
| Creative Output | 15 | 5 | 3 | 1 | 24 |
| Skill Obsolescence | 3 | 18 | 2 | — | 23 |
| Labor Share of Income | 7 | 4 | 9 | — | 20 |
Small language models offer privacy-preserving alternatives to frontier models, but their specialization is hindered by fragmented development pipelines that separate tool integration, data generation, and training.
Background claim stated in paper/abstract; no experimental data provided for this statement within the abstract.
Extensive synthetic experiments show that policy regularizations reshape the narrative on what is the best DRL method for inventory management.
Paper states results from extensive synthetic experiments that change which DRL methods are considered best under policy regularization; abstract does not provide the experimental sample size, specific methods, or quantitative comparisons.
Implementation of human-replacing technologies leads to significant transformations in skill demand: it reduces reliance on low-skilled labour while increasing demand for qualified engineers, system operators and specialists in digital technologies.
Sector-specific analysis and review of international labour-market studies cited in the article documenting skill-biased effects of automation and digitalization; qualitative assessment for Ukraine's mining and metallurgical sector under workforce shortage conditions.
Foreign direct investment (FDI) shows an insignificantly positive direct effect on local TFCP but a significantly negative indirect (spillover) effect, attributed to a 'pollution haven' effect.
Spatial Durbin Model estimates for FDI on panel (30 provinces, 2010–2023): direct coefficient positive but not significant; indirect coefficient significantly negative; interpretation given as pollution-haven mechanism.
Industrial intelligence exhibits regional heterogeneity: a significantly negative direct effect in the east, a significantly positive direct effect in the central region, an insignificant direct effect in the west, and positive indirect (spillover) effects in the east and west.
Regional/subsample Spatial Durbin Model analyses dividing the sample into east, central, and west regions (30 provinces, 2010–2023); reported region-specific direct and indirect coefficients and significance levels.
Industrial intelligence has an insignificantly negative direct effect on local TFCP, but its positive spatial spillover effect is significant at the 1% level, producing a significantly positive total effect.
Spatial Durbin Model results for industrial intelligence on panel (30 provinces, 2010–2023): direct coefficient negative and not statistically significant; indirect coefficient positive and significant at 1%; total effect positive and significant.
China's TFCP rose overall from 2010 to 2023 but exhibited a widening regional gap of 'higher in the east, lower in the west'.
Panel data of 30 Chinese provincial-level regions (2010–2023); TFCP measured using an undesirable-output super-efficiency SBM model and summarized temporal and spatial patterns.
The study found a significant transformation of the employment structure under the influence of artificial intelligence.
Empirical analysis using an envelope model ("input" orientation) applied to a sample of European Union countries; the paper reports modeled changes in employment structure attributable to AI diffusion.
For AI: a cohesive professional vocabulary formed rapidly in early 2024, but the practitioner population never cohered.
Empirical finding from analysis of the 8.2M resume dataset showing a rapid increase in the vocabulary-cohesion metric around early 2024 while the population-cohesion metric did not show a corresponding rise.
The framework implies threshold effects in training and capability acquisition: when the teaching horizon lies below the prerequisite depth of the target, additional instruction cannot produce successful completion of teaching; once that depth is reached, completion becomes feasible.
Model-derived threshold result described in the abstract (mathematical analysis of prerequisite depth vs. teaching horizon).
The value of information depends on whether downstream users can absorb and act on it: a signal conveys meaning only to a learner with the structural capacity to decode it (an explanation that clarifies a concept for one user may be indistinguishable from noise to another who lacks the relevant prerequisites).
Conceptual argument motivating the model; theoretical reasoning described in the paper's intro/abstract.
Automation holds significant potential for modernising tax administration, but its success depends on aligning technological innovation with inclusive policy design and institutional capacity.
Overall conclusion of the literature synthesis of 36 peer-reviewed articles; based on patterns of positive impacts conditional on contextual factors and governance highlighted across the studies.
Behavioural responses to automation vary across taxpayer segments: some users embrace automation as a facilitator of compliance while others resist due to perceived opacity and technological anxiety.
Synthesis of behavioural findings from the reviewed literature (36 studies) reporting heterogeneous responses by taxpayer segment, including qualitative reports of resistance and quantitative measures of uptake/adoption.
The effectiveness of automated tax systems is mediated by contingencies including digital literacy, institutional trust, and regulatory clarity.
The review identifies recurring contextual factors across the 36 articles that are reported to moderate or mediate the impact of automation on outcomes (qualitative and quantitative findings cited in the synthesis).
The study identifies the main AI-enabled mechanisms advancing CE principles in smart manufacturing, waste valorisation, supply-chain transparency, and sustainable design.
Bibliometric network analysis of 196 peer-reviewed articles (2023–2024) and systematic review of 104 studies, per the abstract; identification is presented as a product of these analyses.
AI is not an inherent instrument of justice but a malleable socio-technical force whose equitable outcomes depend on policy design and institutional context.
Interpretation and synthesis of empirical results showing conditional and heterogeneous effects of AI; normative conclusion drawn by authors from observed heterogeneity and mediating channels.
Governmental structures, labor supply and demand, and incorporation of financial measures act as key intervening variables affecting achieved ROI from GenAI implementations.
Qualitative synthesis and theoretical analysis reported in the paper identifying contextual/intervening variables.
There is an evident tension between privacy and security in existing AI governance approaches.
Thematic synthesis and co-occurrence network from the reviewed studies identify trade-offs and tensions reported between privacy-preserving approaches and security requirements.
Generative AI serves as an effective 'wingman' for employment lawyers, capable of replacing substantial junior associate work while requiring continued human expertise for client counseling, supervision, and final legal advice preparation.
Authors' synthesis of experimental results showing AI-produced substantive analysis plus discussion about remaining limitations (e.g., citation errors) and required human oversight; qualitative assertion about substitutability for junior associate tasks.
The fragility of 'Pax Silica' has implications for global capitalism, technological governance, and geopolitical stability.
Analytical inference and concluding assessment based on theoretical framework and comparative analysis; no empirical quantification provided in the abstract.
The paper proposes new mechanisms through which big data affects individual welfare (beyond simple productivity gains), linking privacy costs, multiplier effects, and R&D transformation patterns.
Theoretical/mechanism development: the paper articulates new channels in its macro theoretical framework describing how data sharing impacts welfare via multiple mechanisms (model construction and analytic discussion; no empirical/sample validation).
Consumption is affected by the multiplier effect and the transformation patterns of R&D.
Theoretical: model analysis links consumption dynamics to a multiplier effect and to how R&D transforms inputs/outputs (comparative statics/dynamics in the theoretical framework).
Individuals’ welfare is influenced by both the privacy cost of big data sharing and their consumption levels.
Theoretical: welfare in the model is specified as a function of consumption and a privacy cost term arising from big data sharing; result follows from analytic derivation within the model (no empirical/sample data).
PPS gains are task-dependent: gains are large in high-ambiguity business analysis tasks but reverse in low-ambiguity travel planning tasks.
Task-level analysis across the three domains (business, technical, travel) within the controlled study (60 tasks total); authors report differential performance patterns by domain/ambiguity.
Capability and trust formally diverge beyond a critical scale (Capability-Trust Divergence).
Claim of a formal proof in the paper (mathematical / theoretical demonstration). No empirical sample size reported in the excerpt.
The Institutional Scaling Law shows that institutional fitness -- jointly measuring capability, trust, affordability, and sovereignty -- is non-monotonic in model scale, with an environment-dependent optimum N*(ε).
Theoretical derivation / analytic model presented in the paper (formal derivation of an 'Institutional Scaling Law'). No empirical sample size reported in the excerpt.
AI usage has dual effects on employees: it can both enhance innovative behavior and predict disengagement, as revealed by a dual-path (SOR-based) model.
Interpretation/synthesis from the four-stage longitudinal study of 285 finance professionals using a dual-path model based on SOR theory (combining the mediation and moderation results).
We evaluate 14 LLMs under zero-shot prompting and retrieval-augmented settings and witness a clear performance gap.
Experimental evaluation reported in the paper: authors state they ran experiments on 14 different large language models, under zero-shot and retrieval-augmented configurations, and observed differing performance across models.
Artificial intelligence embedded in human decision-making can either enhance human reasoning or induce excessive cognitive dependence.
Stated as a conceptual claim in the paper's introduction/abstract; supported by the paper's conceptual framing (theoretical argument), no empirical sample or experimental data reported here.
Policy implication: smarter, better-coordinated green governance is needed to address the negative local impacts and the crowding-out interaction between AI and environmental regulation.
Policy recommendation drawn in the abstract based on the empirical spatial findings (negative local effects and negative interaction).
Substantial regional gaps persist: leading eastern provinces approach a UCEE value of 1.0 while some northeastern provinces remain below 0.1.
Regional UCEE index estimates from the Super-SBM model across the 30 provinces reported in the abstract.
The systemic implications of AI in finance depend less on model intelligence alone than on how agent architectures are distributed, coupled, and governed across institutions.
Central argumentative claim supported by the AFMM conceptual model and an illustrative empirical application described in the paper (modeling + event-study approach); no full-sample details provided in the excerpt.
The Agentic Financial Market Model (AFMM), a stylised agent-based representation, links agent design parameters (autonomy depth, heterogeneity, execution coupling, infrastructure concentration, supervisory observability) to market-level outcomes including efficiency, liquidity resilience, volatility, and systemic risk.
Presentation of a stylised agent-based model (AFMM) in the paper; conceptual modelling linking specified agent parameters to macro/market outcomes. No empirical sample size reported in the excerpt.
Financial AI agents can be described by a four-layer architecture covering data perception, reasoning engines, strategy generation, and execution with control.
Conceptual framework proposed by the authors (theoretical/architectural proposal); no empirical testing or sample size provided.
These productivity gains are most pronounced for lower-skilled workers, producing a pattern the authors call “skill compression.”
Cross-study pattern reported in the literature review: comparative evidence across worker-skill strata in multiple empirical papers showing larger relative gains for lower-skilled/junior workers; specific underlying studies and sample sizes are not enumerated in the brief.
Financial well-being is not an automatic byproduct of automated credit efficiency but an emergent outcome of architectural alignment among technology, borrower capability, and governance structures.
Theoretical conclusion drawn from empirical results showing mixed effects (positive on repayment and resilience, negative on stress) and significant moderation by human capability and institutional design.
Study 1 quantifies confirmation bias through controlled experiments on 250 CVE vulnerability/patch pairs evaluated across four state-of-the-art models under five framing conditions for the review prompt.
Controlled experiment described in the paper: 250 CVE vulnerability/patch pairs evaluated across four state-of-the-art LLMs under five prompt framing conditions.
Lightweight safeguards can reduce risk in some settings but do not consistently prevent severe failures.
Analysis of simulated interventions/safeguards within governance simulations showing reductions in certain risk metrics in some scenarios, but persistence of severe failures in others; assessment based on rubric-judged transcript segments.
There are large differences in corruption-related outcomes across governance regimes and specific model–governance pairings.
Observed heterogeneity in outcomes across different authority structures and model–governance pairings within the multi-agent simulations, evaluated via rubric-based scoring over 28,112 transcript segments.
These findings challenge the narrative of complete automation by AI and underscore the enduring importance of human expertise in data science.
Interpretation based on competition results where AI-only baselines underperformed relative to many participant teams and top solutions used human-AI collaboration.
These findings indicate a misalignment between the perceived benefit of AI writing and an implicit, consistent effect on the semantics of human writing, with potential implications for cultural and scientific institutions.
Synthesis and interpretation of the paper's empirical results (user study, essay revision experiments, and peer-review analysis); presented as the paper's broader conclusion.
Socioeconomic regression analysis confirms strong correlations between neighborhood racial composition and detection likelihood: Pearson r = 0.83 for percent White and r = -0.81 for percent Black.
Reported Pearson correlation coefficients from regression analysis between neighborhood racial composition variables and detection likelihood in the simulations.
A Conditional Tabular GAN (CTGAN) debiasing approach partially redistributes detection rates but cannot eliminate structural disparity without accompanying policy intervention.
Experimental comparison between baseline simulations and CTGAN-debiased synthetic data showing partial redistribution of detection rates; paper asserts remaining structural disparities.
Regional analysis shows inland regions remain capital-dependent, with an estimated (capital) elasticity of approximately 0.43.
Regional decomposition/estimation reported in the study comparing inland regions to coastal ones using the extended production function.
The paper formalizes the distinction using a signal-aggregation model in which an organization maintains an anchor belief and achieves agreement through two exclusion channels: (1) report shrinkage toward the anchor and (2) a tolerance rule that discards reports deviating beyond a threshold.
Analytical formal model presented in the paper specifying an anchor belief and two exclusion mechanisms; model assumptions and mechanisms are explicit in the theoretical development. No empirical sample.
Organizational cohesion is observationally ambiguous: it can arise either from genuine information integration (debate and synthesis of heterogeneous inputs) or from exclusionary processes (conformity pressure, gatekeeping, intolerance of dissent).
Conceptual argument and formal definition in the paper framing; supported by the analytic distinction introduced in the paper between integration and exclusion as alternative generative mechanisms for observed agreement. No empirical sample—argument is theoretical and illustrated by model construction.
The authors identify ten evaluation practices that teams use, ranging from lightweight interpretive checks to formal organizational processes (examples: qualitative user reviews, red-team testing, A/B experiments, telemetry/log analysis, structured annotation, governance/meta-evaluation).
Thematic coding of 19 interview transcripts produced a taxonomy enumerating ten practices (paper reports the taxonomy as an outcome).
Quantum-driven growth depends critically on adoption rates, infrastructure readiness, complementary investments (digital infrastructure, human capital), and enabling policy/regulatory environments.
Scenario framework that varies (a) technical timelines, (b) sectoral adoption rates (diffusion models), (c) infrastructure readiness, and (d) policy environments; policy counterfactual modeling shows sensitivity of adoption and macro outcomes to these parameters.
The magnitude and timing of macroeconomic impact from quantum computing are highly uncertain.
Monte Carlo / scenario ensemble results showing wide (fat-tailed) outcome distributions driven by uncertainty in technical milestones, adoption rates, and complementarity strengths; use of expert elicitation to parameterize tail risks.
Policymakers face trade-offs between promoting innovation and market efficiency on one hand and protecting privacy, fairness, and national security on the other; economic analysis can inform calibration.
Normative policy analysis and synthesis of literature on digital regulation and trade-offs; supported by comparative observations of regulatory priorities across jurisdictions.