Evidence (4114 claims)
Adoption
8570 claims
Productivity
7631 claims
Governance
6869 claims
Human-AI Collaboration
6491 claims
Org Design
4175 claims
Innovation
4114 claims
Labor Markets
3566 claims
Skills & Training
2966 claims
Inequality
2066 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 758 | 199 | 100 | 900 | 2007 |
| Governance & Regulation | 826 | 400 | 191 | 122 | 1563 |
| Organizational Efficiency | 777 | 193 | 124 | 84 | 1189 |
| Technology Adoption Rate | 635 | 233 | 124 | 97 | 1098 |
| Research Productivity | 422 | 128 | 57 | 336 | 954 |
| Output Quality | 476 | 179 | 59 | 47 | 761 |
| Decision Quality | 328 | 177 | 81 | 47 | 640 |
| Firm Productivity | 435 | 57 | 88 | 20 | 606 |
| AI Safety & Ethics | 218 | 277 | 65 | 33 | 599 |
| Market Structure | 180 | 170 | 123 | 24 | 502 |
| Task Allocation | 213 | 64 | 72 | 33 | 387 |
| Skill Acquisition | 170 | 61 | 61 | 17 | 309 |
| Innovation Output | 203 | 27 | 43 | 18 | 292 |
| Employment Level | 105 | 54 | 107 | 13 | 281 |
| Fiscal & Macroeconomic | 131 | 69 | 43 | 26 | 276 |
| Consumer Welfare | 117 | 63 | 42 | 11 | 233 |
| Firm Revenue | 153 | 48 | 26 | 3 | 230 |
| Task Completion Time | 173 | 31 | 8 | 12 | 225 |
| Inequality Measures | 44 | 122 | 49 | 6 | 221 |
| Worker Satisfaction | 89 | 65 | 22 | 12 | 188 |
| Error Rate | 69 | 92 | 10 | 2 | 173 |
| Regulatory Compliance | 77 | 69 | 14 | 5 | 165 |
| Automation Exposure | 56 | 56 | 26 | 13 | 154 |
| Training Effectiveness | 94 | 21 | 13 | 19 | 149 |
| Wages & Compensation | 77 | 36 | 25 | 6 | 144 |
| Team Performance | 86 | 17 | 27 | 10 | 141 |
| Developer Productivity | 95 | 17 | 14 | 6 | 133 |
| Job Displacement | 12 | 80 | 20 | 1 | 113 |
| Hiring & Recruitment | 52 | 7 | 8 | 3 | 70 |
| Creative Output | 31 | 18 | 8 | 3 | 61 |
| Skill Obsolescence | 5 | 46 | 6 | 1 | 58 |
| Social Protection | 27 | 16 | 8 | 2 | 53 |
| Labor Share of Income | 17 | 19 | 17 | — | 53 |
| Worker Turnover | 11 | 12 | — | 3 | 26 |
| Industry | — | — | — | 1 | 1 |
Innovation
Remove filter
Renewable energy consumption positively influences AI investment in the United States.
Empirical analysis using Wavelet Quantile Regression (WQR) and Wavelet Quantile Correlation (WQC) on US quarterly data from 2013Q1 to 2024Q4 (48 quarters).
AlphaFold represents an 'oracle' breakthrough in AI for scientific discovery.
Cited as an example of an algorithmic breakthrough that changed a specific scientific subtask (protein structure prediction). The paper frames AlphaFold as a milestone in the history reviewed; no new experimental data presented.
The resulting policy matrix includes R&D funding, regulatory sandboxes, public procurement incentives, and tax relief, tailored to each stage of technological evolution.
Paper presents a policy matrix produced by the study listing these instruments mapped to maturity stages; no quantitative evaluation of impact reported in text provided.
To validate and prioritise policy instruments, Delphi rounds with domain experts and Analytic Hierarchy Process (AHP) weighting are employed.
Paper reports use of Delphi method and AHP for validation and prioritization; methodological description without reported number of experts or rounds.
A technology maturity classification categorises innovations into emerging, developing, and mature stages, forming the basis for strategic policy matching.
Paper defines a maturity classification (emerging/developing/mature) and indicates it is used to match policy instruments; categorical description provided, no quantitative validation details in text provided.
Temporal mapping and citation networks reveal distinct technology maturity patterns, which are visualised using S-curve and hype cycle models.
Paper describes use of temporal mapping and citation network analysis and visualization via S-curve and hype cycle models; methodological description without quantitative sample-size details.
Technologies such as AI-driven healthcare, quantum communication, hydrogen energy, and smart educational AI are identified as key domains of convergence.
Paper reports these domains were identified via the applied analytic framework and multi-source data triangulation; no numeric counts/sample sizes provided.
The study applies advanced techniques such as LDA topic modelling, BERT-based clustering, and co-citation analysis to detect innovation trajectories.
Paper states these specific analytic techniques were applied (method description).
The research leverages large AI models and multi-source data—including global patent databases (WIPO, USPTO, Lens.org), scientific literature corpora, and industry intelligence platforms (CB Insights, Qichacha).
Paper statement of data sources and use of large AI models; methodological description (no sample sizes reported).
Empirical findings demonstrate that digitalization significantly boosts efficiency and competitiveness of industrial production.
Correlation and regression analyses reported in the study linking digitalization measures to indicators of efficiency and competitiveness across levels of analysis.
Digital technologies (automation, IIoT, ERP systems, AI applications) reduce nonproductive costs, increase per-worker output, and improve the cost-efficiency of production in Kazakhstani enterprises.
Case studies and real examples from named enterprises (Asia Auto, Karaganda Foundry and Engineering Plant, Eurasian Resources Group) presented in the article.
The number of employees and working time have a positive but limited effect on labor productivity.
Results from the study's correlation and regression analysis comparing labor input measures (employee count and working time) with productivity outcomes.
Digitalization is the key driver of labor productivity growth in Kazakhstan.
Empirical correlation and regression analysis reported in the study across enterprise, industry, and national economy levels.
A stylized-facts analysis using OECD and World Bank indicators shows that economies with higher digital capacity, greater R&D intensity, and stronger institutions exhibit superior productivity and growth performance.
Stylized-facts (cross-country) analysis based on OECD and World Bank indicators; descriptive correlations reported in the paper (sample of countries not enumerated in the provided summary).
AI adoption stimulates institutional innovation, which in turn increases total factor productivity (TFP) and supports sustainable economic growth.
Theoretical mediation claim developed in the paper (integration of Schumpeterian growth theory with institutional economics); supported conceptually and argued with stylized-facts analysis but not presented as causally identified empirical estimates.
AI improves governance quality.
Argument within the conceptual framework linking AI capabilities (information processing, monitoring) to improved governance; stated qualitatively in the paper rather than supported by causal empirical tests.
AI lowers transaction costs.
Paper's conceptual/theoretical framework that characterizes AI as lowering transaction costs through improved information and coordination; no quantitative causal estimate reported.
AI reduces information asymmetries.
Theoretical/conceptual argument in the paper framing AI as a general-purpose technology that improves information flows; supported by the paper's conceptual framework (no experimental or causal identification reported).
AI-enabled competitive advantages are more likely to be achieved by innovation platforms than by transaction platforms.
Comparative finding reported from the fsQCA analysis on Chinese listed platform enterprises; the paper explicitly states innovation platforms are more likely to attain AI-enabled competitive advantages than transaction platforms. No sample breakdown by platform type provided in the abstract.
The AI-enabled combinations produce competitive advantages through three paths: AI internalization, AI leverage, and AI collaboration.
Causal/pathway interpretation from fsQCA solutions on the panel of Chinese listed platform enterprises as described in the paper (abstract reports three named paths). No quantitative effect sizes provided in the excerpt.
AI-enabled competitive advantages emerge from three types of configurations: the situated AI dominance type, the situated AI subsidiary type, and the collaborative drive type.
Configurations identified by fsQCA on the panel data; the paper reports three distinct solution/configuration types leading to competitive advantage. Details on case membership and calibration thresholds are not provided in the abstract.
AI technology innovation and recasting AI are necessary conditions for platform enterprises to establish competitive advantages.
Result from necessity analysis within the fsQCA applied to the panel of Chinese listed platform enterprises (paper reports these two conditions as necessary). Specific sample size and statistical measures not provided in the abstract.
This study draws on panel data from Chinese listed platform enterprises and employs fuzzy-set Qualitative Comparative Analysis (fsQCA).
The paper states it uses panel data from Chinese listed platform enterprises and applies fsQCA as its analytic method (methodological statement in abstract). Sample size not reported in the provided text.
The contribution is a falsifiable architectural thesis, a clear threat model, and a set of experimentally testable hypotheses for future work on distillation resistance, alignment, and model governance.
Theoretical contribution claim: the paper proposes hypotheses and a threat model intended to be testable in future empirical work; no experiments in the paper itself are reported.
Embedded shopping AI functions less as a substitute for conventional search than as a complementary interface for exploratory product discovery in e-commerce.
Synthesis of empirical regularities (demographic adoption patterns, timing in journey, interleaving behavior, high share of exploratory/attraction queries) from the descriptive analysis of Ctrip/Wendao usage data.
Consumers disproportionately use the assistant for exploratory, hard-to-keyword tasks: attraction queries account for 42% of observed chat requests.
Intent classification of chat requests in the dataset; reported share of chat requests labeled as 'attraction' (42%).
Among journeys containing both chat and search, the most common pattern is interleaving, with users moving back and forth between the two modalities.
Pattern/sequence analysis of journeys that include both chat and search events, counting and comparing patterns (e.g., interleaving versus strict ordering).
AI chat appears in the same broad phase of the purchase journey as traditional search and well before order placement.
Sequence/timestamp analysis of user journeys in platform logs showing the relative timing of chat, search, and order placement within journeys.
Adoption of the embedded shopping AI is highest among older consumers, female users, and highly engaged existing users, reversing the younger, male-dominated profile commonly documented for general-purpose AI tools.
Descriptive demographic analysis of adoption rates across users in the Ctrip dataset (user-level adoption comparisons by age, gender, and prior engagement). Sample drawn from the 31 million users in the platform logs.
Grok attracts users primarily for its content policy.
Survey items asking users for reasons they use each platform; reported attribution of content policy as primary reason for Grok (overall N=388).
DeepSeek attracts users primarily through word-of-mouth.
Survey items asking users for reasons they use each platform; reported attribution of word-of-mouth as primary reason for DeepSeek (overall N=388).
Claude attracts users primarily for answer quality.
Survey items asking users for reasons they use each platform; reported attribution of answer quality as primary reason for Claude (overall N=388).
ChatGPT attracts users primarily for its interface.
Survey items asking users for reasons they use each platform; reported attribution of interface as primary reason for ChatGPT (overall N=388).
Over 80% of users use two or more platforms (i.e., multi-platform usage is common).
Survey self-reports aggregated across respondents (paper reports 'over 80%'); overall sample N=388.
We conducted a cross-platform survey of 388 active AI chat users comparing satisfaction, adoption drivers, use case performance, and qualitative frustrations across seven major platforms: ChatGPT, Claude, Gemini, DeepSeek, Grok, Mistral, and Llama.
Cross-sectional online survey described in the paper; sample size reported as 388 users; seven named platforms explicitly listed.
Robustness tests confirm that the core conclusions about IRs improving urban energy resilience and the identified mechanisms/moderators are highly reliable.
Multiple robustness checks reported by the authors (unspecified in the abstract) applied to the DML estimates on the 280-city panel (2009–2023).
Science expenditure (SE) positively moderates the promoting effect of IRs on urban energy resilience; the interaction term coefficient is significantly positive.
Moderation analysis reported in the paper using interaction terms between IRs and science expenditure in the DML framework on the 280-city panel (2009–2023); reported statistically significant positive interaction coefficient.
Environmental regulation (ER) positively moderates the promoting effect of IRs on urban energy resilience; the interaction term coefficient is significantly positive.
Moderation analysis reported in the paper using interaction terms between IRs and environmental regulation in the DML framework on the 280-city panel (2009–2023); reported statistically significant positive interaction coefficient.
Green technology innovation is a main mediating path through which IRs improve urban energy resilience.
Mediation/transmission mechanism analysis reported in the paper based on the DML approach applied to the 280-city panel (2009–2023).
Industrial structure upgrading is a main mediating path through which IRs improve urban energy resilience.
Mediation/transmission mechanism analysis reported in the paper based on the same DML framework and the 280-city panel (2009–2023).
Industrial robots (IRs) significantly promote the improvement of urban energy resilience (UER).
Empirical analysis using Double Machine Learning (DML) on a panel of 280 prefecture-level and above Chinese cities from 2009 to 2023; various robustness tests reported.
The best designs often do not originate from top-ranked ILP candidates, indicating that global optimization exposes improvements missed by sub-kernel search.
Analysis comparing origins of the best final designs vs. their ILP ranking, reported across the benchmark set (12).
Larger gains on harder benchmarks: streamcluster exceeds 20× and kmeans reaches approximately 10×.
Per-benchmark empirical results reported for streamcluster and kmeans in the evaluation.
Scaling from 1 to 10 agents yields a mean 8.27× speedup over baseline.
Empirical evaluation across the reported benchmark set comparing performance with 1 agent versus 10 agents; mean speedup stated in the results.
We evaluate the approach on 12 kernels from HLS-Eval and Rodinia-HLS using Claude Code (Opus 4.5/4.6) with AMD Vitis HLS.
Experimental setup described in the paper reporting evaluation on 12 kernels drawn from HLS-Eval and Rodinia-HLS, using Claude Code (Opus 4.5/4.6) and AMD Vitis HLS.
In Stage 2, the pipeline launches N expert agents over the top ILP solutions, each exploring cross-function optimizations such as pragma recombination, loop fusion, and memory restructuring that are not captured by sub-kernel decomposition.
Method section describing Stage 2 which runs multiple expert agents exploring cross-function optimizations on top ILP solutions.
In Stage 1, the pipeline decomposes a design into sub-kernels, independently optimizes each using pragma and code-level transformations, and formulates an Integer Linear Program (ILP) to assemble globally promising configurations under an area constraint.
Method section describing Stage 1 decomposition, per-sub-kernel optimization and ILP assembly under an area constraint.
We introduce an agent factory, a two-stage pipeline that constructs and coordinates multiple autonomous optimization agents.
Method description in the paper describing the design and implementation of the two-stage 'agent factory' pipeline.
Deployment validation across 43 classrooms demonstrated an 18x efficiency gain in the assessment workflow.
Field deployment described in the paper: system was validated across 43 classrooms and an efficiency gain of 18x in the assessment workflow is reported.
Interaction2Eval achieves up to 88% agreement with human expert judgments.
Reported evaluation results comparing Interaction2Eval outputs to human expert annotations (rubric-based judgments) on the dataset.