Evidence (4114 claims)
Adoption
8570 claims
Productivity
7631 claims
Governance
6869 claims
Human-AI Collaboration
6491 claims
Org Design
4175 claims
Innovation
4114 claims
Labor Markets
3566 claims
Skills & Training
2966 claims
Inequality
2066 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 758 | 199 | 100 | 900 | 2007 |
| Governance & Regulation | 826 | 400 | 191 | 122 | 1563 |
| Organizational Efficiency | 777 | 193 | 124 | 84 | 1189 |
| Technology Adoption Rate | 635 | 233 | 124 | 97 | 1098 |
| Research Productivity | 422 | 128 | 57 | 336 | 954 |
| Output Quality | 476 | 179 | 59 | 47 | 761 |
| Decision Quality | 328 | 177 | 81 | 47 | 640 |
| Firm Productivity | 435 | 57 | 88 | 20 | 606 |
| AI Safety & Ethics | 218 | 277 | 65 | 33 | 599 |
| Market Structure | 180 | 170 | 123 | 24 | 502 |
| Task Allocation | 213 | 64 | 72 | 33 | 387 |
| Skill Acquisition | 170 | 61 | 61 | 17 | 309 |
| Innovation Output | 203 | 27 | 43 | 18 | 292 |
| Employment Level | 105 | 54 | 107 | 13 | 281 |
| Fiscal & Macroeconomic | 131 | 69 | 43 | 26 | 276 |
| Consumer Welfare | 117 | 63 | 42 | 11 | 233 |
| Firm Revenue | 153 | 48 | 26 | 3 | 230 |
| Task Completion Time | 173 | 31 | 8 | 12 | 225 |
| Inequality Measures | 44 | 122 | 49 | 6 | 221 |
| Worker Satisfaction | 89 | 65 | 22 | 12 | 188 |
| Error Rate | 69 | 92 | 10 | 2 | 173 |
| Regulatory Compliance | 77 | 69 | 14 | 5 | 165 |
| Automation Exposure | 56 | 56 | 26 | 13 | 154 |
| Training Effectiveness | 94 | 21 | 13 | 19 | 149 |
| Wages & Compensation | 77 | 36 | 25 | 6 | 144 |
| Team Performance | 86 | 17 | 27 | 10 | 141 |
| Developer Productivity | 95 | 17 | 14 | 6 | 133 |
| Job Displacement | 12 | 80 | 20 | 1 | 113 |
| Hiring & Recruitment | 52 | 7 | 8 | 3 | 70 |
| Creative Output | 31 | 18 | 8 | 3 | 61 |
| Skill Obsolescence | 5 | 46 | 6 | 1 | 58 |
| Social Protection | 27 | 16 | 8 | 2 | 53 |
| Labor Share of Income | 17 | 19 | 17 | — | 53 |
| Worker Turnover | 11 | 12 | — | 3 | 26 |
| Industry | — | — | — | 1 | 1 |
Innovation
Remove filter
Site-wise right-sizing combined with spatial complementarity of wind energy keeps aggregate fleet utilization on par with traditional deployments.
Feasibility/analytical evaluation in the paper (presumably simulations/analysis of site sizing and spatial complementarity); specific methods/details not in abstract.
Our feasibility analysis shows that 890+ GW of wind capacity lies within 50 ms network round trip time of Azure data centers.
Feasibility analysis mapping wind capacity to Azure data center network latency; result reported as aggregate capacity (890+ GW).
AI Greenferencing brings modular AI compute to renewable energy sources (focusing on wind), allowing AI footprint expansion, generating local behind-the-meter demand for renewable sites, and helping ease the growing strain on power utilities.
Conceptual/proposed deployment model described in the paper; feasibility analysis described elsewhere in the paper supports feasibility but exact empirical backing for all claimed benefits not specified in abstract.
Fitted on 3.9B Pythia models with 30180B tokens, the Shannon Scaling Law predicts an unseen 12B model up to 307B tokens at pooled R^2=0.847, while monotonic baselines collapse.
Specific extrapolation experiment reported: model fit trained on models <=6.9B and <=180B tokens (Pythia), then used to predict behavior of an unseen 12B model up to 307B tokens; pooled R^2 reported as 0.847 and monotonic baselines reported to fail.
The Shannon Scaling Law consistently outperforms classical scaling laws and recent perturbation-aware laws, achieving strong R^2 scores and accurately capturing loss basins missed by prior approaches.
Empirical model comparison reported in the paper: goodness-of-fit comparisons (R^2) between the proposed Shannon Scaling Law and prior scaling laws / perturbation-aware variants, with qualitative claims about capturing loss basins.
We validate our theory through experiments on Pythia and OLMo2 under perturbations, including Gaussian noise, quantization and supervised fine-tuning on math, QA and code tasks.
Empirical experiments reported in the paper using Pythia and OLMo2 model families, testing various perturbations and tasks (math, QA, code).
Export controls often unintentionally boost China's self-reliance and R&D.
Argument in the paper that restrictions spur domestic substitution and investment in R&D in the targeted country (qualitative/historical reasoning; no quantified estimate provided).
Export controls are strategic tools in U.S.-China AI competition.
Analytical argument in the paper connecting export controls to broader strategic aims in great-power competition over AI; qualitative policy analysis rather than empirical measurement.
Since October 2022, the U.S. Bureau of Industry and Security (BIS) has progressively tightened restrictions on advanced computing components to China.
Factual timeline asserted in the paper referencing BIS policy actions beginning October 2022 (policy documents and announcements invoked).
Controls cover advanced chips, capital, personnel, and critical minerals for semiconductors.
Enumerative claim in the paper listing categories of items and flows targeted by export controls (policy documents and examples cited).
Export controls have become central to U.S.-China tech rivalry, especially in AI.
Policy analysis in the paper citing recent U.S. measures (e.g., BIS actions) and Chinese responses; contextual argumentation rather than a quantitative study.
Export control is a policy and legal tool to protect national interests by regulating exports of sensitive goods and technology to foreign nations.
Descriptive/legal characterization presented in the paper (normative definition and overview of export control regimes).
This paper provides new evidence on AI adoption from a non-US context by leveraging German firm-level data (ifo Business Survey).
Use of a large German business survey (ifo Business Survey) and analysis of AI adoption patterns among German firms.
AI is expected to have positive long-term productivity impacts for different sectors of the German economy.
Assessment of potential productivity impacts using firm-level survey responses about expected long-term benefits of AI (forward-looking/expectation-based analysis).
The increase in AI usage from 2023 to 2024 was particularly pronounced in manufacturing and services sectors.
Sectoral breakdown of ifo Business Survey firm-level data showing higher increases in reported AI usage for manufacturing and services compared with other sectors.
There was a significant increase in AI usage among German firms from 2023 to 2024.
Firm-level responses from the ifo Business Survey comparing reported AI usage in 2023 versus 2024 (cross-sectional/descriptive trend analysis).
Findings provide practical insights for AI implementation that prioritize management capability and adaptability to external environments.
Authors' interpretation and managerial implication drawn from empirical PLS-SEM (mediation/moderation) and fsQCA results on 251 firms.
Decision-making agility is a critical conduit linking AI capabilities to improving organizational outcomes.
Inference from PLS-SEM mediation results reported in paper indicating AI capability effects on performance operate via decision-making agility; analysis based on survey of 251 firms.
Two sub-dimensions of AI capability, technical infrastructure and management, affect performance outcomes through decision-making agility.
PLS-SEM results reported in paper showing relationships among measured constructs (AI capability sub-dimensions, decision-making agility, and performance outcomes); based on survey of 251 firms.
Enterprise capability adaptation serves as the key support for implementing intelligent international marketing models.
Conclusion from the paper's review and content analysis of literature (2010–2025); presented as a synthesized enabling factor rather than empirically quantified effect.
Mainstream innovation models include data-driven precision marketing, AI-powered cross-border CRM, intelligent omnichannel integration, and cross-cultural intelligent localization marketing.
Summary from the paper's systematic review and content analysis of core literature (2010–2025); descriptive synthesis, no primary experimental sample size reported.
New theoretical frameworks have emerged: data-driven precision marketing theory, nonlinear customer journey reconstruction theory, cross-border intelligent value co-creation theory, and global intelligent marketing ecosystem theory.
Identified via the paper's systematic review and content analysis of literature from 2010–2025; presented as conceptual/theoretical developments rather than quantified empirical effects.
Intelligent technologies have increased international marketing ROI by 12%–25%.
Mixed-method systematic review and content analysis of core literature sources from 2010 to 2025 (as reported in the paper). No primary dataset or sample size reported for this quantified range.
Findings extend digital transformation theory by showing that GenAI moves organizing from human-driven adaptation toward technology-embedded reconfiguration.
Authors' theoretical interpretation linking empirical findings from 17 interviews to broader digital transformation theory.
The paper conceptualizes 'AI-augmented orchestration', where human and algorithmic actors jointly configure work and value creation.
Theoretical contribution / conceptualization derived from analysis of interview data and authors' synthesis.
The study links GenAI-driven organizational changes to four value dimensions: operational, structural, innovation, and market value.
Authors' analytical framework developed from interview data (17 interviews) mapping changes to four value dimensions.
Startups integrate GenAI not as a peripheral tool but as a structural collaborator.
Interpretive finding from interviews and authors' theorization based on the dataset (17 interviews).
Generative AI (GenAI) is influencing how startups form, operate, and create value.
Statement in paper's introduction/abstract; supported by the study's framing and qualitative interview data (17 expert interviews).
The ML community should adopt PBOS as its default contract for such collaborations.
Normative recommendation by the authors; presented as a conclusion/proposal rather than empirically validated policy.
The boundary could not have been drawn correctly without scientists at the negotiating table.
Normative/analytic claim offered by the authors asserting the necessity of scientist involvement in contract design; no empirical evidence provided in the excerpt.
This boundary (pre-training open / post-training proprietary) is technically meaningful, legally clean, and auditable.
Claim about the properties of the PBOS boundary presented as an argument/claim in the paper; no empirical/legal audit data provided in the excerpt.
PBOS: pre-training artifacts (architectures, training code, benchmarks, untrained weights) are open science; post-training artifacts (weights trained on proprietary data) are business IP.
Proposed contract template/definition presented in the paper (prescriptive/design proposal); no empirical validation reported in the provided text.
Empirically stable pricing near the Nash Bargaining benchmark is observed in testing.
Reported empirical observation from experiments across varying population sizes and a 30-day horizon (abstract statement).
Testing across 6–100 agents over a 30-day horizon confirms scalability across population size.
Reported experimental sweep over agent population sizes from 6 to 100 across a 30-day horizon (as stated in abstract).
Nash-guided price proximity rewards align agent learning toward bargaining-optimal strategies.
Algorithmic design claim from the paper: inclusion of a Nash-guided price-proximity reward to shape agent learning (abstract statement).
The paper integrates Nash Bargaining Solution into Multi-Agent Deep Deterministic Policy Gradient, creating Nash-MADDPG, where Nash bargaining determines efficient bilateral pricing.
Methodological claim describing the proposed algorithm and role of Nash bargaining (as stated in abstract).
Nash-MADDPG achieves superior fairness, showing a 40.1% improvement in Jain's index.
Reported fairness metric (Jain's index) improvement in the paper's evaluation over a 30-day horizon (abstract statement).
Nash-MADDPG yields a 62.9% improvement in trading volume over Double Auction.
Reported comparison versus Double Auction in the paper's 30-day continuous-operation evaluation (abstract statement).
Nash-MADDPG improves social welfare by 61.6% over Double Auction in evaluation over 30-day continuous operation.
Simulation evaluation reported in the paper: 30-day continuous operation comparison against Double Auction baseline (as stated in abstract).
The paper proposes a multi-layered governance framework combining core regulatory requirements with supporting ecosystem measures to ensure accountability, security, and transparency in the age of autonomous financial agency.
Policy proposal presented in the paper (concluding recommendation summarized in the abstract).
ScienceClaw x Infinite provides the auditable artifact and provenance layer for this evaluation.
Paper statement that ScienceClaw x Infinite was used to supply auditable artifacts and provenance for the benchmark.
When one signal dominates, as in paradigm-shift detection, coordination mainly improves interpretation and traceability.
Reported result for the historical paradigm-shift detection task indicating limited predictive gains but improved interpretability and provenance when using coordinated agents.
Cross-channel composites improve over single-channel baselines: exoplanet vetting reaches AUROC 0.955.
Reported performance metric (AUROC=0.955) for the exoplanet vetting task comparing cross-channel composite to single-channel baselines.
When different disciplines each capture only part of the phenomenon, cross-channel composites improve over single-channel baselines: climate-vector emergence reaches AUROC 0.944.
Reported performance metric (AUROC=0.944) for the climate-vector emergence task comparing cross-channel composite to single-channel baselines.
Each case uses a frozen evaluation panel, predefined scoring protocols, explicit baselines, ablations or null controls, and stated limitations.
Methods claim describing evaluation protocol components reported in the paper.
We evaluate this question with a cross-domain benchmark spanning four scientific tasks: mapping molecular structure into musical representations, detecting historical paradigm shifts in science, identifying vector-borne disease emergence, and vetting transiting-exoplanet candidates.
Stated design of the study: description of benchmark tasks in the paper's methods/abstract.
Policy implication: governments in emerging economies should support AI-based learning ecosystems, strengthen university-industry collaboration and expand digital literacy programs to accelerate digital competitiveness.
Authors' policy recommendations based on study findings and contextual discussion about Pakistan's IT sector and emerging economies.
Organisational intelligence (OI) is a major driver of sustained innovation and helps firms translate learning into commercial outcomes.
Survey measures for OI and IP (N=348) and results from mediation/association analyses indicating OI positively relates to innovation performance and mediates effects of AIDLC/KO.
Knowledge orchestration functions as a critical bridge between AI-driven learning culture and innovation; success depends less on what information is stored and more on how quickly and intelligently it can be used.
Mediation analysis from the cross-sectional survey (N=348) showing KO mediates the relationship between AIDLC and innovation performance; conceptual interpretation in discussion contrasting KO with traditional knowledge management.
AI-supported learning environments were linked to greater creativity, experimentation and technological improvement.
Survey responses (N=348) using established measurement scales; authors report associations between AIDLC measures and subcomponents of innovation (creativity, experimentation, technological improvement).