The Commonplace
Home Dashboard Papers Evidence Syntheses Digests 🎲

Evidence (4114 claims)

Adoption
8570 claims
Productivity
7631 claims
Governance
6869 claims
Human-AI Collaboration
6491 claims
Org Design
4175 claims
Innovation
4114 claims
Labor Markets
3566 claims
Skills & Training
2966 claims
Inequality
2066 claims

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome Positive Negative Mixed Null Total
Other 758 199 100 900 2007
Governance & Regulation 826 400 191 122 1563
Organizational Efficiency 777 193 124 84 1189
Technology Adoption Rate 635 233 124 97 1098
Research Productivity 422 128 57 336 954
Output Quality 476 179 59 47 761
Decision Quality 328 177 81 47 640
Firm Productivity 435 57 88 20 606
AI Safety & Ethics 218 277 65 33 599
Market Structure 180 170 123 24 502
Task Allocation 213 64 72 33 387
Skill Acquisition 170 61 61 17 309
Innovation Output 203 27 43 18 292
Employment Level 105 54 107 13 281
Fiscal & Macroeconomic 131 69 43 26 276
Consumer Welfare 117 63 42 11 233
Firm Revenue 153 48 26 3 230
Task Completion Time 173 31 8 12 225
Inequality Measures 44 122 49 6 221
Worker Satisfaction 89 65 22 12 188
Error Rate 69 92 10 2 173
Regulatory Compliance 77 69 14 5 165
Automation Exposure 56 56 26 13 154
Training Effectiveness 94 21 13 19 149
Wages & Compensation 77 36 25 6 144
Team Performance 86 17 27 10 141
Developer Productivity 95 17 14 6 133
Job Displacement 12 80 20 1 113
Hiring & Recruitment 52 7 8 3 70
Creative Output 31 18 8 3 61
Skill Obsolescence 5 46 6 1 58
Social Protection 27 16 8 2 53
Labor Share of Income 17 19 17 53
Worker Turnover 11 12 3 26
Industry 1 1
Clear
Innovation Remove filter
Site-wise right-sizing combined with spatial complementarity of wind energy keeps aggregate fleet utilization on par with traditional deployments.
Feasibility/analytical evaluation in the paper (presumably simulations/analysis of site sizing and spatial complementarity); specific methods/details not in abstract.
high positive XWind: A Cross-site Router for Large Language Model Inferenc... aggregate fleet utilization
Our feasibility analysis shows that 890+ GW of wind capacity lies within 50 ms network round trip time of Azure data centers.
Feasibility analysis mapping wind capacity to Azure data center network latency; result reported as aggregate capacity (890+ GW).
high positive XWind: A Cross-site Router for Large Language Model Inferenc... wind capacity within 50 ms RTT of Azure data centers
AI Greenferencing brings modular AI compute to renewable energy sources (focusing on wind), allowing AI footprint expansion, generating local behind-the-meter demand for renewable sites, and helping ease the growing strain on power utilities.
Conceptual/proposed deployment model described in the paper; feasibility analysis described elsewhere in the paper supports feasibility but exact empirical backing for all claimed benefits not specified in abstract.
high positive XWind: A Cross-site Router for Large Language Model Inferenc... local demand generation at renewable sites and reduction in grid strain
Fitted on 3.9B Pythia models with 30180B tokens, the Shannon Scaling Law predicts an unseen 12B model up to 307B tokens at pooled R^2=0.847, while monotonic baselines collapse.
Specific extrapolation experiment reported: model fit trained on models <=6.9B and <=180B tokens (Pythia), then used to predict behavior of an unseen 12B model up to 307B tokens; pooled R^2 reported as 0.847 and monotonic baselines reported to fail.
high positive LLMs as Noisy Channels: A Shannon Perspective on Model Capac... extrapolative predictive performance measured by pooled R^2 when predicting loss...
The Shannon Scaling Law consistently outperforms classical scaling laws and recent perturbation-aware laws, achieving strong R^2 scores and accurately capturing loss basins missed by prior approaches.
Empirical model comparison reported in the paper: goodness-of-fit comparisons (R^2) between the proposed Shannon Scaling Law and prior scaling laws / perturbation-aware variants, with qualitative claims about capturing loss basins.
high positive LLMs as Noisy Channels: A Shannon Perspective on Model Capac... goodness-of-fit (R^2) to observed loss/ performance curves and ability to captur...
We validate our theory through experiments on Pythia and OLMo2 under perturbations, including Gaussian noise, quantization and supervised fine-tuning on math, QA and code tasks.
Empirical experiments reported in the paper using Pythia and OLMo2 model families, testing various perturbations and tasks (math, QA, code).
high positive LLMs as Noisy Channels: A Shannon Perspective on Model Capac... empirical behavior of models under perturbations (robustness and fit to the prop...
Export controls often unintentionally boost China's self-reliance and R&D.
Argument in the paper that restrictions spur domestic substitution and investment in R&D in the targeted country (qualitative/historical reasoning; no quantified estimate provided).
high positive Strategic Stalemates: The Paradox of Export Controls in the ... China's domestic R&D capacity and technological self-reliance
Export controls are strategic tools in U.S.-China AI competition.
Analytical argument in the paper connecting export controls to broader strategic aims in great-power competition over AI; qualitative policy analysis rather than empirical measurement.
high positive Strategic Stalemates: The Paradox of Export Controls in the ... use of export controls as strategic instruments
Since October 2022, the U.S. Bureau of Industry and Security (BIS) has progressively tightened restrictions on advanced computing components to China.
Factual timeline asserted in the paper referencing BIS policy actions beginning October 2022 (policy documents and announcements invoked).
high positive Strategic Stalemates: The Paradox of Export Controls in the ... degree of U.S. export restrictions on advanced computing components to China
Controls cover advanced chips, capital, personnel, and critical minerals for semiconductors.
Enumerative claim in the paper listing categories of items and flows targeted by export controls (policy documents and examples cited).
high positive Strategic Stalemates: The Paradox of Export Controls in the ... categories of goods/flows subject to export controls
Export controls have become central to U.S.-China tech rivalry, especially in AI.
Policy analysis in the paper citing recent U.S. measures (e.g., BIS actions) and Chinese responses; contextual argumentation rather than a quantitative study.
high positive Strategic Stalemates: The Paradox of Export Controls in the ... centrality of export controls in bilateral tech competition
Export control is a policy and legal tool to protect national interests by regulating exports of sensitive goods and technology to foreign nations.
Descriptive/legal characterization presented in the paper (normative definition and overview of export control regimes).
high positive Strategic Stalemates: The Paradox of Export Controls in the ... scope and use of export control as a policy instrument
This paper provides new evidence on AI adoption from a non-US context by leveraging German firm-level data (ifo Business Survey).
Use of a large German business survey (ifo Business Survey) and analysis of AI adoption patterns among German firms.
high positive AI adoption among German firms Empirical evidence on AI adoption in Germany (contribution to literature)
AI is expected to have positive long-term productivity impacts for different sectors of the German economy.
Assessment of potential productivity impacts using firm-level survey responses about expected long-term benefits of AI (forward-looking/expectation-based analysis).
high positive AI adoption among German firms Expected firm-level productivity / anticipated long-term productivity benefits
The increase in AI usage from 2023 to 2024 was particularly pronounced in manufacturing and services sectors.
Sectoral breakdown of ifo Business Survey firm-level data showing higher increases in reported AI usage for manufacturing and services compared with other sectors.
high positive AI adoption among German firms AI usage / AI adoption rate by sector
There was a significant increase in AI usage among German firms from 2023 to 2024.
Firm-level responses from the ifo Business Survey comparing reported AI usage in 2023 versus 2024 (cross-sectional/descriptive trend analysis).
high positive AI adoption among German firms AI usage / AI adoption rate (reported by firms)
Findings provide practical insights for AI implementation that prioritize management capability and adaptability to external environments.
Authors' interpretation and managerial implication drawn from empirical PLS-SEM (mediation/moderation) and fsQCA results on 251 firms.
high positive AI for decision-making: exploring the linkage from AI capabi... organizational effectiveness of AI implementation (management capability and ada...
Decision-making agility is a critical conduit linking AI capabilities to improving organizational outcomes.
Inference from PLS-SEM mediation results reported in paper indicating AI capability effects on performance operate via decision-making agility; analysis based on survey of 251 firms.
high positive AI for decision-making: exploring the linkage from AI capabi... organizational/firm performance mediated by decision-making agility
Two sub-dimensions of AI capability, technical infrastructure and management, affect performance outcomes through decision-making agility.
PLS-SEM results reported in paper showing relationships among measured constructs (AI capability sub-dimensions, decision-making agility, and performance outcomes); based on survey of 251 firms.
high positive AI for decision-making: exploring the linkage from AI capabi... firm performance (through decision-making agility)
Enterprise capability adaptation serves as the key support for implementing intelligent international marketing models.
Conclusion from the paper's review and content analysis of literature (2010–2025); presented as a synthesized enabling factor rather than empirically quantified effect.
high positive Research on International Marketing in the Context of Intell... role of enterprise capability adaptation in model implementation
Mainstream innovation models include data-driven precision marketing, AI-powered cross-border CRM, intelligent omnichannel integration, and cross-cultural intelligent localization marketing.
Summary from the paper's systematic review and content analysis of core literature (2010–2025); descriptive synthesis, no primary experimental sample size reported.
high positive Research on International Marketing in the Context of Intell... prevalent innovation models in international marketing
New theoretical frameworks have emerged: data-driven precision marketing theory, nonlinear customer journey reconstruction theory, cross-border intelligent value co-creation theory, and global intelligent marketing ecosystem theory.
Identified via the paper's systematic review and content analysis of literature from 2010–2025; presented as conceptual/theoretical developments rather than quantified empirical effects.
high positive Research on International Marketing in the Context of Intell... emergence of new international marketing theoretical frameworks
Intelligent technologies have increased international marketing ROI by 12%–25%.
Mixed-method systematic review and content analysis of core literature sources from 2010 to 2025 (as reported in the paper). No primary dataset or sample size reported for this quantified range.
high positive Research on International Marketing in the Context of Intell... international marketing ROI
Findings extend digital transformation theory by showing that GenAI moves organizing from human-driven adaptation toward technology-embedded reconfiguration.
Authors' theoretical interpretation linking empirical findings from 17 interviews to broader digital transformation theory.
high positive From Prompt To Process: Qualitative Insights On How Genai Us... nature of organizational change (human-driven adaptation vs technology-embedded ...
The paper conceptualizes 'AI-augmented orchestration', where human and algorithmic actors jointly configure work and value creation.
Theoretical contribution / conceptualization derived from analysis of interview data and authors' synthesis.
high positive From Prompt To Process: Qualitative Insights On How Genai Us... mode of organizing/work configuration (human-algorithm collaboration)
The study links GenAI-driven organizational changes to four value dimensions: operational, structural, innovation, and market value.
Authors' analytical framework developed from interview data (17 interviews) mapping changes to four value dimensions.
high positive From Prompt To Process: Qualitative Insights On How Genai Us... mapping of organizational changes to operational/structural/innovation/market va...
Startups integrate GenAI not as a peripheral tool but as a structural collaborator.
Interpretive finding from interviews and authors' theorization based on the dataset (17 interviews).
high positive From Prompt To Process: Qualitative Insights On How Genai Us... degree of integration of GenAI within organizational structure
Generative AI (GenAI) is influencing how startups form, operate, and create value.
Statement in paper's introduction/abstract; supported by the study's framing and qualitative interview data (17 expert interviews).
high positive From Prompt To Process: Qualitative Insights On How Genai Us... how startups form, operate, and create value (organizational formation and value...
The ML community should adopt PBOS as its default contract for such collaborations.
Normative recommendation by the authors; presented as a conclusion/proposal rather than empirically validated policy.
high positive Position: The Pre/Post-Training Boundary Should Govern IP in... community adoption of PBOS as default contracting practice
The boundary could not have been drawn correctly without scientists at the negotiating table.
Normative/analytic claim offered by the authors asserting the necessity of scientist involvement in contract design; no empirical evidence provided in the excerpt.
high positive Position: The Pre/Post-Training Boundary Should Govern IP in... appropriateness/correctness of contractual artifact boundaries
This boundary (pre-training open / post-training proprietary) is technically meaningful, legally clean, and auditable.
Claim about the properties of the PBOS boundary presented as an argument/claim in the paper; no empirical/legal audit data provided in the excerpt.
high positive Position: The Pre/Post-Training Boundary Should Govern IP in... technical and legal clarity/auditability of artifact boundary
PBOS: pre-training artifacts (architectures, training code, benchmarks, untrained weights) are open science; post-training artifacts (weights trained on proprietary data) are business IP.
Proposed contract template/definition presented in the paper (prescriptive/design proposal); no empirical validation reported in the provided text.
high positive Position: The Pre/Post-Training Boundary Should Govern IP in... classification of artifact ownership under collaborative contracts
Empirically stable pricing near the Nash Bargaining benchmark is observed in testing.
Reported empirical observation from experiments across varying population sizes and a 30-day horizon (abstract statement).
high positive Incentive-Aligned Vehicle-to-Vehicle Energy Trading via Nash... pricing proximity/stability relative to Nash Bargaining benchmark
Testing across 6–100 agents over a 30-day horizon confirms scalability across population size.
Reported experimental sweep over agent population sizes from 6 to 100 across a 30-day horizon (as stated in abstract).
high positive Incentive-Aligned Vehicle-to-Vehicle Energy Trading via Nash... scalability across population size (algorithm performance across agent counts)
Nash-guided price proximity rewards align agent learning toward bargaining-optimal strategies.
Algorithmic design claim from the paper: inclusion of a Nash-guided price-proximity reward to shape agent learning (abstract statement).
high positive Incentive-Aligned Vehicle-to-Vehicle Energy Trading via Nash... alignment of learned strategies to bargaining-optimal strategies
The paper integrates Nash Bargaining Solution into Multi-Agent Deep Deterministic Policy Gradient, creating Nash-MADDPG, where Nash bargaining determines efficient bilateral pricing.
Methodological claim describing the proposed algorithm and role of Nash bargaining (as stated in abstract).
high positive Incentive-Aligned Vehicle-to-Vehicle Energy Trading via Nash... bilateral pricing efficiency (algorithmic pricing)
Nash-MADDPG achieves superior fairness, showing a 40.1% improvement in Jain's index.
Reported fairness metric (Jain's index) improvement in the paper's evaluation over a 30-day horizon (abstract statement).
Nash-MADDPG yields a 62.9% improvement in trading volume over Double Auction.
Reported comparison versus Double Auction in the paper's 30-day continuous-operation evaluation (abstract statement).
Nash-MADDPG improves social welfare by 61.6% over Double Auction in evaluation over 30-day continuous operation.
Simulation evaluation reported in the paper: 30-day continuous operation comparison against Double Auction baseline (as stated in abstract).
The paper proposes a multi-layered governance framework combining core regulatory requirements with supporting ecosystem measures to ensure accountability, security, and transparency in the age of autonomous financial agency.
Policy proposal presented in the paper (concluding recommendation summarized in the abstract).
high positive AI Agents in Payments: Applications, Risks and Regulations proposed governance framework for accountability, security, transparency
ScienceClaw x Infinite provides the auditable artifact and provenance layer for this evaluation.
Paper statement that ScienceClaw x Infinite was used to supply auditable artifacts and provenance for the benchmark.
high positive Cross-domain benchmarks reveal when coordinated AI agents im... availability of auditable artifact and provenance layer
When one signal dominates, as in paradigm-shift detection, coordination mainly improves interpretation and traceability.
Reported result for the historical paradigm-shift detection task indicating limited predictive gains but improved interpretability and provenance when using coordinated agents.
high positive Cross-domain benchmarks reveal when coordinated AI agents im... interpretation and traceability of detection results for paradigm-shift detectio...
Cross-channel composites improve over single-channel baselines: exoplanet vetting reaches AUROC 0.955.
Reported performance metric (AUROC=0.955) for the exoplanet vetting task comparing cross-channel composite to single-channel baselines.
high positive Cross-domain benchmarks reveal when coordinated AI agents im... classifier performance (AUROC) for vetting transiting-exoplanet candidates
When different disciplines each capture only part of the phenomenon, cross-channel composites improve over single-channel baselines: climate-vector emergence reaches AUROC 0.944.
Reported performance metric (AUROC=0.944) for the climate-vector emergence task comparing cross-channel composite to single-channel baselines.
high positive Cross-domain benchmarks reveal when coordinated AI agents im... classifier performance (AUROC) for detecting vector-borne disease emergence
Each case uses a frozen evaluation panel, predefined scoring protocols, explicit baselines, ablations or null controls, and stated limitations.
Methods claim describing evaluation protocol components reported in the paper.
high positive Cross-domain benchmarks reveal when coordinated AI agents im... evaluation protocol completeness
We evaluate this question with a cross-domain benchmark spanning four scientific tasks: mapping molecular structure into musical representations, detecting historical paradigm shifts in science, identifying vector-borne disease emergence, and vetting transiting-exoplanet candidates.
Stated design of the study: description of benchmark tasks in the paper's methods/abstract.
high positive Cross-domain benchmarks reveal when coordinated AI agents im... benchmark scope (four tasks)
Policy implication: governments in emerging economies should support AI-based learning ecosystems, strengthen university-industry collaboration and expand digital literacy programs to accelerate digital competitiveness.
Authors' policy recommendations based on study findings and contextual discussion about Pakistan's IT sector and emerging economies.
high positive Enhancing innovation in Pakistan’s IT sector policy actions to improve digital competitiveness
Organisational intelligence (OI) is a major driver of sustained innovation and helps firms translate learning into commercial outcomes.
Survey measures for OI and IP (N=348) and results from mediation/association analyses indicating OI positively relates to innovation performance and mediates effects of AIDLC/KO.
high positive Enhancing innovation in Pakistan’s IT sector organisational intelligence impact on innovation performance
Knowledge orchestration functions as a critical bridge between AI-driven learning culture and innovation; success depends less on what information is stored and more on how quickly and intelligently it can be used.
Mediation analysis from the cross-sectional survey (N=348) showing KO mediates the relationship between AIDLC and innovation performance; conceptual interpretation in discussion contrasting KO with traditional knowledge management.
high positive Enhancing innovation in Pakistan’s IT sector mediating effect of knowledge orchestration on innovation performance
AI-supported learning environments were linked to greater creativity, experimentation and technological improvement.
Survey responses (N=348) using established measurement scales; authors report associations between AIDLC measures and subcomponents of innovation (creativity, experimentation, technological improvement).
high positive Enhancing innovation in Pakistan’s IT sector creativity / experimentation / technological improvement