The Commonplace
Home Dashboard Papers Evidence Digests 🎲

Evidence (2215 claims)

Adoption
5126 claims
Productivity
4409 claims
Governance
4049 claims
Human-AI Collaboration
2954 claims
Labor Markets
2432 claims
Org Design
2273 claims
Innovation
2215 claims
Skills & Training
1902 claims
Inequality
1286 claims

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome Positive Negative Mixed Null Total
Other 369 105 58 432 972
Governance & Regulation 365 171 113 54 713
Research Productivity 229 95 33 294 655
Organizational Efficiency 354 82 58 34 531
Technology Adoption Rate 277 115 63 27 486
Firm Productivity 273 33 68 10 389
AI Safety & Ethics 112 177 43 24 358
Output Quality 228 61 23 25 337
Market Structure 105 118 81 14 323
Decision Quality 154 68 33 17 275
Employment Level 68 32 74 8 184
Fiscal & Macroeconomic 74 52 32 21 183
Skill Acquisition 85 31 38 9 163
Firm Revenue 96 30 22 148
Innovation Output 100 11 20 11 143
Consumer Welfare 66 29 35 7 137
Regulatory Compliance 51 61 13 3 128
Inequality Measures 24 66 31 4 125
Task Allocation 64 6 28 6 104
Error Rate 42 47 6 95
Training Effectiveness 55 12 10 16 93
Worker Satisfaction 42 32 11 6 91
Task Completion Time 71 5 3 1 80
Wages & Compensation 38 13 19 4 74
Team Performance 41 8 15 7 72
Hiring & Recruitment 39 4 6 3 52
Automation Exposure 17 15 9 5 46
Job Displacement 5 28 12 45
Social Protection 18 8 6 1 33
Developer Productivity 25 1 2 1 29
Worker Turnover 10 12 3 25
Creative Output 15 5 3 1 24
Skill Obsolescence 3 18 2 23
Labor Share of Income 7 4 9 20
Clear
Innovation Remove filter
Expert (per-expert) sizes and overall design are positioned between the GPT-OSS and Qwen3 MoE designs.
Architectural comparison asserted in the paper; claim is based on relative model-design choices (expert count/size) compared to public descriptions of GPT-OSS and Qwen3. The summary provides the positioning but not detailed layer-by-layer comparisons.
medium null result EngGPT2: Sovereign, Efficient and Open Intelligence relative expert size / MoE configuration compared to named architectures
An orchestrator coordinates components with intent-aware routing and layered safety checks, enabling multi-step workflows and productized services.
Paper describes an agentic tool-calling framework and multi-layer orchestrator used for intent-aware routing, defense-in-depth safety validation, and multi-step workflows.
medium null result Fanar 2.0: Arabic Generative AI Stack system orchestration capability (intent-aware routing, layered safety)
Aura is a long-form ASR system capable of handling hours-long audio.
Paper lists Aura in the product stack as 'long-form ASR handling hours-long audio.' Specific evaluation metrics or training data for ASR are not provided in the summary.
medium null result Fanar 2.0: Arabic Generative AI Stack ASR capability (long-form/hours-long audio handling)
Arabic content comprises only about 0.5% of web data despite roughly 400 million native speakers.
Paper cites this data-point to motivate intentional data strategies for Arabic underrepresentation on the web; exact source of the web-proportion not specified in the summary.
medium null result Fanar 2.0: Arabic Generative AI Stack proportion of web data in Arabic (~0.5%)
Methods among the surveyed systems span token-level code generation to circuit-structure generation, and evaluation metrics are often task- and artifact-specific.
Surveyed system descriptions show diversity in generative approaches (token-level language models, graph/diffusion-based circuit generators, agentic optimizers) and corresponding tailored metrics noted in the review.
medium null result Generative AI for Quantum Circuits and Quantum Code: A Techn... range of generative methods and specificity of evaluation metrics
Many early-stage AI advances have not translated into higher Phase II/III success rates.
Synthesis of reported outcomes and failures from industry experience; no new systematic statistical analysis provided.
medium null result Learning from the successes and failures of early artificial... Phase II/III clinical success rates
After roughly a decade of adoption in large biopharma, AI has not yet changed late-stage (Phase II/III) clinical success rates.
Qualitative assessment of industrywide experience and reported outcomes; statement based on narrative review rather than systematic, long-run quantitative analysis or causal estimates.
medium null result Learning from the successes and failures of early artificial... Phase II/III clinical success rates (late-stage trial success probability)
Three primary adoption archetypes in large pharma are (1) partnership-driven acceleration, (2) culture-centric transformation, and (3) production-first democratization.
Conceptual classification in the editorial derived from trends and illustrative examples rather than empirical survey or sampling; no quantitative validation provided.
medium null result AI as the Catalyst for a New Paradigm in Biomedical Research types of organizational approaches to AI adoption
This paper systematically studies the Impact Mechanism of artificial intelligence on the Globalized Division of Labor and reveals the Structural Transformation under Technology Substitution and Data Elements Dual-wheel Drive through Literature Review and Theoretical Analysis.
Methodological claim: supported by the paper's literature review and theoretical analysis; no quantitative sample or empirical design indicated for this specific conclusion in the excerpt.
medium null result Artificial Intelligence and Globalized Division of Labor: Re... identification of mechanisms (technology substitution; data elements dual-wheel ...
AI adoption is not associated with significant changes in operating costs.
Analysis of operating costs in firm financials showing no significant post-adoption change for adopters relative to nonadopters.
medium null result AI and Productivity: The Role of Innovation operating costs
The innovation effects of AI adoption are not concentrated among larger firms, financially unconstrained firms, or high-tech firms.
Heterogeneity tests across firm size, financial constraint status, and industry technology intensity showing no concentration of effects in these groups (as reported in the paper).
medium null result AI and Productivity: The Role of Innovation distribution of treatment effects across firm-size, financial-constraint, and in...
There is a gap in the existing literature regarding empirical evidence about the relationship between AI/Big Data use and market uncertainty during economic downturns.
Paper motivates the study by citing this gap based on its literature review (the summary does not list the reviewed works or systematic review method).
medium null result An Empirical Study on the Impact of the Integration of AI an... Existence of an empirical evidence gap in the literature
Unemployment does not exert a statistically significant impact on GDP growth in the employed model.
Unemployment included among the macroeconomic determinants in the panel regressions but reported as statistically insignificant (no effect) in the provided summary; methods cited include OLS, FE, Difference and System GMM (sample details not included).
medium null result The Role of Artificial Intelligence in Economic Growth: Syst... GDP growth (national GDP growth rate)
Robust methodology (panel VAR and DID) was used to assess the impact of technology and public policy interventions on emissions reductions.
Methods stated in the paper (panel VAR and difference-in-differences); robustness is claimed by the authors based on using these established econometric approaches, though formal robustness checks are not detailed in the summary.
medium null result Digital intelligence for reducing carbon emissions and impro... methodological robustness in estimating effects on emissions
The short‑term effect of AI on labor‑intensive industries is weak.
Short‑run/dynamic subgroup analysis in the China 2003–2017 panel indicating minimal or weak immediate growth effects for labor‑intensive sectors.
medium null result The Impact of Artificial Intelligence Development on Economi... short‑term industry growth rate in labor‑intensive industries
The article clarifies theoretical relationships and gaps between Material Passports, Digital Product Passports, and Digital Building Logbooks.
Theoretical analysis and synthesis section of the SLR where the authors compare concepts and identify overlaps and gaps among MPs, DPPs, and DBLs.
medium null result The Material Passport for a Circular Construction Industry: ... conceptual clarity (relationships/gaps) among MPs, DPPs, and DBLs
Top management support does not have a direct influence on AI Adoption in the sampled firms.
PLS-SEM results from the 207-firm survey showing a non-significant direct path from top management support to AI Adoption (as reported in the paper).
Effort expectancy does not have a direct influence on AI Adoption in the sampled firms.
PLS-SEM results from the 207-firm survey showing a non-significant direct path from effort expectancy to AI Adoption (as reported in the paper).
The paper introduces a novel taxonomy that separates patenting into three domains: core AI, traditional robotics, and AI-enhanced robotics.
Methodological contribution of the paper: construction and application of a classification scheme that assigns patent filings (1980–2019) into three domains (core AI, traditional robotics, AI-enhanced robotics). Data source: patent filings 1980–2019 (aggregate counts by domain and country). Exact number of patents not provided in the summary.
medium null result The "Gold Rush" in AI and Robotics Patenting Activity. Do in... categorization/classification of patent filings into three domains
When green-technology innovation is low (below the threshold), the main measurable effect of DE is on improving carbon emission efficiency (CEE), but DE does not yet reduce per capita emissions (PCE).
Results from the threshold-regression models on the 278-city panel (2011–2022) show that in the low-green-innovation regime DE coefficients are significant for CEE but not for PCE; mediating-effect models corroborate the efficiency channel in low-innovation contexts.
medium null result Digital Economy, Green Technology Innovation and Urban Carbo... Carbon emission efficiency (CEE) and Per capita carbon emissions (PCE)
Realising DT value requires upfront investment in sensors, integration, standards, and skills; economic viability depends on contract structures and how gains are allocated between investors, owners, contractors, and operators.
Synthesis of cost/benefit discussions and case descriptions in the reviewed literature; policy and procurement examples referenced.
medium null result Digital Twins Across the Asset Lifecycle: Technical, Organis... investment requirements and determinants of economic viability
HCI has explored usable consent, but there is no systematic framework for consent in the AI era.
Literature synthesis and gap identification from workshop participants and solicited position papers; no systematic review or meta-analysis with counted studies reported in the summary.
medium null result Moving Beyond Clicks: Rethinking Consent and User Control in... existence of a systematic AI-era consent framework
Research priorities include causal studies on productivity gains from AI, firm‑level adoption dynamics, sectoral labor reallocation, long‑run general equilibrium effects, and heterogeneous impacts across regions and demographic groups.
Set of empirical research recommendations drawn from gaps identified in the literature review and limitations section; not an empirical claim but a prioritized research agenda based on secondary evidence.
medium null result AI and Robotics Redefine Output and Growth: The New Producti... knowledge gaps to be addressed (research outcomes)
Growth‑accounting frameworks and measurement approaches must be updated to capture AI/robotics as intangible and embodied capital, including quality improvements and spillovers.
Methodological argument grounded in literature on measurement challenges and examples of intangible capital; no new measurement exercise or empirical re‑estimation is provided in the paper.
medium null result AI and Robotics Redefine Output and Growth: The New Producti... measurement accuracy of productivity accounts, capture of intangible capital and...
Recommendation for research and modeling: economic models of AI markets should incorporate institutional regime types (centralized vs decentralized), enforcement uncertainty, and legitimacy effects as parameters affecting data access costs, R&D productivity, and market concentration.
Normative recommendation based on the comparative typology and inferred mechanisms from the document analysis; not empirically validated within the study.
medium null result Balancing openness and security in scientific data governanc... modeling parameters (regime type, enforcement uncertainty, legitimacy effects) a...
Theoretical contribution: the paper extends modular coordination theory by treating openness–security trade‑offs as layered, adaptive institutional processes embedded in political regimes and 'legitimacy economies.'
Argumentative/theoretical development in the paper grounded in document analysis and literature on coordination and legitimacy.
medium null result Balancing openness and security in scientific data governanc... theoretical framing / extension of modular coordination theory
The framework offers a replicable model for governments and institutions seeking to proactively support high-potential innovations across sectors.
Paper asserts replicability and applicability to governments/institutions based on the described methods and outputs; no deployment case studies or empirical replication evidence reported in text provided.
medium positive Emerging Technologies Based on Large AI Models and the Desig... replicability and applicability of the framework for proactive policy support
A data-driven, foresight-based approach to policy design significantly enhances responsiveness, precision, and resource efficiency in science and technology governance.
Paper concludes this benefit based on its integrated framework, triangulation, Delphi/AHP validation and illustrative mapping; no quantified comparative metrics or experimental evaluation reported in text provided.
medium positive Emerging Technologies Based on Large AI Models and the Desig... effectiveness of data-driven, foresight-based policy design (responsiveness, pre...
Fostering digital transformation alongside workforce reskilling and innovation-ecosystem development is essential for sustainable industrial growth and strengthening Kazakhstan’s global economic position.
Policy and strategic recommendations based on the study's empirical results, case studies, and macro-level index comparisons.
medium positive Digitalization and labor costs: efficiency of industrial ent... sustainable industrial growth / global economic position
Digital transformation combined with workforce retraining optimizes labor costs and enhances productivity.
Synthesis of enterprise-level case examples and aggregated regression/correlation findings at industry and national levels that link digitalization and retraining programs to labor-cost and productivity indicators.
medium positive Digitalization and labor costs: efficiency of industrial ent... labor costs per unit of production
These findings provide an early empirical baseline and point toward competitive plurality rather than winner-take-all consolidation among engaged users.
Interpretation synthesized from survey results (multi-platform usage, indistinguishable satisfaction among top platforms, differing adoption reasons); overall sample N=388.
medium positive Beyond Benchmarks: How Users Evaluate AI Chat Assistants market structure (likelihood of plurality vs winner-take-all)
Switching costs between platforms are negligible (users treat these tools as interchangeable utilities rather than sticky ecosystems).
Survey responses indicating platform-switching behavior and perceived costs; inference based on reported multi-platform usage and responses about platform loyalty/switching (overall N=388).
medium positive Beyond Benchmarks: How Users Evaluate AI Chat Assistants perceived switching costs / platform stickiness
These results establish agent scaling as a practical and effective axis for HLS optimization.
Synthesis/interpretation of empirical results (including mean 8.27× speedup and per-benchmark gains) reported in the paper.
medium positive Agent Factories for High Level Synthesis: How Far Can Genera... practical effectiveness of scaling the number of agents for HLS optimization
Across benchmarks, agents consistently rediscover known hardware optimization patterns without domain-specific training.
Qualitative and empirical observations across the evaluated benchmarks (12) reporting that agents found recognized hardware optimization patterns despite no hardware-specific training.
medium positive Agent Factories for High Level Synthesis: How Far Can Genera... discovery of known hardware optimization patterns by agents
This work demonstrates the technical feasibility of scalable, AI-augmented quality assessment for early childhood education and lays a foundation for continuous, inclusive AI-assisted evaluation enabling systemic improvement and equitable growth.
Overall results of dataset release, Interaction2Eval performance (agreement), and deployment efficiency reported in the paper; used by the authors to argue broader feasibility and potential systemic impact.
medium positive When AI Meets Early Childhood Education: Large Language Mode... feasibility and systemic impact of AI-augmented assessment
AI-assisted monitoring could shift assessment practice from annual expert audits to monthly AI-assisted monitoring with targeted human oversight.
Authors' synthesis combining dataset-scale results, Interaction2Eval performance (agreement), and deployment efficiency gains to argue feasibility of more frequent monitoring.
medium positive When AI Meets Early Childhood Education: Large Language Mode... frequency of quality monitoring (audit cadence)
Digital transformation enhances the relational embeddedness among cities, and this enhanced relational embeddedness facilitates improved outcomes in collaborative innovation (mediating mechanism).
Mediation analysis / network metric analysis using city-level relational embeddedness measures computed from patent collaboration networks and digital transformation indicators from A-share listed companies (2011–2021).
medium positive How Does Digital Transformation Affect Cross-Regional Collab... relational embeddedness among cities and its mediating effect on collaborative i...
Robust arbitrage strategies remain profitable even when generalized across different domains (claim reiteration emphasizing cross-domain profitability and robustness).
Repeated/strengthened claim in the paper referencing multiple experiments and robustness checks across domains.
medium positive Computational Arbitrage in AI Model Markets cross-domain profitability of arbitrage strategies
An arbitrageur can efficiently allocate inference budget across providers to undercut the market, creating a competitive offering with no model-development risk.
Methodological description and empirical demonstration in the paper showing arbitrageur strategies that allocate inference budget across multiple providers to create a competitive service without incurring model-development risk.
medium positive Computational Arbitrage in AI Model Markets ability to undercut market prices and create competitive offering without model ...
Arbitrage reduces market segmentation and facilitates market entry for smaller model providers by enabling earlier revenue capture.
Reported analysis and/or experiments suggesting arbitrage homogenizes offerings (reduces segmentation) and allows smaller providers to capture revenue earlier through arbitrage-enabled routes.
medium positive Computational Arbitrage in AI Model Markets market segmentation and ease of market entry for smaller model providers
Robust arbitrage strategies that generalize across different domains remain profitable.
Reported experiments indicating that arbitrage strategies generalized beyond the primary SWE-bench domain and still yielded profit (authors state robust strategies remain profitable across domains).
medium positive Computational Arbitrage in AI Model Markets profitability of arbitrage strategies across multiple domains
Arbitrage is viable in AI model markets (we empirically demonstrate the viability of arbitrage and illustrate its economic consequences).
Empirical experiments and analyses presented in the paper (case study on SWE-bench and additional experiments on arbitrage strategies).
medium positive Computational Arbitrage in AI Model Markets viability/profitability and economic impact of arbitrage strategies
This systematic framework can help predict at a detailed level where today's AI systems can and cannot be used and how future AI capabilities may change this.
Interpretive/utility claim: authors argue that the ontology plus classification results serve as rough predictive tools for AI applicability across work activities.
medium positive Where can AI be used? Insights from a deep ontology of work ... predictive usefulness of the ontology for AI applicability across tasks
The results contribute to literature arguing that cloud-based GenAI is a source of enterprise value creation rather than merely an experimental technology.
Paper's stated addition to the existing literature based on the combined empirical and theoretical findings.
medium positive Measuring Business ROI of Generative AI Adoption on Azure Cl... enterprise value creation via GenAI
When compared to baseline approaches, the ARL-based model's accuracy in revenue and price optimization decreased by less than 20%, indicating that it can adapt and optimize pricing techniques in intricate, cutthroat markets.
Reported experimental comparison versus baselines (fixed/rule-based and cost-plus); specific metrics, dataset size, and whether 'decrease' refers to error or accuracy are not clarified in the excerpt.
medium positive The Application of Adaptive Reinforcement Learning in Dynami... accuracy in revenue and price optimization
Our results substantiate the potential of large language models as a foundational pillar for high-fidelity, scalable decision simulation and latter analysis in the real economy based on foundational database.
High-level conclusion drawn from the paper's experiments and methodological contributions; generalization claim asserting LLMs' potential as foundational tools for scalable, high-fidelity decision simulation.
medium positive MALLES: A Multi-agent LLMs-based Economic Sandbox with Consu... potential of LLMs for high-fidelity, scalable decision simulation
Experiments demonstrate that our framework achieves improved simulation stability compared to existing economic and financial LLM simulation baselines.
Empirical claim: experiments vs. baselines showing improved simulation stability (paper statement that framework improved simulation stability, without quantitative details in the excerpt).
Experiments demonstrate that our framework achieves significant improvements in purchase quantity prediction compared to existing economic and financial LLM simulation baselines.
Empirical claim: experiments comparing MALLES against existing baselines; paper reports 'significant improvements' in purchase quantity prediction (no numerical values provided in the excerpt).
medium positive MALLES: A Multi-agent LLMs-based Economic Sandbox with Consu... purchase quantity prediction accuracy
Experiments demonstrate that our framework achieves significant improvements in product selection accuracy compared to existing economic and financial LLM simulation baselines.
Empirical claim: experiments comparing MALLES against existing economic and financial LLM simulation baselines; paper reports 'significant improvements' in product selection accuracy (no numerical values provided in the excerpt).
medium positive MALLES: A Multi-agent LLMs-based Economic Sandbox with Consu... product selection accuracy
This preference-learning approach enables the models to internalize and transfer latent consumer preference patterns, thereby mitigating the data sparsity issues prevalent in individual categories.
Claim based on the paper's reported approach: cross-category post-training and transfer of latent preferences; supported by experiments (paper states mitigation of data sparsity).
medium positive MALLES: A Multi-agent LLMs-based Economic Sandbox with Consu... mitigation of data sparsity through cross-category preference transfer