The Commonplace
Home Dashboard Papers Evidence Syntheses Digests 🎲

Evidence (3492 claims)

Adoption
7395 claims
Productivity
6507 claims
Governance
5877 claims
Human-AI Collaboration
5157 claims
Innovation
3492 claims
Org Design
3470 claims
Labor Markets
3224 claims
Skills & Training
2608 claims
Inequality
1835 claims

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome Positive Negative Mixed Null Total
Other 609 159 77 736 1615
Governance & Regulation 664 329 160 99 1273
Organizational Efficiency 624 143 105 70 949
Technology Adoption Rate 502 176 98 78 861
Research Productivity 348 109 48 322 836
Output Quality 391 120 44 40 595
Firm Productivity 385 46 85 17 539
Decision Quality 275 143 62 34 521
AI Safety & Ethics 183 241 59 30 517
Market Structure 152 154 109 20 440
Task Allocation 158 50 56 26 295
Innovation Output 178 23 38 17 257
Skill Acquisition 137 52 50 13 252
Fiscal & Macroeconomic 120 64 38 23 252
Employment Level 93 46 96 12 249
Firm Revenue 130 43 26 3 202
Consumer Welfare 99 51 40 11 201
Inequality Measures 36 105 40 6 187
Task Completion Time 134 18 6 5 163
Worker Satisfaction 79 54 16 11 160
Error Rate 64 78 8 1 151
Regulatory Compliance 69 64 14 3 150
Training Effectiveness 81 15 13 18 129
Wages & Compensation 70 25 22 6 123
Team Performance 74 16 21 9 121
Automation Exposure 41 48 19 9 120
Job Displacement 11 71 16 1 99
Developer Productivity 71 14 9 3 98
Hiring & Recruitment 49 7 8 3 67
Social Protection 26 14 8 2 50
Creative Output 26 14 6 2 49
Skill Obsolescence 5 37 5 1 48
Labor Share of Income 12 13 12 37
Worker Turnover 11 12 3 26
Industry 1 1
Clear
Innovation Remove filter
The paper's predictions are consistent with empirical observations from scientific productivity data.
Authors state they compare model predictions to scientific productivity data (no sample sizes or dataset details provided in the provided text).
medium positive The Novelty Bottleneck: A Framework for Understanding Human ... consistency with scientific productivity patterns
The paper's predictions are consistent with empirical observations from AI coding benchmarks.
Authors state they compare model predictions to AI coding benchmark results (no sample sizes or specific benchmarks reported in the provided text).
medium positive The Novelty Bottleneck: A Framework for Understanding Human ... consistency with AI coding benchmark performance
AI-enabled ESG ratings, green innovation, ethical AI, RegTech, and explainable AI in finance are becoming highly influential in international financial markets.
Paper identifies these themes as emerging and influential based on trends in the reviewed literature and topical focus areas; no quantitative adoption metrics or sample sizes are provided in the excerpt.
medium positive Artificial intelligence in sustainable finance and Environme... influence/adoption of specific AI-related ESG themes in financial markets
The framework offers a replicable model for governments and institutions seeking to proactively support high-potential innovations across sectors.
Paper asserts replicability and applicability to governments/institutions based on the described methods and outputs; no deployment case studies or empirical replication evidence reported in text provided.
medium positive Emerging Technologies Based on Large AI Models and the Desig... replicability and applicability of the framework for proactive policy support
A data-driven, foresight-based approach to policy design significantly enhances responsiveness, precision, and resource efficiency in science and technology governance.
Paper concludes this benefit based on its integrated framework, triangulation, Delphi/AHP validation and illustrative mapping; no quantified comparative metrics or experimental evaluation reported in text provided.
medium positive Emerging Technologies Based on Large AI Models and the Desig... effectiveness of data-driven, foresight-based policy design (responsiveness, pre...
Fostering digital transformation alongside workforce reskilling and innovation-ecosystem development is essential for sustainable industrial growth and strengthening Kazakhstan’s global economic position.
Policy and strategic recommendations based on the study's empirical results, case studies, and macro-level index comparisons.
medium positive Digitalization and labor costs: efficiency of industrial ent... sustainable industrial growth / global economic position
Digital transformation combined with workforce retraining optimizes labor costs and enhances productivity.
Synthesis of enterprise-level case examples and aggregated regression/correlation findings at industry and national levels that link digitalization and retraining programs to labor-cost and productivity indicators.
medium positive Digitalization and labor costs: efficiency of industrial ent... labor costs per unit of production
These findings provide an early empirical baseline and point toward competitive plurality rather than winner-take-all consolidation among engaged users.
Interpretation synthesized from survey results (multi-platform usage, indistinguishable satisfaction among top platforms, differing adoption reasons); overall sample N=388.
medium positive Beyond Benchmarks: How Users Evaluate AI Chat Assistants market structure (likelihood of plurality vs winner-take-all)
Switching costs between platforms are negligible (users treat these tools as interchangeable utilities rather than sticky ecosystems).
Survey responses indicating platform-switching behavior and perceived costs; inference based on reported multi-platform usage and responses about platform loyalty/switching (overall N=388).
medium positive Beyond Benchmarks: How Users Evaluate AI Chat Assistants perceived switching costs / platform stickiness
These results establish agent scaling as a practical and effective axis for HLS optimization.
Synthesis/interpretation of empirical results (including mean 8.27× speedup and per-benchmark gains) reported in the paper.
medium positive Agent Factories for High Level Synthesis: How Far Can Genera... practical effectiveness of scaling the number of agents for HLS optimization
Across benchmarks, agents consistently rediscover known hardware optimization patterns without domain-specific training.
Qualitative and empirical observations across the evaluated benchmarks (12) reporting that agents found recognized hardware optimization patterns despite no hardware-specific training.
medium positive Agent Factories for High Level Synthesis: How Far Can Genera... discovery of known hardware optimization patterns by agents
This work demonstrates the technical feasibility of scalable, AI-augmented quality assessment for early childhood education and lays a foundation for continuous, inclusive AI-assisted evaluation enabling systemic improvement and equitable growth.
Overall results of dataset release, Interaction2Eval performance (agreement), and deployment efficiency reported in the paper; used by the authors to argue broader feasibility and potential systemic impact.
medium positive When AI Meets Early Childhood Education: Large Language Mode... feasibility and systemic impact of AI-augmented assessment
AI-assisted monitoring could shift assessment practice from annual expert audits to monthly AI-assisted monitoring with targeted human oversight.
Authors' synthesis combining dataset-scale results, Interaction2Eval performance (agreement), and deployment efficiency gains to argue feasibility of more frequent monitoring.
medium positive When AI Meets Early Childhood Education: Large Language Mode... frequency of quality monitoring (audit cadence)
Digital transformation enhances the relational embeddedness among cities, and this enhanced relational embeddedness facilitates improved outcomes in collaborative innovation (mediating mechanism).
Mediation analysis / network metric analysis using city-level relational embeddedness measures computed from patent collaboration networks and digital transformation indicators from A-share listed companies (2011–2021).
medium positive How Does Digital Transformation Affect Cross-Regional Collab... relational embeddedness among cities and its mediating effect on collaborative i...
Robust arbitrage strategies remain profitable even when generalized across different domains (claim reiteration emphasizing cross-domain profitability and robustness).
Repeated/strengthened claim in the paper referencing multiple experiments and robustness checks across domains.
medium positive Computational Arbitrage in AI Model Markets cross-domain profitability of arbitrage strategies
An arbitrageur can efficiently allocate inference budget across providers to undercut the market, creating a competitive offering with no model-development risk.
Methodological description and empirical demonstration in the paper showing arbitrageur strategies that allocate inference budget across multiple providers to create a competitive service without incurring model-development risk.
medium positive Computational Arbitrage in AI Model Markets ability to undercut market prices and create competitive offering without model ...
Arbitrage reduces market segmentation and facilitates market entry for smaller model providers by enabling earlier revenue capture.
Reported analysis and/or experiments suggesting arbitrage homogenizes offerings (reduces segmentation) and allows smaller providers to capture revenue earlier through arbitrage-enabled routes.
medium positive Computational Arbitrage in AI Model Markets market segmentation and ease of market entry for smaller model providers
Robust arbitrage strategies that generalize across different domains remain profitable.
Reported experiments indicating that arbitrage strategies generalized beyond the primary SWE-bench domain and still yielded profit (authors state robust strategies remain profitable across domains).
medium positive Computational Arbitrage in AI Model Markets profitability of arbitrage strategies across multiple domains
Arbitrage is viable in AI model markets (we empirically demonstrate the viability of arbitrage and illustrate its economic consequences).
Empirical experiments and analyses presented in the paper (case study on SWE-bench and additional experiments on arbitrage strategies).
medium positive Computational Arbitrage in AI Model Markets viability/profitability and economic impact of arbitrage strategies
This systematic framework can help predict at a detailed level where today's AI systems can and cannot be used and how future AI capabilities may change this.
Interpretive/utility claim: authors argue that the ontology plus classification results serve as rough predictive tools for AI applicability across work activities.
medium positive Where can AI be used? Insights from a deep ontology of work ... predictive usefulness of the ontology for AI applicability across tasks
The results contribute to literature arguing that cloud-based GenAI is a source of enterprise value creation rather than merely an experimental technology.
Paper's stated addition to the existing literature based on the combined empirical and theoretical findings.
medium positive Measuring Business ROI of Generative AI Adoption on Azure Cl... enterprise value creation via GenAI
When compared to baseline approaches, the ARL-based model's accuracy in revenue and price optimization decreased by less than 20%, indicating that it can adapt and optimize pricing techniques in intricate, cutthroat markets.
Reported experimental comparison versus baselines (fixed/rule-based and cost-plus); specific metrics, dataset size, and whether 'decrease' refers to error or accuracy are not clarified in the excerpt.
medium positive The Application of Adaptive Reinforcement Learning in Dynami... accuracy in revenue and price optimization
Our results substantiate the potential of large language models as a foundational pillar for high-fidelity, scalable decision simulation and latter analysis in the real economy based on foundational database.
High-level conclusion drawn from the paper's experiments and methodological contributions; generalization claim asserting LLMs' potential as foundational tools for scalable, high-fidelity decision simulation.
medium positive MALLES: A Multi-agent LLMs-based Economic Sandbox with Consu... potential of LLMs for high-fidelity, scalable decision simulation
Experiments demonstrate that our framework achieves improved simulation stability compared to existing economic and financial LLM simulation baselines.
Empirical claim: experiments vs. baselines showing improved simulation stability (paper statement that framework improved simulation stability, without quantitative details in the excerpt).
Experiments demonstrate that our framework achieves significant improvements in purchase quantity prediction compared to existing economic and financial LLM simulation baselines.
Empirical claim: experiments comparing MALLES against existing baselines; paper reports 'significant improvements' in purchase quantity prediction (no numerical values provided in the excerpt).
medium positive MALLES: A Multi-agent LLMs-based Economic Sandbox with Consu... purchase quantity prediction accuracy
Experiments demonstrate that our framework achieves significant improvements in product selection accuracy compared to existing economic and financial LLM simulation baselines.
Empirical claim: experiments comparing MALLES against existing economic and financial LLM simulation baselines; paper reports 'significant improvements' in product selection accuracy (no numerical values provided in the excerpt).
medium positive MALLES: A Multi-agent LLMs-based Economic Sandbox with Consu... product selection accuracy
This preference-learning approach enables the models to internalize and transfer latent consumer preference patterns, thereby mitigating the data sparsity issues prevalent in individual categories.
Claim based on the paper's reported approach: cross-category post-training and transfer of latent preferences; supported by experiments (paper states mitigation of data sparsity).
medium positive MALLES: A Multi-agent LLMs-based Economic Sandbox with Consu... mitigation of data sparsity through cross-category preference transfer
Orchestrated systems of smaller, domain-adapted models can mathematically outperform frontier generalist models in most institutional deployment environments.
Formal conditions and comparative analysis derived in the paper plus referenced/claimed empirical support across several domains (frontier lab dynamics, alignment evolution, sovereign AI pressures).
medium positive Punctuated Equilibria in Artificial Intelligence: The Instit... relative institutional performance (smaller domain models vs. frontier generalis...
An increasing number of enterprises are using the label of artificial intelligence merely as a cosmetic embellishment in their annual reports (the phenomenon of 'AI washing' is spreading).
Framing/background claim in the paper's introduction/abstract; implied support from the semantic analysis of annual report texts across Chinese A-share firms over 2006–2024.
medium positive The Spillover Effects of Peer AI Rinsing on Corporate Green ... prevalence/trend of AI washing in annual reports
There are ethical imperatives of fairness and transparency in automated wealth management, and the paper proposes a roadmap toward sustainable and interpretable financial AI.
Normative analysis and proposed roadmap described in the paper; the excerpt does not provide operationalized fairness metrics, interpretability methods, or evaluation results.
medium positive Deep Reinforcement Learning for Dynamic Portfolio Optimizati... ethical compliance measures (fairness, transparency, interpretability) for autom...
In environments characterized by high-frequency data, non-linear dependencies, and stochastic market regimes, autonomous DRL agents can learn optimal sequential decision-making policies that offer a compelling alternative to static or rule-based allocation strategies.
Argument based on theoretical suitability of DRL for sequential decision problems and the paper's system-level investigation; excerpt does not report specific experimental datasets, sample sizes, benchmarks, or performance metrics.
medium positive Deep Reinforcement Learning for Dynamic Portfolio Optimizati... policy optimality / portfolio performance in complex market environments (implie...
The integration of Deep Reinforcement Learning (DRL) into portfolio management represents a significant evolution from classical Mean-Variance Optimization and modern econometric frameworks.
Conceptual comparison and synthesis presented in the paper; no empirical sample size or experimental results are provided in the excerpt to quantify the degree of improvement.
medium positive Deep Reinforcement Learning for Dynamic Portfolio Optimizati... methodological advancement in portfolio management (shift from static optimizati...
Blindfolding (anonymizing identifiers) allows verification of whether meaningful predictive signals persist (i.e., predictions reflect legitimate patterns rather than pre-trained recall of tickers).
Combined methodological-and-result claim: approach described (anonymization) plus stated objective and reported validation (negative controls and reported Sharpe under anonymization). Specific experimental protocol and quantitative results isolating the effect of anonymization are not provided in the excerpt.
medium positive Can Blindfolded LLMs Still Trade? An Anonymization-First Fra... persistence of predictive signal after anonymization (signal legitimacy)
On 2025 year-to-date (through 2025-08-01), the system achieved Sharpe 1.40 +/- 0.22 across 20 random seeds.
Backtest/performance claim: reported Sharpe ratio with reported uncertainty and a sample size of 20 seeds; time window specified as 2025 YTD through 2025-08-01. No further details on portfolio construction, leverage, transaction costs, or benchmark adjustment provided in the excerpt.
medium positive Can Blindfolded LLMs Still Trade? An Anonymization-First Fra... Sharpe ratio (mean and +/- presumably standard error or standard deviation) over...
Regulatory sandboxes offer a flexible and innovation-friendly governance model compared to traditional command-and-control mechanisms.
Normative and comparative analysis within a law & economics framework; no empirical performance data reported in the abstract.
medium positive Experimentalism beyond ex ante regulation: A law and economi... flexibility of governance and degree of innovation-friendliness
Comparative insights from FinTech identify the institutional design features necessary to ensure the effectiveness and resilience of regulatory sandboxes.
Comparative case-based reasoning drawing on FinTech regulatory sandbox experience (abstract does not report number or selection of cases).
medium positive Experimentalism beyond ex ante regulation: A law and economi... presence and performance of institutional design features (effectiveness/resilie...
AI regulatory sandboxes may correct specific government failures, including regulatory capture, rent-seeking, and knowledge gaps.
Analytical claims supported by comparative reasoning (FinTech examples) and economic analysis of government failure; no empirical testing or sample size reported in the abstract.
medium positive Experimentalism beyond ex ante regulation: A law and economi... incidence/severity of government failures such as regulatory capture, rent-seeki...
AI regulatory sandboxes facilitate iterative regulatory learning while promoting responsible AI innovation.
Theoretical argument using experimentalist governance concepts and law & economics reasoning; comparative insights referenced but no empirical sample detailed in the abstract.
medium positive Experimentalism beyond ex ante regulation: A law and economi... degree of regulatory learning and indicators of responsible AI innovation
AI regulatory sandboxes can reduce negative externalities associated with AI deployment.
Conceptual and economic analysis in the paper (no empirical quantification or sample size reported in the abstract).
medium positive Experimentalism beyond ex ante regulation: A law and economi... magnitude/frequency of negative externalities (e.g., harms from AI systems)
AI regulatory sandboxes can mitigate information asymmetries between regulators and firms.
Analytical application of an economic analysis of law framework; theoretical argumentation rather than reported empirical measurement in the abstract.
medium positive Experimentalism beyond ex ante regulation: A law and economi... level of information asymmetry between regulators and AI firms
Partial validation against observed AIS vessel behavior shows PIER is consistent with the fastest real transits while exhibiting 23.1× lower variance.
Comparison between PIER trajectories and observed fastest transits in AIS data (details in paper); reported relative variance reduction of 23.1×.
medium positive Physics-informed offline reinforcement learning eliminates c... variance of transit times or fuel use compared to fastest observed AIS transits
PIER eliminates catastrophic fuel waste: great-circle routing produces extreme fuel consumption (>1.5× median) in 4.8% of voyages, while PIER reduces this to 0.5% (a 9-fold reduction).
Analysis on the same 2023 AIS validation dataset across seven Gulf of Mexico routes (840 episodes per method) comparing distribution tails of voyage fuel consumption; reported incidence rates (4.8% vs 0.5%).
medium positive Physics-informed offline reinforcement learning eliminates c... fraction of voyages with fuel consumption >1.5× median
PIER reduces mean CO2 emissions by 10% relative to great-circle routing.
Offline evaluation using physics‑calibrated environments grounded in historical AIS data and ocean reanalysis products; validation on one full year (2023) of AIS across seven Gulf of Mexico routes with 840 episodes per method; reported mean reduction of 10% and bootstrap 95% CI for mean savings [2.9%, 15.7%].
medium positive Physics-informed offline reinforcement learning eliminates c... mean CO2 emissions per voyage (percent reduction vs great-circle routing)
The results confirm the positive impact of cognitive technologies on the development of entrepreneurial opportunities and innovative activity.
Conclusion drawn from the positive estimated association (0.33 coefficient) and the observed increases in the indices between 2020 and 2024 reported in the paper.
medium positive Innovative Cognitive Tools for Studying Market Opportunities... entrepreneurial opportunities and innovation activity (proxied by the Market Opp...
The Cognitive Tools Index and the Market Opportunity Index were -0.42 and -0.35 in 2020 and 0.94 and 0.92 in 2024, respectively.
Reported observed/computed index values for the years 2020 and 2024 in the study (data source and aggregation method not detailed in the excerpt).
medium positive Innovative Cognitive Tools for Studying Market Opportunities... Cognitive Tools Index and Market Opportunity Index (yearly values for 2020 and 2...
The empirical study for 2020–2024 showed that a one standard unit increase in the Cognitive Tools Index is associated with an average 0.33 increase in the Market Opportunity Index.
Estimated coefficient reported from the panel econometric model over 2020–2024 (model included lags and used instrumental approach; sample size and standard errors not provided in the excerpt).
medium positive Innovative Cognitive Tools for Studying Market Opportunities... Market Opportunity Index (effect of one standard unit change in Cognitive Tools ...
Generative AI functions as a socio‑technical intermediary that facilitates interpretation, coordination, and decision support rather than merely automating discrete tasks.
Thematic analysis and co‑word linkage between terms related to interpretative work, coordination, and decision‑support and technical GenAI terms within the corpus.
medium positive Generative AI and the algorithmic workplace: a bibliometric ... portrayal of GenAI role in organisational processes (socio‑technical intermediar...
The literature indicates a managerial shift away from hierarchical command‑and‑control toward guide‑and‑collaborate paradigms, where managers curate, guide, and coordinate AI‑augmented teams rather than micro‑manage tasks.
Synthesis of themes from the 212‑paper corpus (co‑word and thematic analyses) showing recurrent managerial/behavioural concepts such as autonomy, coordination, and decision‑support tied to GenAI discussions.
medium positive Generative AI and the algorithmic workplace: a bibliometric ... reported dominant managerial paradigm in the literature (guide‑and‑collaborate v...
Standardized data schemas and interoperable protocols reduce transaction costs and increase returns on AI investments; public-good components (shared taxonomies, open benchmarks) will accelerate innovation in DPP ecosystems.
Policy/economic recommendation synthesized from empirical observations about interoperability needs (survey and qualitative inputs) and economic reasoning; not directly measured as an outcome in the study.
medium positive Integrating knowledge management and digital product passpor... reduced transaction costs and increased returns on AI investments contingent on ...
Different consumer segments imply different AI-driven engagement strategies: targeted personalization and recommender systems for 'aware' consumers, and default, nudging, and tangible-benefit signals for 'unaware' consumers.
Derived from k‑means segmentation results and implication discussion linking consumer cluster characteristics to appropriate AI/UX interventions; segmentation is empirical, the AI-prescription is inferential.
medium positive Integrating knowledge management and digital product passpor... recommended AI engagement strategies tailored to consumer segment outcomes