The Commonplace
Home Dashboard Papers Evidence Digests 🎲

Evidence (7156 claims)

Adoption
5126 claims
Productivity
4409 claims
Governance
4049 claims
Human-AI Collaboration
2954 claims
Labor Markets
2432 claims
Org Design
2273 claims
Innovation
2215 claims
Skills & Training
1902 claims
Inequality
1286 claims

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome Positive Negative Mixed Null Total
Other 369 105 58 432 972
Governance & Regulation 365 171 113 54 713
Research Productivity 229 95 33 294 655
Organizational Efficiency 354 82 58 34 531
Technology Adoption Rate 277 115 63 27 486
Firm Productivity 273 33 68 10 389
AI Safety & Ethics 112 177 43 24 358
Output Quality 228 61 23 25 337
Market Structure 105 118 81 14 323
Decision Quality 154 68 33 17 275
Employment Level 68 32 74 8 184
Fiscal & Macroeconomic 74 52 32 21 183
Skill Acquisition 85 31 38 9 163
Firm Revenue 96 30 22 148
Innovation Output 100 11 20 11 143
Consumer Welfare 66 29 35 7 137
Regulatory Compliance 51 61 13 3 128
Inequality Measures 24 66 31 4 125
Task Allocation 64 6 28 6 104
Error Rate 42 47 6 95
Training Effectiveness 55 12 10 16 93
Worker Satisfaction 42 32 11 6 91
Task Completion Time 71 5 3 1 80
Wages & Compensation 38 13 19 4 74
Team Performance 41 8 15 7 72
Hiring & Recruitment 39 4 6 3 52
Automation Exposure 17 15 9 5 46
Job Displacement 5 28 12 45
Social Protection 18 8 6 1 33
Developer Productivity 25 1 2 1 29
Worker Turnover 10 12 3 25
Creative Output 15 5 3 1 24
Skill Obsolescence 3 18 2 23
Labor Share of Income 7 4 9 20
At equilibrium prices in symmetric markets, consumer surplus is improved by cheaper search but may be decreased by more informative search, due to weakened inter-business competition.
Equilibrium price analysis within the theoretical model for symmetric firms; comparative statics showing how search cost and signal informativeness affect pricing, competition intensity, and consumer surplus. No empirical validation reported.
high mixed Agentic Markets: Equilibrium Effects of Improving Consumer S... consumer surplus (under equilibrium pricing)
The market (in the model) tracks indications of fit for searched products and indications of quality for chosen products, thereby guiding subsequent searches.
Model structure and assumptions specified in the paper: an endogenous information-tracking mechanism that records signals from searches and purchases and which then influences future search behavior; presented as part of the theoretical framework rather than empirical evidence.
high mixed Agentic Markets: Equilibrium Effects of Improving Consumer S... information available to guide search (market-tracked signals)
The top four models are statistically indistinguishable (mean score 0.147–0.153) while a clear tier gap separates them from the remaining four models (mean score <= 0.113).
Reported mean performance scores across 8 models and statement of statistical indistinguishability for the top four vs lower-tier four; numerical means provided.
high mixed SWE-PRBench: Benchmarking AI Code Review Quality Against Pul... mean model performance score
Behavioral factors — specifically trust calibration, cognitive load, and affective reactions — shape the transition of corporate AI initiatives from pilot deployments to scalable, sustained use.
Synthesis of human-AI interaction literature integrated with adoption frameworks (TAM and TOE); conceptual linkage rather than new empirical testing in this paper.
high mixed Behavioral Factors as Determinants of Successful Scaling of ... success of pilot-to-production transition (scalability and sustained use)
The proportion of consumers who adopt AI-induced services influences the pricing of those services and through price adjustments will further impact wages across traditional and non-traditional services.
Theoretical development and analysis in the paper via a demand-switching model and a Finite Change General Equilibrium framework introducing AI as a technological shock modeled through price adjustments.
high mixed Artificial Intelligence, Demand Switching and Sectoral Wage ... wages (across traditional and non-traditional services) and service prices
AI accelerates value-chain maturation while creating distinct risks — including professional responsibility tensions and potential system-level externalities.
Conceptual argument and risk analysis in the Article (theoretical reasoning and synthesis of management/ethics literature). No empirical causal estimate reported in the excerpt.
high mixed Rewired: Reconceptualizing Legal Services for the AI Age acceleration of value-chain maturation and emergence of professional responsibil...
The legal profession is at a crossroads, caught between intensifying fears of AI-driven displacement and a generational opportunity for transformation.
Author's synthesis and framing in the Article (conceptual assessment; literature/contextual synthesis). No empirical sample or experiment reported in the excerpt.
high mixed Rewired: Reconceptualizing Legal Services for the AI Age risk of AI-driven displacement and opportunity for transformation in the legal p...
This advantage is contingent upon robust AI governance, ethical frameworks, and the transition from 'pilot-lite' projects to integrated, data-driven 'AI-first' business models.
Conditional claim in the paper linking success to governance, ethics, and organizational integration; appears to be normative/analytical rather than empirical in the abstract.
high mixed The AI Advantage: Strategic Innovation and Global Expansion ... dependency of AI-driven advantage on governance, ethics, and organizational inte...
The paper reframes AI governance as a form of social policy shaped by political and economic institutions.
Conceptual/interpretive claim supported by the authors' comparative analysis and theoretical framing of AI governance alongside social policy dimensions.
high mixed Artificial intelligence governance and social policy diverge... conceptual framing of AI governance as social policy influenced by political-eco...
Although many regions use similar ethical language, substantial differences persist in risk allocation, regulatory enforcement, welfare integration and social protection.
Content analysis of policy documents showing overlap in ethical rhetoric but divergence across coded institutional dimensions related to risk allocation, enforcement, welfare integration and social protection (n=24).
high mixed Artificial intelligence governance and social policy diverge... similarity of ethical language vs. divergence in (a) risk allocation, (b) regula...
Five distinct governance models emerge: rights-based (EU), market-driven (US), state-centric (China), hybrid (Australia–Japan–Singapore) and developmental (India).
Typology derived from coding and index comparison of the 24 policy documents; authors classify regions/countries into five labeled governance models.
high mixed Artificial intelligence governance and social policy diverge... categorical classification of regional AI governance model
The findings show clear and systematic differences in how regions govern AI.
Comparative analysis of coded policy documents (n=24) producing indices that the authors interpret as showing systematic cross-regional differences in governance approaches.
high mixed Artificial intelligence governance and social policy diverge... degree and nature of differences in regional AI governance approaches
The documents are systematically coded across four institutional dimensions and converted into simple indices to compare governance approaches across the regions.
Author-reported method: systematic coding of documents on four institutional dimensions and construction of indices for cross-regional comparison (based on the 24 documents).
high mixed Artificial intelligence governance and social policy diverge... coding across four institutional dimensions and index construction
This study uses a comparative qualitative policy analysis based on 24 key AI policy documents published between 2018 and 2025 across the European Union, United States, China, and Indo-Pacific economies.
Author-stated research design and sample: systematic review/comparative qualitative policy analysis of 24 AI policy documents spanning 2018–2025 covering EU, US, China and Indo-Pacific economies.
high mixed Artificial intelligence governance and social policy diverge... research design and document sample
Actual sharing often contradicted willingness to share (the privacy paradox), with consistently high sharing rates across all conditions.
Observed discrepancy reported in the experimental results (N=240): despite variation in willingness-to-share, behavioral sharing rates remained high and similar across human, white-box AI, and black-box AI conditions.
high mixed Understanding Data-Sharing with AI Systems: The Roles of Tra... discrepancy between stated willingness to share vs actual sharing behavior
Energy policy uncertainty has a nonlinear effect on AI investment: moderate uncertainty fosters innovation, whereas high volatility hinders long-term investment.
Empirical analysis using nonlinear methods (WQR and WQC) on US quarterly data 2013Q1–2024Q4 (48 quarters), assessing distributional asymmetries across quantiles and time–frequency bands.
Machine-readable metrics and open scholarly infrastructure are reshaping scholarly profiles and incentives.
Conceptual and historical discussion referring to platforms and metrics (e.g., arXiv, Google Scholar, ORCID) as mechanisms changing incentives; no new empirical estimates provided.
high mixed A Brief History of AI for Scientific Discovery: Open Researc... changes in scholarly incentives and profile construction due to machine-readable...
That interconnected ecosystem is fundamentally restructuring who can do science (access), how fast discoveries propagate, and what counts as a valid scientific contribution.
Argumentative claim linking infrastructural and tool changes to changes in access, dissemination speed, and norms of contribution. The paper presents examples and narrative but no systematic empirical evaluation or sample.
high mixed A Brief History of AI for Scientific Discovery: Open Researc... access to scientific practice, speed of discovery dissemination, and norms of sc...
The most consequential development is not any single tool but the emergence of an interconnected ecosystem—AI agents, preprint platforms, open source codebases, and citation infrastructure—that forms a feedback loop.
Synthesis/argument based on multiple examples (LLM agents, preprint servers like arXiv, open-source code repositories, citation indices). No quantitative measurement or causal identification reported.
high mixed A Brief History of AI for Scientific Discovery: Open Researc... emergence of an interconnected scientific infrastructure ecosystem
The central tension in AI for science is between automation (building systems that replace human researchers) and augmentation (tools that amplify human creativity and judgement).
Analytical claim based on the paper's review of historical examples and conceptual discussion; no primary data or experimental design reported.
high mixed A Brief History of AI for Scientific Discovery: Open Researc... relationship between automation and augmentation in research practice
Science has repeatedly delegated its bottlenecks to machines—first inference, then search, then measurement, then the full workflow—and each delegation solves one problem while exposing a harder one underneath.
Interpretive historical argument drawing on examples across AI-for-science milestones (e.g., DENDRAL, search and inference systems, measurement automation, and contemporary end-to-end workflows). No quantitative sample or experimental method reported.
high mixed A Brief History of AI for Scientific Discovery: Open Researc... pattern of delegation and emergent bottlenecks in research workflows
Testing revealed AI excels at computational tasks but consistently misses nuanced factors like new construction rent premiums and infrastructure proximity impacts, validating the framework's hybrid structure as essential for professional-grade underwriting.
Findings from the controlled ChatGPT-4 test on the single 150-unit scenario: qualitative and comparative observations showing AI handled computations well but failed to capture specific local-market nuances, leading authors to endorse a hybrid human-AI framework.
Phase Two requires human-led professional validation to correct AI limitations, apply local market knowledge, and integrate risk factors.
Framework description supported by observations from the controlled test where human review was used to correct AI outputs and apply local knowledge (e.g., adjusting for nuanced market factors).
The growth effects of AI are conditional on institutional quality and organizational adaptability.
Theoretical/analytical claim in the paper's framework and supported by the stylized-facts analysis indicating heterogeneity in productivity and growth outcomes by institutional and digital capacity indicators.
high mixed Artificial intelligence, institutional innovation and econom... growth effects of AI (heterogeneity/conditionality by institutions and adaptabil...
AI agents implicate many areas of law, ranging from agency law and contracts to tort liability and labor law.
Legal/policy analysis in the paper enumerating legal domains implicated by AI agents (qualitative analysis; no sample size).
high mixed Regulating AI Agents scope of legal domains implicated by AI agents
Firms of different ownership structures and industries exhibit different responses to the income distribution changes brought by AI (heterogeneous effects).
Paper reports performing grouped regressions by ownership type and industry to identify heterogeneous responses.
high mixed THE IMPACT OF ARTIFICIAL INTELLIGENCE ON ENTERPRISE INCOME D... heterogeneous change in income distribution (e.g., labor share or profit-labor r...
Financing constraints are a key factor that hinder firms' choice of technology level, which alters the corresponding income distribution effect of AI.
Paper posits financing constraint as a moderator and states it is considered in empirical analysis (interaction/moderation tests).
high mixed THE IMPACT OF ARTIFICIAL INTELLIGENCE ON ENTERPRISE INCOME D... change in income distribution effects (e.g., labor share) conditional on financi...
The development of AI may trigger new changes in the interest pattern between corporate profits and labor compensation.
Framed as the central research question/hypothesis; paper conducts empirical tests on firm panel data to evaluate this.
high mixed THE IMPACT OF ARTIFICIAL INTELLIGENCE ON ENTERPRISE INCOME D... relationship between corporate profits and labor compensation (interest pattern)
Artificial intelligence is profoundly reshaping the organizational form, operating model and operating mechanism of enterprises, and bringing unprecedented impact to the income distribution structure within enterprises.
Statement asserted in the paper's introduction/abstract; motivates empirical analysis using panel data of Shanghai and Shenzhen A-share non-financial listed firms (2010–2022).
high mixed THE IMPACT OF ARTIFICIAL INTELLIGENCE ON ENTERPRISE INCOME D... income distribution structure within enterprises (general claim)
These findings contribute to the literature by providing empirical insights from a developing economy, where unique socioeconomic and institutional factors shape the impact of AI.
Scope/claim of contribution based on the study's context (Cambodia) and its dataset (n = 351).
This study employed PLS‐SEM analysis on data from 351 respondents, revealing significant workforce reshaping.
PLS-SEM analysis conducted on survey data (n = 351) as reported in the paper.
The rapid adoption of artificial intelligence (AI) is fundamentally transforming labor markets worldwide, presenting both opportunities and challenges.
Statement made in the paper as background/justification; not based on the study's empirical data.
Traffic performance is sensitive to the distribution of safe time gaps and the proportion of RL vehicles.
Simulation results comparing Fundamental Diagrams across scenarios with different distributions of safe time gaps and shares of RL-controlled vehicles. Number of simulation runs or replicates not stated in the claim text.
high mixed Macroscopic Characteristics of Mixed Traffic Flow with Deep ... traffic performance (e.g., flow, capacity) sensitivity to time-gap distribution ...
Chat intent varies systematically with both the timing of chat relative to search and the category of products later purchased within the same journey.
Cross-tabulation/regression-style descriptive analysis relating classified chat intents to timing (relative to search) and subsequent purchased product categories in journey-level logs.
AI assistance in safety engineering is fundamentally a collaboration design problem rather than merely a software procurement decision: the same tool can either degrade or improve analysis quality depending entirely on how it is used.
Synthesis of the formal framework and analytic results in the paper (theoretical argument; no empirical sample reported).
The paper concludes by discussing open challenges in evaluating harmful manipulation by AI models.
Paper includes a discussion/conclusion section enumerating open challenges; stated in abstract.
high mixed Evaluating Language Models for Harmful Manipulation identification of open research and evaluation challenges
We identify significant differences across our tested geographies, suggesting that AI manipulation results from one geographic region may not generalise to others.
Empirical comparison across three locales (US, UK, India) showing statistically significant differences in manipulation outcomes by geography.
high mixed Evaluating Language Models for Harmful Manipulation geographic variation in manipulative behaviour/effects
Context matters: AI manipulation differs between domains, suggesting that it needs to be evaluated in the high-stakes context(s) in which an AI system is likely to be used.
Comparative analysis across three domains (public policy, finance, health) showing differences in manipulative behaviour and/or impact by domain in the empirical study.
high mixed Evaluating Language Models for Harmful Manipulation variation in manipulative behaviour/effects across use domains
AUROC_2 and M-ratio produce fully inverted model rankings, demonstrating these metrics answer fundamentally different evaluation questions.
Metric comparison across models showing that AUROC_2-based ranking and M-ratio-based ranking are fully inverted in the reported results on the evaluated dataset.
high mixed Do LLMs Know What They Know? Measuring Metacognitive Efficie... model ranking by AUROC_2 versus model ranking by M-ratio
Temperature manipulation shifts Type-2 criterion while meta-d' remains stable for two of four models, dissociating confidence policy from metacognitive capacity.
Experimental manipulation (temperature changes) applied to models; reported result that Type-2 criterion shifted with temperature while meta-d' was stable for two models (out of four) in the 224,000-trial dataset.
high mixed Do LLMs Know What They Know? Measuring Metacognitive Efficie... Type-2 criterion (confidence policy) and meta-d' (metacognitive capacity)
Metacognitive efficiency is domain-specific, with different models showing different weakest domains, invisible to aggregate metrics.
Domain-level analyses reported in the paper showing per-domain M-ratio results and identification of different weakest domains per model, contrasted with aggregate metric behavior.
high mixed Do LLMs Know What They Know? Measuring Metacognitive Efficie... domain-specific metacognitive efficiency (M-ratio) across task domains
Metacognitive efficiency varies substantially across models even when Type-1 sensitivity is similar — Mistral achieves the highest d' but the lowest M-ratio.
Empirical comparison of Type-1 sensitivity (d') and metacognitive efficiency (M-ratio) across the four evaluated LLMs on the 224,000 QA trials; explicit statement that Mistral had highest d' but lowest M-ratio.
high mixed Do LLMs Know What They Know? Measuring Metacognitive Efficie... Type-1 sensitivity (d') and metacognitive efficiency (M-ratio)
The paper's findings deepen the understanding of algorithmic aversion in the context of generative AI and offer practical guidance for creators and platforms navigating transparency versus engagement trade-offs.
Authors' interpretation and conclusions summarized in the abstract, based on the two experiments (study 1: n = 325; study 2: n = 371).
high mixed AI content labeling and user engagement on social media: The... interpretation of experimental results (algorithmic aversion / guidance implicat...
The paper's primary contribution is to combine established ingredients—attention scarcity, free-entry dilution, superstar effects, and preferential attachment—into a unified framework directed at claims about AI-enabled entrepreneurship.
Stated contribution and methodological description in the paper (synthesis and applied formalisation); this is a descriptive/methodological claim rather than an empirical result.
high mixed The Economics of Builder Saturation in Digital Markets n/a (methodological contribution)
Modern pretrained time-series foundation models can forecast without task-specific training, but they do not fully incorporate economic behavior.
Statement in paper's introduction/abstract summarizing prior capabilities and limitations of pretrained time-series foundation models (no experimental sample or numeric evidence provided in the excerpt).
high mixed GARP-EFM: Improving Foundation Models with Revealed Preferen... ability of pretrained time-series models to forecast and degree to which they in...
The governance risk-mitigation effects of AI operate through increasing financial risk exposure.
Authors' mechanism tests indicate a relationship between AI adoption and changes in financial risk exposure measures, which they interpret as a channel affecting executive behavior.
high mixed The risk-mitigation effects of artificial intelligence adopt... financial risk exposure (financial risk/proxy metrics)
Organizational culture and technological readiness moderate the effectiveness of generative AI integration in decision-making processes.
The paper reports moderation effects tested in the SEM framework using survey data from senior managers, decision-makers, and AI adoption specialists (SmartPLS). No numeric moderator effect sizes or sample size provided in the excerpt.
high mixed The Strategic Impact of Generative Artificial Intelligence o... effectiveness of generative AI integration in decision-making (moderation effect...
The paper draws comparisons between inference tokens and established commodities such as electricity, carbon emission allowances, and bandwidth to motivate financialization.
Theoretical comparison and historical analysis (drawing on the historical experience of electricity futures markets and commodity financialization theory) as presented in the paper.
high mixed AI Token Futures Market: Commoditization of Compute and Deri... similarity / comparability to established commodity markets
The effects of financial digital intelligence on the innovative development of strategic emerging industries vary across regions and sectors: there are differences across central, eastern, and western regions and across capital‑intensive and technology‑intensive sectors, while no significant impact is noted in other regions and industries.
Heterogeneity analysis reported on the panel dataset (5,731 observations, 2015–2022) examining regional and industry subsamples (details of subgroup sizes and statistical tests not provided in excerpt).
high mixed Financial Digital Intelligence and Innovative Development of... innovative development of strategic emerging industries (heterogeneous effects b...
Initiatives such as Cassava AI's network of AI factories signal growing interest in adopting AI in Africa, but these projects remain very targeted and continental adoption still requires better coordination between African stakeholders.
Cited example (Cassava AI) in the paper to illustrate nascent initiatives; combined with the authors' qualitative assessment of scope and geographic targeting of such projects.
high mixed Take the Train: Africa at the Crossroad of Modern AI scope and coordination of AI adoption initiatives