The Commonplace
Home Dashboard Papers Evidence Digests 🎲

Evidence (7156 claims)

Adoption
5126 claims
Productivity
4409 claims
Governance
4049 claims
Human-AI Collaboration
2954 claims
Labor Markets
2432 claims
Org Design
2273 claims
Innovation
2215 claims
Skills & Training
1902 claims
Inequality
1286 claims

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome Positive Negative Mixed Null Total
Other 369 105 58 432 972
Governance & Regulation 365 171 113 54 713
Research Productivity 229 95 33 294 655
Organizational Efficiency 354 82 58 34 531
Technology Adoption Rate 277 115 63 27 486
Firm Productivity 273 33 68 10 389
AI Safety & Ethics 112 177 43 24 358
Output Quality 228 61 23 25 337
Market Structure 105 118 81 14 323
Decision Quality 154 68 33 17 275
Employment Level 68 32 74 8 184
Fiscal & Macroeconomic 74 52 32 21 183
Skill Acquisition 85 31 38 9 163
Firm Revenue 96 30 22 148
Innovation Output 100 11 20 11 143
Consumer Welfare 66 29 35 7 137
Regulatory Compliance 51 61 13 3 128
Inequality Measures 24 66 31 4 125
Task Allocation 64 6 28 6 104
Error Rate 42 47 6 95
Training Effectiveness 55 12 10 16 93
Worker Satisfaction 42 32 11 6 91
Task Completion Time 71 5 3 1 80
Wages & Compensation 38 13 19 4 74
Team Performance 41 8 15 7 72
Hiring & Recruitment 39 4 6 3 52
Automation Exposure 17 15 9 5 46
Job Displacement 5 28 12 45
Social Protection 18 8 6 1 33
Developer Productivity 25 1 2 1 29
Worker Turnover 10 12 3 25
Creative Output 15 5 3 1 24
Skill Obsolescence 3 18 2 23
Labor Share of Income 7 4 9 20
Small language models offer privacy-preserving alternatives to frontier models, but their specialization is hindered by fragmented development pipelines that separate tool integration, data generation, and training.
Background claim stated in paper/abstract; no experimental data provided for this statement within the abstract.
high mixed EnterpriseLab: A Full-Stack Platform for developing and depl... privacy-preserving capability and ease of specialization of small LMs (vs fronti...
Extensive synthetic experiments show that policy regularizations reshape the narrative on what is the best DRL method for inventory management.
Paper states results from extensive synthetic experiments that change which DRL methods are considered best under policy regularization; abstract does not provide the experimental sample size, specific methods, or quantitative comparisons.
high mixed DeepStock: Reinforcement Learning with Policy Regularization... relative performance/ranking of DRL methods for inventory management
Implementation of human-replacing technologies leads to significant transformations in skill demand: it reduces reliance on low-skilled labour while increasing demand for qualified engineers, system operators and specialists in digital technologies.
Sector-specific analysis and review of international labour-market studies cited in the article documenting skill-biased effects of automation and digitalization; qualitative assessment for Ukraine's mining and metallurgical sector under workforce shortage conditions.
high mixed Human-replacing technologies as a driver of labour productiv... skill demand composition (shift from low-skilled to high-skilled roles)
Foreign direct investment (FDI) shows an insignificantly positive direct effect on local TFCP but a significantly negative indirect (spillover) effect, attributed to a 'pollution haven' effect.
Spatial Durbin Model estimates for FDI on panel (30 provinces, 2010–2023): direct coefficient positive but not significant; indirect coefficient significantly negative; interpretation given as pollution-haven mechanism.
high mixed Study on the impact of industrial intelligence and the digit... total factor carbon productivity (TFCP)
Industrial intelligence exhibits regional heterogeneity: a significantly negative direct effect in the east, a significantly positive direct effect in the central region, an insignificant direct effect in the west, and positive indirect (spillover) effects in the east and west.
Regional/subsample Spatial Durbin Model analyses dividing the sample into east, central, and west regions (30 provinces, 2010–2023); reported region-specific direct and indirect coefficients and significance levels.
high mixed Study on the impact of industrial intelligence and the digit... total factor carbon productivity (TFCP)
Industrial intelligence has an insignificantly negative direct effect on local TFCP, but its positive spatial spillover effect is significant at the 1% level, producing a significantly positive total effect.
Spatial Durbin Model results for industrial intelligence on panel (30 provinces, 2010–2023): direct coefficient negative and not statistically significant; indirect coefficient positive and significant at 1%; total effect positive and significant.
high mixed Study on the impact of industrial intelligence and the digit... total factor carbon productivity (TFCP)
China's TFCP rose overall from 2010 to 2023 but exhibited a widening regional gap of 'higher in the east, lower in the west'.
Panel data of 30 Chinese provincial-level regions (2010–2023); TFCP measured using an undesirable-output super-efficiency SBM model and summarized temporal and spatial patterns.
high mixed Study on the impact of industrial intelligence and the digit... total factor carbon productivity (TFCP)
The study found a significant transformation of the employment structure under the influence of artificial intelligence.
Empirical analysis using an envelope model ("input" orientation) applied to a sample of European Union countries; the paper reports modeled changes in employment structure attributable to AI diffusion.
high mixed Artificial intelligence as a driver of economic growth: Chal... transformation of employment structure
For AI: a cohesive professional vocabulary formed rapidly in early 2024, but the practitioner population never cohered.
Empirical finding from analysis of the 8.2M resume dataset showing a rapid increase in the vocabulary-cohesion metric around early 2024 while the population-cohesion metric did not show a corresponding rise.
high mixed NLP Occupational Emergence Analysis: How Occupations Form an... vocabulary cohesion (rapid formation) and population cohesion (absence of cohesi...
The framework implies threshold effects in training and capability acquisition: when the teaching horizon lies below the prerequisite depth of the target, additional instruction cannot produce successful completion of teaching; once that depth is reached, completion becomes feasible.
Model-derived threshold result described in the abstract (mathematical analysis of prerequisite depth vs. teaching horizon).
high mixed A Mathematical Theory of Understanding feasibility of successful teaching / completion of instruction
The value of information depends on whether downstream users can absorb and act on it: a signal conveys meaning only to a learner with the structural capacity to decode it (an explanation that clarifies a concept for one user may be indistinguishable from noise to another who lacks the relevant prerequisites).
Conceptual argument motivating the model; theoretical reasoning described in the paper's intro/abstract.
high mixed A Mathematical Theory of Understanding ability to interpret instructional signals / effective information transfer
Automation holds significant potential for modernising tax administration, but its success depends on aligning technological innovation with inclusive policy design and institutional capacity.
Overall conclusion of the literature synthesis of 36 peer-reviewed articles; based on patterns of positive impacts conditional on contextual factors and governance highlighted across the studies.
high mixed The Influence of Automation on Tax Compliance Behaviour overall success/potential of tax administration modernisation
Behavioural responses to automation vary across taxpayer segments: some users embrace automation as a facilitator of compliance while others resist due to perceived opacity and technological anxiety.
Synthesis of behavioural findings from the reviewed literature (36 studies) reporting heterogeneous responses by taxpayer segment, including qualitative reports of resistance and quantitative measures of uptake/adoption.
high mixed The Influence of Automation on Tax Compliance Behaviour taxpayer behavioural response / adoption of automated systems
The effectiveness of automated tax systems is mediated by contingencies including digital literacy, institutional trust, and regulatory clarity.
The review identifies recurring contextual factors across the 36 articles that are reported to moderate or mediate the impact of automation on outcomes (qualitative and quantitative findings cited in the synthesis).
high mixed The Influence of Automation on Tax Compliance Behaviour effectiveness of automated tax systems (e.g., compliance/adoption/effect size)
The study identifies the main AI-enabled mechanisms advancing CE principles in smart manufacturing, waste valorisation, supply-chain transparency, and sustainable design.
Bibliometric network analysis of 196 peer-reviewed articles (2023–2024) and systematic review of 104 studies, per the abstract; identification is presented as a product of these analyses.
high mixed Artificial intelligence as a catalyst for the circular econo... AI-enabled mechanisms advancing circular economy principles (e.g., in smart manu...
AI is not an inherent instrument of justice but a malleable socio-technical force whose equitable outcomes depend on policy design and institutional context.
Interpretation and synthesis of empirical results showing conditional and heterogeneous effects of AI; normative conclusion drawn by authors from observed heterogeneity and mediating channels.
high mixed Artificial intelligence adoption for advancing energy justic... conceptual claim about AI's role in producing equitable outcomes
Governmental structures, labor supply and demand, and incorporation of financial measures act as key intervening variables affecting achieved ROI from GenAI implementations.
Qualitative synthesis and theoretical analysis reported in the paper identifying contextual/intervening variables.
high mixed Measuring Business ROI of Generative AI Adoption on Azure Cl... influence of governance and labor market factors on ROI
There is an evident tension between privacy and security in existing AI governance approaches.
Thematic synthesis and co-occurrence network from the reviewed studies identify trade-offs and tensions reported between privacy-preserving approaches and security requirements.
high mixed AI Governance Risk Tiering for Sustainable Digital Infrastru... presence of trade-offs/tensions between privacy and security in frameworks
Generative AI serves as an effective 'wingman' for employment lawyers, capable of replacing substantial junior associate work while requiring continued human expertise for client counseling, supervision, and final legal advice preparation.
Authors' synthesis of experimental results showing AI-produced substantive analysis plus discussion about remaining limitations (e.g., citation errors) and required human oversight; qualitative assertion about substitutability for junior associate tasks.
high mixed Robot Wingman: Using AI to Assess an Employment Termination potential replacement of junior associate tasks and required human oversight
The fragility of 'Pax Silica' has implications for global capitalism, technological governance, and geopolitical stability.
Analytical inference and concluding assessment based on theoretical framework and comparative analysis; no empirical quantification provided in the abstract.
high mixed The Logistics of Hegemony: Semiconductor Chokepoints, Global... impacts on global capitalism, technological governance, and geopolitical stabili...
The paper proposes new mechanisms through which big data affects individual welfare (beyond simple productivity gains), linking privacy costs, multiplier effects, and R&D transformation patterns.
Theoretical/mechanism development: the paper articulates new channels in its macro theoretical framework describing how data sharing impacts welfare via multiple mechanisms (model construction and analytic discussion; no empirical/sample validation).
high mixed Study on the impact of big data sharing on individuals’ welf... mechanisms linking big data to individual welfare (privacy, multiplier, R&D tran...
Consumption is affected by the multiplier effect and the transformation patterns of R&D.
Theoretical: model analysis links consumption dynamics to a multiplier effect and to how R&D transforms inputs/outputs (comparative statics/dynamics in the theoretical framework).
Individuals’ welfare is influenced by both the privacy cost of big data sharing and their consumption levels.
Theoretical: welfare in the model is specified as a function of consumption and a privacy cost term arising from big data sharing; result follows from analytic derivation within the model (no empirical/sample data).
high mixed Study on the impact of big data sharing on individuals’ welf... individuals' welfare (as affected by privacy cost and consumption)
PPS gains are task-dependent: gains are large in high-ambiguity business analysis tasks but reverse in low-ambiguity travel planning tasks.
Task-level analysis across the three domains (business, technical, travel) within the controlled study (60 tasks total); authors report differential performance patterns by domain/ambiguity.
high mixed Evaluating 5W3H Structured Prompting for Intent Alignment in... relative_performance_by_task_domain (PPS vs baselines)
Capability and trust formally diverge beyond a critical scale (Capability-Trust Divergence).
Claim of a formal proof in the paper (mathematical / theoretical demonstration). No empirical sample size reported in the excerpt.
high mixed The Institutional Scaling Law: Non-Monotonic Fitness, Capabi... capability and trust as functions of model scale
The Institutional Scaling Law shows that institutional fitness -- jointly measuring capability, trust, affordability, and sovereignty -- is non-monotonic in model scale, with an environment-dependent optimum N*(ε).
Theoretical derivation / analytic model presented in the paper (formal derivation of an 'Institutional Scaling Law'). No empirical sample size reported in the excerpt.
high mixed The Institutional Scaling Law: Non-Monotonic Fitness, Capabi... institutional fitness (composite of capability, trust, affordability, sovereignt...
AI usage has dual effects on employees: it can both enhance innovative behavior and predict disengagement, as revealed by a dual-path (SOR-based) model.
Interpretation/synthesis from the four-stage longitudinal study of 285 finance professionals using a dual-path model based on SOR theory (combining the mediation and moderation results).
high mixed Autonomous enhancement or emotional depletion? The dual-path... innovative work behavior and work disengagement behavior (dual outcomes)
We evaluate 14 LLMs under zero-shot prompting and retrieval-augmented settings and witness a clear performance gap.
Experimental evaluation reported in the paper: authors state they ran experiments on 14 different large language models, under zero-shot and retrieval-augmented configurations, and observed differing performance across models.
high mixed FinTradeBench: A Financial Reasoning Benchmark for LLMs model performance on financial reasoning benchmark (accuracy/score across models...
Artificial intelligence embedded in human decision-making can either enhance human reasoning or induce excessive cognitive dependence.
Stated as a conceptual claim in the paper's introduction/abstract; supported by the paper's conceptual framing (theoretical argument), no empirical sample or experimental data reported here.
high mixed Cognitive Amplification vs Cognitive Delegation in Human-AI ... human reasoning quality / cognitive dependence
Policy implication: smarter, better-coordinated green governance is needed to address the negative local impacts and the crowding-out interaction between AI and environmental regulation.
Policy recommendation drawn in the abstract based on the empirical spatial findings (negative local effects and negative interaction).
high mixed How artificial intelligence and environmental regulation inf... governance/policy recommendation
Substantial regional gaps persist: leading eastern provinces approach a UCEE value of 1.0 while some northeastern provinces remain below 0.1.
Regional UCEE index estimates from the Super-SBM model across the 30 provinces reported in the abstract.
high mixed How artificial intelligence and environmental regulation inf... UCEE index (regional/provincial levels)
The systemic implications of AI in finance depend less on model intelligence alone than on how agent architectures are distributed, coupled, and governed across institutions.
Central argumentative claim supported by the AFMM conceptual model and an illustrative empirical application described in the paper (modeling + event-study approach); no full-sample details provided in the excerpt.
high mixed AI Agents in Financial Markets: Architecture, Applications, ... systemic implications / market-level risk and stability as a function of archite...
The Agentic Financial Market Model (AFMM), a stylised agent-based representation, links agent design parameters (autonomy depth, heterogeneity, execution coupling, infrastructure concentration, supervisory observability) to market-level outcomes including efficiency, liquidity resilience, volatility, and systemic risk.
Presentation of a stylised agent-based model (AFMM) in the paper; conceptual modelling linking specified agent parameters to macro/market outcomes. No empirical sample size reported in the excerpt.
high mixed AI Agents in Financial Markets: Architecture, Applications, ... market-level outcomes (efficiency, liquidity resilience, volatility, systemic ri...
Financial AI agents can be described by a four-layer architecture covering data perception, reasoning engines, strategy generation, and execution with control.
Conceptual framework proposed by the authors (theoretical/architectural proposal); no empirical testing or sample size provided.
high mixed AI Agents in Financial Markets: Architecture, Applications, ... architectural decomposition of financial AI agents
These productivity gains are most pronounced for lower-skilled workers, producing a pattern the authors call “skill compression.”
Cross-study pattern reported in the literature review: comparative evidence across worker-skill strata in multiple empirical papers showing larger relative gains for lower-skilled/junior workers; specific underlying studies and sample sizes are not enumerated in the brief.
high mixed AI, Productivity, and Labor Markets: A Review of the Empiric... relative productivity/gains by worker skill level (leading to 'skill compression...
Financial well-being is not an automatic byproduct of automated credit efficiency but an emergent outcome of architectural alignment among technology, borrower capability, and governance structures.
Theoretical conclusion drawn from empirical results showing mixed effects (positive on repayment and resilience, negative on stress) and significant moderation by human capability and institutional design.
high mixed Architecting financial well-being in algorithmic credit syst... multidimensional financial well-being (conceptual outcome)
Study 1 quantifies confirmation bias through controlled experiments on 250 CVE vulnerability/patch pairs evaluated across four state-of-the-art models under five framing conditions for the review prompt.
Controlled experiment described in the paper: 250 CVE vulnerability/patch pairs evaluated across four state-of-the-art LLMs under five prompt framing conditions.
high mixed Measuring and Exploiting Confirmation Bias in LLM-Assisted S... confirmation bias as measured by vulnerability detection performance
Lightweight safeguards can reduce risk in some settings but do not consistently prevent severe failures.
Analysis of simulated interventions/safeguards within governance simulations showing reductions in certain risk metrics in some scenarios, but persistence of severe failures in others; assessment based on rubric-judged transcript segments.
high mixed I Can't Believe It's Corrupt: Evaluating Corruption in Multi... risk of rule-breaking/abuse and severity of failures under safeguards
There are large differences in corruption-related outcomes across governance regimes and specific model–governance pairings.
Observed heterogeneity in outcomes across different authority structures and model–governance pairings within the multi-agent simulations, evaluated via rubric-based scoring over 28,112 transcript segments.
high mixed I Can't Believe It's Corrupt: Evaluating Corruption in Multi... variation in corruption-related outcomes across regimes and pairings
These findings challenge the narrative of complete automation by AI and underscore the enduring importance of human expertise in data science.
Interpretation based on competition results where AI-only baselines underperformed relative to many participant teams and top solutions used human-AI collaboration.
high mixed AgentDS Technical Report: Benchmarking the Future of Human-A... implications for automation vs. human expertise
These findings indicate a misalignment between the perceived benefit of AI writing and an implicit, consistent effect on the semantics of human writing, with potential implications for cultural and scientific institutions.
Synthesis and interpretation of the paper's empirical results (user study, essay revision experiments, and peer-review analysis); presented as the paper's broader conclusion.
high mixed How LLMs Distort Our Written Language alignment between perceived benefits and actual semantic effects of AI writing; ...
Socioeconomic regression analysis confirms strong correlations between neighborhood racial composition and detection likelihood: Pearson r = 0.83 for percent White and r = -0.81 for percent Black.
Reported Pearson correlation coefficients from regression analysis between neighborhood racial composition variables and detection likelihood in the simulations.
high mixed Unmasking Algorithmic Bias in Predictive Policing: A GAN-Bas... correlation between neighborhood racial composition and detection likelihood
A Conditional Tabular GAN (CTGAN) debiasing approach partially redistributes detection rates but cannot eliminate structural disparity without accompanying policy intervention.
Experimental comparison between baseline simulations and CTGAN-debiased synthetic data showing partial redistribution of detection rates; paper asserts remaining structural disparities.
high mixed Unmasking Algorithmic Bias in Predictive Policing: A GAN-Bas... effect of CTGAN debiasing on detection rate distribution / structural disparity
Regional analysis shows inland regions remain capital-dependent, with an estimated (capital) elasticity of approximately 0.43.
Regional decomposition/estimation reported in the study comparing inland regions to coastal ones using the extended production function.
high mixed Analysis of China's Economic Growth Drivers: An Empirical St... capital elasticity in inland regions (≈0.43)
The paper formalizes the distinction using a signal-aggregation model in which an organization maintains an anchor belief and achieves agreement through two exclusion channels: (1) report shrinkage toward the anchor and (2) a tolerance rule that discards reports deviating beyond a threshold.
Analytical formal model presented in the paper specifying an anchor belief and two exclusion mechanisms; model assumptions and mechanisms are explicit in the theoretical development. No empirical sample.
high mixed Cohesion as Concentration: Exclusion-Driven Fragility in Fin... mechanisms producing agreement (report shrinkage, tolerance-based discarding)
Organizational cohesion is observationally ambiguous: it can arise either from genuine information integration (debate and synthesis of heterogeneous inputs) or from exclusionary processes (conformity pressure, gatekeeping, intolerance of dissent).
Conceptual argument and formal definition in the paper framing; supported by the analytic distinction introduced in the paper between integration and exclusion as alternative generative mechanisms for observed agreement. No empirical sample—argument is theoretical and illustrated by model construction.
high mixed Cohesion as Concentration: Exclusion-Driven Fragility in Fin... source of observed cohesion (integration versus exclusion)
The authors identify ten evaluation practices that teams use, ranging from lightweight interpretive checks to formal organizational processes (examples: qualitative user reviews, red-team testing, A/B experiments, telemetry/log analysis, structured annotation, governance/meta-evaluation).
Thematic coding of 19 interview transcripts produced a taxonomy enumerating ten practices (paper reports the taxonomy as an outcome).
high mixed Results-Actionability Gap: Understanding How Practitioners E... taxonomy/count and description of evaluation practices
Quantum-driven growth depends critically on adoption rates, infrastructure readiness, complementary investments (digital infrastructure, human capital), and enabling policy/regulatory environments.
Scenario framework that varies (a) technical timelines, (b) sectoral adoption rates (diffusion models), (c) infrastructure readiness, and (d) policy environments; policy counterfactual modeling shows sensitivity of adoption and macro outcomes to these parameters.
high mixed Modeling Macroeconomic Output Gains from Quantum-Driven Prod... realized productivity gains, adoption rates, speed of diffusion
The magnitude and timing of macroeconomic impact from quantum computing are highly uncertain.
Monte Carlo / scenario ensemble results showing wide (fat-tailed) outcome distributions driven by uncertainty in technical milestones, adoption rates, and complementarity strengths; use of expert elicitation to parameterize tail risks.
high mixed Modeling Macroeconomic Output Gains from Quantum-Driven Prod... distribution of macroeconomic outcomes (GDP growth, TFP), timing of impacts
Policymakers face trade-offs between promoting innovation and market efficiency on one hand and protecting privacy, fairness, and national security on the other; economic analysis can inform calibration.
Normative policy analysis and synthesis of literature on digital regulation and trade-offs; supported by comparative observations of regulatory priorities across jurisdictions.
high mixed Path Analysis of Digital Economy and Reconstruction of Inter... policy trade-offs (innovation vs. privacy/fairness/security) and associated welf...