The Commonplace
Home Dashboard Papers Evidence Syntheses Digests 🎲

Evidence (13870 claims)

Adoption
8467 claims
Productivity
7558 claims
Governance
6805 claims
Human-AI Collaboration
6363 claims
Org Design
4132 claims
Innovation
4065 claims
Labor Markets
3526 claims
Skills & Training
2945 claims
Inequality
2066 claims

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome Positive Negative Mixed Null Total
Other 749 196 98 892 1984
Governance & Regulation 817 394 188 121 1544
Organizational Efficiency 771 189 124 83 1177
Technology Adoption Rate 627 233 123 96 1088
Research Productivity 411 123 56 332 933
Output Quality 467 178 59 47 751
Decision Quality 320 174 75 42 618
Firm Productivity 435 55 88 20 604
AI Safety & Ethics 214 276 65 33 593
Market Structure 178 167 122 24 496
Task Allocation 207 64 71 32 379
Skill Acquisition 165 59 60 17 301
Innovation Output 203 27 43 18 292
Employment Level 105 52 107 13 279
Fiscal & Macroeconomic 131 69 43 26 276
Consumer Welfare 116 63 42 11 232
Firm Revenue 150 48 26 3 227
Inequality Measures 44 122 49 6 221
Task Completion Time 169 29 8 12 219
Worker Satisfaction 89 63 20 12 184
Error Rate 69 92 10 2 173
Regulatory Compliance 76 68 14 5 163
Training Effectiveness 93 21 13 19 148
Wages & Compensation 77 36 25 6 144
Automation Exposure 51 54 22 12 142
Team Performance 86 17 27 9 140
Developer Productivity 94 17 14 6 132
Job Displacement 12 80 20 1 113
Hiring & Recruitment 51 7 8 3 69
Creative Output 31 17 7 3 59
Skill Obsolescence 5 46 6 1 58
Social Protection 27 16 8 2 53
Labor Share of Income 17 17 17 51
Worker Turnover 11 12 3 26
Industry 1 1
As a secondary external check, we evaluate the consistency of reconstructed signals against stock-price data for a multi-firm dataset of AI-related news titles (November 2024 to February 2026).
Empirical evaluation reported in the paper using reconstructed signals compared to stock-price time series over the specified date range; described as a 'multi-firm' dataset (exact number of firms not stated in the abstract).
high positive Causal Reconstruction of Sentiment Signals from Sparse News ... consistency (relationship) between reconstructed sentiment signals and stock pri...
Because ground-truth longitudinal sentiment labels are typically unavailable, we introduce a label-free evaluation framework based on signal stability diagnostics, information preservation lag proxies, and counterfactual tests for causality compliance and redundancy robustness.
Methodological contribution described in the paper (evaluation framework proposal).
high positive Causal Reconstruction of Sentiment Signals from Sparse News ... evaluation of reconstructed sentiment signals without labeled longitudinal senti...
We present a modular three-stage pipeline that (i) aggregates article-level scores onto a regular temporal grid with uncertainty-aware and redundancy-aware weights, (ii) fills coverage gaps through strictly causal projection rules, and (iii) applies causal smoothing to reduce residual noise.
Description of proposed algorithm/pipeline in the paper (design/implementation claim).
high positive Causal Reconstruction of Sentiment Signals from Sparse News ... method for producing stable temporal sentiment series
Rather than treating this as a classification challenge, we propose to frame it as a causal signal reconstruction problem: given probabilistic sentiment outputs from a fixed classifier, recover a stable latent sentiment series that is robust to the structural pathologies of news data such as sparsity, redundancy, and classifier uncertainty.
Methodological proposal presented in the paper (conceptual framing and problem statement).
high positive Causal Reconstruction of Sentiment Signals from Sparse News ... quality/stability of reconstructed latent sentiment series from classifier outpu...
Automatic speech recognition (ASR) has shown increasing potential to assist in the transcription of endangered language data.
Background claim in the paper, referring to advances in ASR and prior work suggesting utility for endangered-language transcription; stated as motivation rather than a novel empirical finding in this paper.
high positive Automatic Speech Recognition for Documenting Endangered Lang... utility/potential of ASR for endangered-language transcription
We train an ASR model that achieves a character error rate as low as 15%.
Reported quantitative evaluation of the trained ASR model on the constructed Ikema dataset (character error rate = 15%). Exact evaluation protocol, test set size, and train/test split not provided in the abstract.
We construct a {\totaldatasethours}-hour speech corpus from field recordings.
Stated in paper as an outcome of the authors' data-collection and corpus-construction effort from field recordings; no numeric value resolved in the provided text (placeholder present).
high positive Automatic Speech Recognition for Documenting Endangered Lang... size of speech corpus (hours)
With calibrated oversight that aligns accountability to real-world risks, AI can secure the profession’s future.
Normative/prognostic claim in the Article (argument that appropriate governance will preserve or strengthen the legal profession).
high positive Rewired: Reconceptualizing Legal Services for the AI Age long-term resilience/stability of the legal profession
With calibrated oversight that aligns accountability to real-world risks, AI can improve service quality in legal services.
Normative/prognostic claim in the Article (argument that governance plus AI yields quality improvements). No empirical effect sizes reported in the excerpt.
high positive Rewired: Reconceptualizing Legal Services for the AI Age service quality of legal services
While the risks of AI are real, they must not eclipse the opportunity: with calibrated oversight that aligns accountability to real-world risks, AI can expand access to legal services.
Normative claim and projected benefit argued by the authors (theoretical/argumentative; no empirical evidence in excerpt).
high positive Rewired: Reconceptualizing Legal Services for the AI Age expansion of access to legal services
Using agentic financial transactions as an example, we demonstrate how governments and regulators can use this monitoring method to extend oversight beyond model outputs to the tool layer to monitor risks of agent deployment.
Paper includes a case demonstration (agentic financial transactions) showing application of MCP monitoring to identify and assess risky tool deployments and to inform regulatory oversight.
high positive How are AI agents used? Evidence from 177,000 MCP tools feasibility of a monitoring approach for regulatory oversight at the tool layer
The share of 'action' tools rose from 27% to 65% of total usage over the 16-month period sampled.
Time-series usage/download data from MCP servers across the 16-month sample (paper reports increase in share of action tools from 27% to 65%).
high positive How are AI agents used? Evidence from 177,000 MCP tools share of 'action' tools as fraction of total usage/downloads
Software development accounts for 90% of MCP server downloads.
Download metrics from monitored MCP servers stratified by tool domain indicating 90% of downloads are for software development tools (paper statement).
high positive How are AI agents used? Evidence from 177,000 MCP tools share of MCP server downloads attributed to software development tools
Software development accounts for 67% of all agent tools.
Categorisation of the 177,436 monitored agent tools by task domain (O*NET mapping) yielding 67% in software development.
high positive How are AI agents used? Evidence from 177,000 MCP tools share of agent tools in the software development task domain
We evaluated 177,436 agent tools created from 11/2024 to 02/2026 by monitoring public Model Context Protocol (MCP) server repositories.
Empirical monitoring of public MCP server repositories; dataset of 177,436 agent tools collected over the period 11/2024–02/2026 (as stated in paper).
high positive How are AI agents used? Evidence from 177,000 MCP tools number of agent tools observed
The framework provides a roadmap for coordinated response across educational institutions, government agencies, and industry to ensure workforce resilience and domestic leadership in the emerging agentic finance era.
Authors' proposed integrated roadmap (prescriptive recommendation; no empirical testing or outcome measurement reported in the provided text).
high positive STRENGTHENING FINANCIAL WORKFORCE COMPETITIVENESS: A CURRICU... workforce resilience and domestic leadership in agentic finance
We develop a comprehensive government policy framework including: 1) Federal AI literacy mandates for post-secondary business education; 2) Department of Labor workforce retraining programs with income support for displaced financial professionals; 3) SEC and Treasury regulatory innovations creating market incentives for workforce development; 4) State-level workforce partnerships implementing regional transition support; and 5) Enhanced social safety nets for workers navigating career transitions during the estimated 5-15 year transformation period.
Author-presented policy framework and recommendations (policy design proposals and an asserted 5–15 year transformation timeframe; no empirical evaluation reported).
high positive STRENGTHENING FINANCIAL WORKFORCE COMPETITIVENESS: A CURRICU... policy adoption and worker support measures during technological transition
We propose a multi-layered integration strategy for higher education encompassing: 1) Foundational AI literacy modules for all business students; 2) A specialized "Agentic Financial Planning" course with hands-on labs; 3) AI-augmented redesign of core courses (Investments, Portfolio Management, Ethics); 4) Interdisciplinary project-based learning with Computer Science; and 5) A governance and policy module addressing regulatory compliance (NIST AI RMF, SEC regulations).
Proposed curricular framework presented by the authors (recommendation/proposal, not empirically tested within the paper).
high positive STRENGTHENING FINANCIAL WORKFORCE COMPETITIVENESS: A CURRICU... student AI-related skills and preparedness for agentic finance roles
The ultimate competitive edge lies in an organization's ability to treat AI not as a standalone tool, but as a core component of sustainable, long-term corporate strategy.
Concluding normative claim in the paper; presented as an interpretation/synthesis rather than supported by cited empirical evidence in the abstract.
high positive The AI Advantage: Strategic Innovation and Global Expansion ... competitive advantage derived from integrating AI into corporate strategy
Successful global expansion is no longer predicated solely on physical presence but on the deployment of scalable, localized AI models that navigate diverse regulatory, linguistic, and cultural landscapes.
Argumentative claim in the paper describing a strategic determinant for global expansion; no empirical sample or quantified outcomes presented in the abstract.
high positive The AI Advantage: Strategic Innovation and Global Expansion ... drivers of successful global expansion (physical presence vs. localized AI deplo...
AI hyper-personalizes customer engagement.
Declarative claim in the paper about AI's effect on customer engagement personalization; no experimental or observational data reported in the abstract.
high positive The AI Advantage: Strategic Innovation and Global Expansion ... degree of personalization in customer engagement
AI acts as an internal engine for operational agility by compressing R&D cycles.
Claim made in the paper asserting R&D cycle compression due to AI; no empirical data, sample size or quantitative measures provided in the abstract.
high positive The AI Advantage: Strategic Innovation and Global Expansion ... length/duration of R&D cycles (time-to-iteration)
The strategic focus has transitioned from mere process automation to autonomous orchestration, where multi-agent systems independently manage complex, cross-border operations and real-time decision-making.
Analytic statement from the paper describing an observed/argued shift in strategic focus; no empirical methodology or sample reported.
high positive The AI Advantage: Strategic Innovation and Global Expansion ... shift in strategic focus from automation to autonomous orchestration via multi-a...
Organizations leverage agentic workflows and domain-specific intelligence to catalyse strategic innovation and facilitate global expansion in the digital era.
Conceptual claim in the paper describing how organizations use specific AI capabilities; no empirical design or sample described in the abstract.
high positive The AI Advantage: Strategic Innovation and Global Expansion ... use of agentic workflows and domain-specific models to drive innovation and glob...
The rapid evolution of Artificial Intelligence (AI) has shifted from a disruptive trend to the fundamental operating layer of the modern enterprise.
Statement/assertion in the paper (conceptual/positioning claim); no empirical method, sample size, or statistical analysis reported in the abstract.
high positive The AI Advantage: Strategic Innovation and Global Expansion ... role of AI in enterprise operations (from peripheral/disruptive to core/operatin...
The analysis provides a transparent measurement framework and baseline statistics for tracking the emerging shift from AI discussion to action-oriented, agentic deployments in finance.
Methodological contribution claim: presentation of an auditable dictionary-and-context approach plus reported baseline statistics (percentages by year).
high positive Measuring agentic AI adoption and control frameworks in fina... availability of a measurement framework and baseline statistics for tracking age...
Autonomy evidence focuses on regions with higher control density, consistent with governance maturity serving as a prerequisite for action-taking deployments.
Comparative text-as-data analysis showing agentic/autonomy references concentrated in disclosure windows with higher measured controls density; interpretive claim linking this pattern to governance maturity as a prerequisite.
high positive Measuring agentic AI adoption and control frameworks in fina... co-location/correlation of autonomy evidence with higher controls density in dis...
Agentic disclosures are absent in 2021–2023, appear in 2024 (0.4% of firm-years), and increase in 2025 (1.6% of firm-years), indicating a late but accelerating diffusion phase.
Empirical counts/percentages reported from the assembled panel; per-year denominators are 500 firm–year observations (500 firms per year).
high positive Measuring agentic AI adoption and control frameworks in fina... frequency (share) of firm–years with agentic disclosures
We implement an auditable dictionary-and-context approach that flags agentic references and then quantifies the surrounding 'controls density' (governance and safety language) within the same local disclosure window.
Methods description: dictionary-and-context text-as-data approach and a quantified 'controls density' metric applied to filings.
high positive Measuring agentic AI adoption and control frameworks in fina... presence of agentic references and measured controls density in disclosure text
We assemble a balanced panel of 2,500 firm–year observations (500 firms per year) from 2021–2025.
Stated dataset construction in the paper: balanced panel across years with 500 firm–year observations per year, total 2,500 firm–years.
high positive Measuring agentic AI adoption and control frameworks in fina... dataset size and composition (firm–year observations)
Agentic artificial intelligence (AI) systems can execute actions rather than merely generate content.
Conceptual/definitional statement in the paper framing agentic AI as systems that execute actions (not an empirical test).
high positive Measuring agentic AI adoption and control frameworks in fina... ability of AI systems to execute actions versus generate content
Transparency’s effectiveness in promoting data-sharing is amplified by, and dependent upon, user trust; fostering trust in AI may be a more vital prerequisite for data-sharing than implementing transparent designs.
Synthesis of experimental findings (N=240): transparency increased willingness only among users with pre-existing trust; null effect of transparency alone on actual sharing; authors conclude that trust moderates transparency effects and recommend focusing on trust-building.
high positive Understanding Data-Sharing with AI Systems: The Roles of Tra... recommendation/policy implication regarding trust vs transparency for promoting ...
Immediate sharing decisions were largely driven by intuitive System 1 processing rather than deliberative evaluation (System 2).
Interpretation of the pattern in experimental data (N=240): high, similar sharing rates across conditions despite differing stated willingness-to-share and measured privacy concerns; authors attribute this to dual-process dynamics (System 1 driving immediate behavior).
high positive Understanding Data-Sharing with AI Systems: The Roles of Tra... dominance of intuitive (System 1) processing in immediate sharing behavior
The positive effect of transparency on willingness to share was contingent on pre-existing user trust in AI, particularly for white-box systems.
Moderation analyses reported from the experiment (N=240): interaction between transparency (white-box vs black-box) and measured pre-existing trust in AI showed increased willingness-to-share only among users with higher trust, with the effect most pronounced for white-box systems.
high positive Understanding Data-Sharing with AI Systems: The Roles of Tra... willingness to share (stated/deliberative sharing intention)
We conducted a pre-registered online experiment (N=240) where participants interacted with a fictional sleep-optimization app and were randomly assigned to scenarios where data was processed by either a human expert, a transparent white-box AI, or an opaque black-box AI.
Pre-registered online experimental design described in paper; random assignment to three processing-entity conditions (human, white-box AI, black-box AI); sample size reported as N=240; measured outcomes included actual data-sharing and willingness to share, plus trust and privacy concerns.
high positive Understanding Data-Sharing with AI Systems: The Roles of Tra... experimental manipulation / treatment assignment and measurement of sharing outc...
A Metacognitive Co-Regulation Agent (in CRDAL) assists the Design Agent in metacognition to mitigate design fixation, thereby improving system performance for engineering design tasks.
Mechanistic claim supported by the paper's experimental results on the battery pack design problem showing CRDAL outperforming SRL and RWL; detailed measures of fixation reduction not provided in the excerpt.
high positive Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regul... reduction in design fixation / improvement in performance due to co-regulation
The CRDAL system navigated through the latent design space more effectively than both SRL and RWL.
Empirical analysis on the battery pack design task comparing latent-space trajectories/exploration between CRDAL, SRL, and RWL; details on how 'more effectively' was quantified and sample size are not provided in the excerpt.
high positive Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regul... quality/coverage of exploration in latent design space
The CRDAL system achieves better design performance without significantly increasing the computational cost compared to SRL and RWL.
Empirical claim based on experiments on the battery pack design problem comparing computational cost across CRDAL, SRL, and RWL; exact computational metrics and sample size not provided in the excerpt.
high positive Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regul... computational cost (efficiency/resource usage) of design-generation process
In the battery pack design problem examined here, the CRDAL system generates designs with better performance compared to a plain Ralph Wiggum Loop (RWL) and the metacognitively self-assessing Self-Regulation Loop (SRL).
Empirical comparison on a battery pack design task between CRDAL, SRL, and RWL reported in the paper; exact number of test instances or runs not stated in the excerpt.
high positive Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regul... design performance (battery pack designs)
We propose a novel Co-Regulation Design Agentic Loop (CRDAL), in which a Metacognitive Co-Regulation Agent assists the Design Agent in metacognition to mitigate design fixation.
Methodological contribution presented in the paper (proposed system architecture). No empirical sample size reported for the proposal itself.
high positive Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regul... proposed agent architecture (Co-Regulation Design Agentic Loop)
We propose a novel Self-Regulation Loop (SRL), in which the Design Agent self-regulates and explicitly monitors its own metacognition.
Methodological contribution presented in the paper (proposed system architecture). No empirical sample size reported for the proposal itself.
high positive Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regul... proposed agent architecture (Self-Regulation Loop)
When models are used in research, potential threats to inference should be systematically identified alongside the steps taken to mitigate them, and specific justifications for model selection should be provided.
Prescriptive recommendation in the paper (normative guidance) based on the authors' analysis of threats to inference; no empirical testing reported in abstract.
high positive How Open Must Language Models be to Enable Reliable Scientif... transparency and robustness of research inferences / research practices
The inferential issues that closed models present can be resolved or mitigated by certain measures.
Paper's analytic discussion of mitigation strategies and ways to resolve or reduce threats to inference; no empirical validation or quantified results provided in the abstract.
high positive How Open Must Language Models be to Enable Reliable Scientif... reliability of inference after mitigation
EcoThink offers a scalable path toward a sustainable, inclusive, and energy-efficient generative AI Agent.
Concluding claim in the paper asserting broader impact and scalability of the proposed method (position/interpretive claim based on reported results).
high positive EcoThink: A Green Adaptive Inference Framework for Sustainab... scalability / potential for adoption toward sustainable AI agents
Extensive evaluations were performed across 9 diverse benchmarks.
Statement in the paper that evaluations were run on 9 benchmarks (as stated in the abstract).
high positive EcoThink: A Green Adaptive Inference Framework for Sustainab... evaluation scope (number of benchmarks)
EcoThink employs a lightweight, distillation-based router to dynamically assess query complexity, skipping unnecessary reasoning for factoid retrieval while reserving deep computation for complex logic.
Methodological description of the proposed framework in the paper (design/architecture claim).
high positive EcoThink: A Green Adaptive Inference Framework for Sustainab... query-routing decision to skip or use deep reasoning
EcoThink reduces inference energy by up to 81.9% for web knowledge retrieval.
Experimental result reported in the paper (maximum observed reduction for the web knowledge retrieval benchmark, as stated in the abstract).
high positive EcoThink: A Green Adaptive Inference Framework for Sustainab... inference energy (web knowledge retrieval)
EcoThink reduces inference energy by 40.4% on average across 9 diverse benchmarks.
Experimental evaluations reported in the paper across 9 benchmarks comparing inference energy of EcoThink versus baseline (as stated in the abstract).
Policy efficacy varies significantly across corporate profiles, with the strongest effects observed in non-state-owned enterprises, high-tech firms, and firms located in eastern regions.
Heterogeneity analyses reported in the study (subgroup analysis by ownership, technology intensity, and geographic region).
high positive The Impact of Digital Economy Pilot Zones on Corporate New Q... heterogeneous policy impact on corporate NQPF across firm subgroups
The estimated positive effect of the pilot zones on corporate NQPF is robust across a comprehensive battery of robustness and endogeneity tests.
Paper reports multiple robustness and endogeneity checks (details not provided in abstract) that reportedly do not overturn main findings.
high positive The Impact of Digital Economy Pilot Zones on Corporate New Q... robustness of estimated policy effect on NQPF