The Commonplace
Home Dashboard Papers Evidence Syntheses Digests 🎲

Evidence (7953 claims)

Adoption
5539 claims
Productivity
4793 claims
Governance
4333 claims
Human-AI Collaboration
3326 claims
Labor Markets
2657 claims
Innovation
2510 claims
Org Design
2469 claims
Skills & Training
2017 claims
Inequality
1378 claims

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome Positive Negative Mixed Null Total
Other 402 112 67 480 1076
Governance & Regulation 402 192 122 62 790
Research Productivity 249 98 34 311 697
Organizational Efficiency 395 95 70 40 603
Technology Adoption Rate 321 126 73 39 564
Firm Productivity 306 39 70 12 432
Output Quality 256 66 25 28 375
AI Safety & Ethics 116 177 44 24 363
Market Structure 107 128 85 14 339
Decision Quality 177 76 38 20 315
Fiscal & Macroeconomic 89 58 33 22 209
Employment Level 77 34 80 9 202
Skill Acquisition 92 33 40 9 174
Innovation Output 120 12 23 12 168
Firm Revenue 98 34 22 154
Consumer Welfare 73 31 37 7 148
Task Allocation 84 16 33 7 140
Inequality Measures 25 77 32 5 139
Regulatory Compliance 54 63 13 3 133
Error Rate 44 51 6 101
Task Completion Time 88 5 4 3 100
Training Effectiveness 58 12 12 16 99
Worker Satisfaction 47 32 11 7 97
Wages & Compensation 53 15 20 5 93
Team Performance 47 12 15 7 82
Automation Exposure 24 22 9 6 62
Job Displacement 6 38 13 57
Hiring & Recruitment 41 4 6 3 54
Developer Productivity 34 4 3 1 42
Social Protection 22 10 6 2 40
Creative Output 16 7 5 1 29
Labor Share of Income 12 5 9 26
Skill Obsolescence 3 20 2 25
Worker Turnover 10 12 3 25
In the sentiment-analysis task, those individual differences do not produce human–AI complementarity: the joint performance of humans and AI did not exceed that of either alone.
Empirical finding reported from the preregistered sentiment-analysis experiment showing no complementarity effect (joint human-AI performance ≤ best individual performance). (Statistical tests and sample size not included in the excerpt.)
medium null result Who Needs What Explanation? How User Traits Affect Explanati... human–AI joint performance compared to human-alone and AI-alone performance (e.g...
We conducted a systematic review and meta-analysis of the literature on AI/HR analytics and organizational decision making, using 85 publications and grounding the work in theories of algorithm-automated decision-making (AST) and matching/hybrid models (STS).
Paper's methods: systematic review and meta-analysis; sample = 85 publications; theoretical framing explicitly stated as AST and STS.
medium null result ALGORITHMIC DETERMINISM VERSUS HUMAN AGENCY: A SYSTEMATIC RE... scope/coverage of literature (number of publications reviewed); theoretical fram...
Macroeconomic fiscal moderation remains empirically unvalidated.
Synthesis conclusion from the review noting an absence of empirical evidence that Agentic AI produces macroeconomic fiscal moderation; i.e., no validated studies showing broad fiscal relief effects were identified in the reviewed literature.
medium null result Agentic AI for Ageing Healthcare Systems in Advanced Economi... macro-fiscal outcomes (e.g., national fiscal pressure, public expenditure modera...
By 2024 the RL-FRB/US model produced a federal budget deficit similar to the baseline: RL-FRB/US model: -1,767 trillion $ vs. FRB/US model: -1,758 trillion $.
Reported fiscal balance (federal budget deficit) simulation outputs for 2024 from comparative model runs in the paper.
medium null result Fiscal Policy Towards Optimizing Macroeconomic Indicators by... Federal budget deficit (trillion $) for 2024
No significant differences emerged in job titles and industry suggested by GPT-5 across genders.
Empirical finding from analysis of GPT-5 outputs comparing suggested job titles and industries for the 24 profiles; exact statistical tests not specified in the summary.
medium null result Gender Bias in Generative AI-assisted Recruitment Processes suggested job titles and industry assignments by GPT-5 across male and female pr...
Self-generated (model-authored) Skills provide no average benefit.
Comparison of three evaluation conditions (no Skills, curated Skills, self-authored Skills) across SkillsBench. Averaged pass-rate deltas show that model-authored Skills do not increase average pass rate relative to baseline; analysis used 7,308 trajectories over 86 tasks and 7 agent–model configurations.
medium null result SkillsBench: Benchmarking How Well Agent Skills Work Across ... task pass rate (average delta for self-authored Skills vs. baseline)
AI will not cause permanent mass unemployment at the aggregate level.
Analytical argument and literature synthesis using labor-economics theory (Skill-Biased Technological Change and structural transformation). No primary microdata, no stated empirical identification strategy or sample size in the paper (methodology appears to be theoretical and sectoral synthesis).
medium null result Artificial Intelligence, Automation, and Employment Dynamics... aggregate employment / unemployment
Empirical evaluation is needed on how AI-induced productivity gains translate into aggregate demand and labor absorption.
Identified research priority in the paper, based on theoretical uncertainty about demand-side labor absorption and lack of conclusive empirical evidence.
medium null result Artificial Intelligence, Automation, and Employment Dynamics... relationship between productivity gains from AI and aggregate demand/employment
AI will not mechanically cause permanent mass unemployment at the aggregate level.
Theoretical framing and synthesis of existing empirical findings across task-based and macro studies; no single new dataset provided (paper draws on literature and conceptual models).
medium null result Artificial Intelligence, Automation, and Employment Dynamics... aggregate employment / unemployment (long-run)
Occupation-level analyses (e.g., BLS OEWS cross-occupation wage regressions) risk misleading conclusions about AI’s distributional effects because they aggregate over the task- and firm-level heterogeneity that drives the mechanism.
Theoretical argument and empirical illustration in the paper showing how aggregation masks within-task compression and firm-level rent capture; example regressions on OEWS used to demonstrate the limitation.
medium null result When AI Levels the Playing Field: Skill Homogenization, Asse... accuracy of occupation-level analyses in capturing task-level mechanism (qualita...
Testing the model requires within-occupation, within-task panel data on task-level performance and wages linked to firm-level AI adoption, ownership of complementary assets, and measures of rent-sharing; such data are not available at scale.
Author statement about data requirements and current data limitations; empirical illustration and discussion note absence of large-scale linked microdata meeting these criteria.
medium null result When AI Levels the Playing Field: Skill Homogenization, Asse... availability of suitable microdata for empirical testing (data coverage / scale)
Occupation-level regressions using BLS OEWS (2019–2023) are insufficient for testing the model’s task-level predictions because aggregation across tasks and firms hides the mechanism.
Empirical illustration in the paper using occupation-level regressions on BLS OEWS 2019–2023 showing that such aggregates do not reveal within-occupation, within-task dispersion or firm-level rent concentration effects; paper argues this is a data-adequacy limitation.
medium null result When AI Levels the Playing Field: Skill Homogenization, Asse... ability of occupation-level regressions to detect task-level mechanism (qualitat...
A sensitivity decomposition shows five of the moments (the non‑ΔGini moments) identify internal mechanism rates (how AI changes task production, education responses, screening intensity) but do not determine the aggregate sign of inequality change.
Local identification / sensitivity decomposition performed on the calibrated model; decomposition results reported in the paper attribute mechanism-rate identification to five moments and show they leave the sign of ΔGini indeterminate.
medium null result When AI Levels the Playing Field: Skill Homogenization, Asse... identification of mechanism parameters versus determination of aggregate ΔGini s...
The paper introduces a novel taxonomy that separates patenting into three domains: core AI, traditional robotics, and AI-enhanced robotics.
Methodological contribution of the paper: construction and application of a classification scheme that assigns patent filings (1980–2019) into three domains (core AI, traditional robotics, AI-enhanced robotics). Data source: patent filings 1980–2019 (aggregate counts by domain and country). Exact number of patents not provided in the summary.
medium null result The "Gold Rush" in AI and Robotics Patenting Activity. Do in... categorization/classification of patent filings into three domains
The proposed uncertainty measure connects to classical value-of-information concepts, bridging security mechanism analysis and economic theories of information, signaling, and screening.
Analytical comparison and discussion in the paper linking the entropy-style residual uncertainty metric to value-of-information literature (theoretical linkage).
medium null result Evaluating Synthetic Cyber Deception Strategies Under Uncert... conceptual/analytical alignment between residual uncertainty metric and value-of...
AI did not significantly moderate the relationship between workplace stress and job performance.
Moderation test in PLS-SEM (SmartPLS 4.0) on N = 350; reported non-significant AI × Stress → Performance moderator (paper reports no significant moderating effect).
Use of AI raises needs for traceability, explainability, and continuous validation to maintain compliance and avoid error propagation in curricular decisions.
Paper's AI governance recommendations (prescriptive), referencing general AI risk principles rather than empirical study.
medium null result Curriculum engineering: organisation, orientation, and manag... traceability/explainability measures, validation frequency, incidence of propaga...
There is no accepted integrative digital model that maps measured or perceived value to algorithmic pricing.
Absence of such a model in the SLR sample of 30 articles and thematic coding that identified this gap explicitly.
medium null result Pricing Strategy in Digital Marketing: A Systematic Review o... Existence of integrative digital VBP model (mapping perceived value to algorithm...
There is no evidence of nonlinearities in the relationship between digital trade and urban house prices (the effect is linear across the sample).
Explicit tests for nonlinearity reported in the econometric analysis (details of test specification not provided in the summary).
medium null result Is digital trade affecting city house prices? An artificial ... city-level house prices
When green-technology innovation is low (below the threshold), the main measurable effect of DE is on improving carbon emission efficiency (CEE), but DE does not yet reduce per capita emissions (PCE).
Results from the threshold-regression models on the 278-city panel (2011–2022) show that in the low-green-innovation regime DE coefficients are significant for CEE but not for PCE; mediating-effect models corroborate the efficiency channel in low-innovation contexts.
medium null result Digital Economy, Green Technology Innovation and Urban Carbo... Carbon emission efficiency (CEE) and Per capita carbon emissions (PCE)
Realising DT value requires upfront investment in sensors, integration, standards, and skills; economic viability depends on contract structures and how gains are allocated between investors, owners, contractors, and operators.
Synthesis of cost/benefit discussions and case descriptions in the reviewed literature; policy and procurement examples referenced.
medium null result Digital Twins Across the Asset Lifecycle: Technical, Organis... investment requirements and determinants of economic viability
HCI has explored usable consent, but there is no systematic framework for consent in the AI era.
Literature synthesis and gap identification from workshop participants and solicited position papers; no systematic review or meta-analysis with counted studies reported in the summary.
medium null result Moving Beyond Clicks: Rethinking Consent and User Control in... existence of a systematic AI-era consent framework
Privacy-leak framing (risk vs ambiguity or privacy-threatening vs neutral) did not change participants' subsequent bargaining behavior with pricing algorithms.
The experiment measured downstream bargaining behavior with algorithms after the adoption/label tasks (N = 610) and reports no detectable effect of the privacy/leak framing on those bargaining outcomes.
medium null result The Data-Dollars Tradeoff: Privacy Harms vs. Economic Risk i... Bargaining behavior with pricing algorithms (choices/offer responses in the down...
Under truthful bidding, the decentralised price-based market matches a centralised value-optimal benchmark (i.e., decentralised allocation equals centralised value-optimal allocation).
Paper presents both a theoretical argument (mechanism properties under quasilinear utilities and discrete slices) and empirical validation in simulation by comparing decentralised outcomes to a centralised value-optimal baseline across configurations in the ablation study.
medium null result Real-Time AI Service Economy: A Framework for Agentic Comput... allocation value (total value/throughput) relative to a centralised value-optima...
No clear evidence that project phase systematically shifts sentiment perception.
Project-phase indicators were collected each round and included in correlation and repeated-measures analyses; no consistent, systematic association between project phase and sentiment labeling was found.
medium null result Exploring Indicators of Developers' Sentiment Perceptions in... sentiment label distribution across project phases
Predictors of negative labeling are weak and at best trend-level (e.g., task conflict shows only weak/trend-level association with negative labels).
Correlation analyses and GEE models testing multiple predictors (mood states, life circumstances, team dynamics including task conflict) on negative vs other labels; effects for negative labeling were small and lacked robustness.
medium null result Exploring Indicators of Developers' Sentiment Perceptions in... probability of labeling a statement as negative
Experiments used realistic channel and beamforming datasets reflecting varying elevation angles and dynamic LEO link conditions.
Dataset description in the paper states use of realistic channel and beamforming data including varying elevation angles and dynamic links; no dataset size or public dataset identifiers provided in the summary.
medium null result Federated Learning-driven Beam Management in LEO 6G Non-Terr... data realism and coverage of elevation-angle dynamics (qualitative)
There is a need for causal studies (randomized pilots, phased rollouts) to quantify net welfare effects including patient trust, equity, legal risk, and long-run labor impacts.
Authors' recommendation based on gaps identified in the mixed-methods evidence and acknowledged limitations around causal identification and long-term measurement.
medium null result The Role of Artificial Intelligence in Healthcare Complaint ... recommended outcomes for future causal evaluation (patient trust, equity metrics...
Under the current estimated parameters, dynamics converge toward equilibria—implying convergent, policy-mediated adjustment rather than endogenous cyclical instability.
Inference from stability classification (stable-node equilibria) and model dynamics simulated or linearized around equilibria using 2016–2023–estimated parameters.
medium null result Governance of Technological Transition: A Predator-Prey Anal... convergence behavior of model trajectories (toward equilibrium)
Equilibrium points of the estimated three-stock system are classified as stable nodes (no persistent endogenous cycles under the estimated parameters).
Stability analysis: equilibria computed from estimated parameters and local stability assessed via Jacobian eigenvalues; eigenvalues indicate stable nodes.
medium null result Governance of Technological Transition: A Predator-Prey Anal... equilibrium stability classification (eigenvalues of Jacobian)
Results are robust across alternative AI index specifications, occupational classifications, and standard controls (country and year fixed effects, macroeconomic covariates).
Paper reports robustness checks across different index constructions and occupational taxonomies, with standard controls included in regressions.
medium null result Artificial Intelligence and Labor Market Transformation: Emp... Stability of estimated effects (robustness of employment and wage estimates)
Liability for harm from AI remains unresolved; current regulatory frameworks (notably in the EU) continue to emphasize human responsibility and require conformity and clinical validation.
Regulatory and legal analyses, with emphasis on European Union device regulation and liability principles, as reviewed in the paper.
medium null result Will AI Replace Physicians in the Near Future? AI Adoption B... legal liability allocation; regulatory requirements for conformity and clinical ...
On-Premise RAG matches commercial (cloud) RAG on standard quantitative retrieval and generation metrics.
Empirical comparative analysis using standard retrieval/generation benchmarks comparing three systems (zero-shot baseline, GPT RAG cloud, Open-source On-Prem RAG) under representative SME workloads; specific metric names and sample sizes not reported in the summary.
medium null result An Empirical Study on the Feasibility Analysis of On-Premise... standard retrieval and generation metrics (quantitative performance of retrieval...
State-level advances in worker-protective AI measures exist but are uneven and many proposed state bills aimed at strengthening workers’ rights related to AI have stalled.
Review of state legislative proposals and enacted laws as compiled in the commentary (state-level policy scan); no systematic quantitative legislative count or sample reported.
medium null result AI governance under the second Trump administration: implica... status of state-level legislation regarding AI and worker protections (enacted v...
Domain adaptation techniques (transfer learning, fine-tuning on local data) are underutilized in low-resource African contexts despite their potential to improve generalization to local populations and care processes.
Thematic coding of methodological sections across the reviewed literature showed relatively few studies employing transfer learning or local fine-tuning approaches in African or other low-resource settings; evidence comes from counts/qualitative summaries within the literature review rather than a formal meta-analysis.
medium null result On the use of synthetic data for healthcare AI in Africa: Te... use of domain adaptation methods and resulting generalization/performance improv...
Research priorities include causal studies on productivity gains from AI, firm‑level adoption dynamics, sectoral labor reallocation, long‑run general equilibrium effects, and heterogeneous impacts across regions and demographic groups.
Set of empirical research recommendations drawn from gaps identified in the literature review and limitations section; not an empirical claim but a prioritized research agenda based on secondary evidence.
medium null result AI and Robotics Redefine Output and Growth: The New Producti... knowledge gaps to be addressed (research outcomes)
Growth‑accounting frameworks and measurement approaches must be updated to capture AI/robotics as intangible and embodied capital, including quality improvements and spillovers.
Methodological argument grounded in literature on measurement challenges and examples of intangible capital; no new measurement exercise or empirical re‑estimation is provided in the paper.
medium null result AI and Robotics Redefine Output and Growth: The New Producti... measurement accuracy of productivity accounts, capture of intangible capital and...
Backtesting the proposed models against historical technological transitions (e.g., ATMs, robotics) and recent AI adoption episodes can validate model performance.
Recommended validation strategy; paper does not report backtest results but prescribes holdout/pseudo‑counterfactual experiments and calibration with administrative outcomes.
medium null result Enhancing BLS Methodologies for Projecting AI's Impact on Em... backtest performance metrics (forecast errors, calibration statistics) when appl...
Scenario modelling in the reviewed literature typically uses counterfactual simulations with different adoption speeds, policy responses, and initial conditions to bound possible employment, wage, and productivity trajectories.
Description and citations of scenario-modelling practices by think tanks and organisations (TBI, IPPR, IMF) and academic work referenced; evidence is methodological and report-based.
medium null result Recent Methodologies on AI and Labour - a Desk Review range of projected employment/wage/productivity trajectories across scenarios
NLP/LLM pipelines are used to extract tasks and skills from free-text job ads and to map those tasks to AI capabilities.
Described methods and citations (Xu et al., 2025; Hampole et al., 2025); evidence is methodological application of transformer-based models to job-ad text in recent studies.
medium null result Recent Methodologies on AI and Labour - a Desk Review task/skill extraction performance and task-to-capability mapping
Methods increasingly apply advanced NLP and large language models (BERT, LSTM, GPT-4) to parse job descriptions, map skills/tasks, and predict automation risk.
Cited methodological examples in the paper (Xu et al., 2025; Hampole et al., 2025) and discussion of common pipelines using transformer-based models to extract tasks from free-text job ads and to map tasks to AI capabilities; evidence is methodological and based on recent studies rather than a single benchmarked dataset.
medium null result Recent Methodologies on AI and Labour - a Desk Review task/skill extraction and AI-exposure prediction accuracy from free-text job des...
Some functional domains show varying maturity: for example, procurement has more applied work compared with other functions.
Reviewer observation from the systematic search and screening across 2020–2025 literature noting uneven distribution of empirical/ applied studies across functions.
medium null result Integrating Artificial Intelligence and Enterprise Resource ... relative maturity (volume of applied studies or case evidence per functional dom...
A centralized policy engine for access control, data handling rules, and change management is a necessary control point in the reference pattern.
Prescriptive recommendation in the paper supported by best-practice synthesis and case anecdotes; no direct empirical comparison of centralized vs federated policy engines provided.
medium null result Governed Hyperautomation for CRM and ERP: A Reference Patter... effectiveness of access control and change management (e.g., policy violations, ...
Research gaps include the need for standardized evaluation metrics, robustness- and consistency-focused XAI methods, domain-informed explanation frameworks, and longitudinal/clinical impact studies.
Recommendations section of the review synthesizing recurring deficits across papers and proposing priorities.
medium null result Explainable Artificial Intelligence (XAI) for EEG Analysis: ... recommended research directions / missing evaluation components
Recommendation for research and modeling: economic models of AI markets should incorporate institutional regime types (centralized vs decentralized), enforcement uncertainty, and legitimacy effects as parameters affecting data access costs, R&D productivity, and market concentration.
Normative recommendation based on the comparative typology and inferred mechanisms from the document analysis; not empirically validated within the study.
medium null result Balancing openness and security in scientific data governanc... modeling parameters (regime type, enforcement uncertainty, legitimacy effects) a...
Theoretical contribution: the paper extends modular coordination theory by treating openness–security trade‑offs as layered, adaptive institutional processes embedded in political regimes and 'legitimacy economies.'
Argumentative/theoretical development in the paper grounded in document analysis and literature on coordination and legitimacy.
medium null result Balancing openness and security in scientific data governanc... theoretical framing / extension of modular coordination theory
Providing optional LLM access without training did not increase average exam scores versus no LLM access.
Intent-to-treat comparisons across randomized arms reported in the study: comparison of optional-access-without-training arm to no-access arm showed no average score improvement (sample n = 164).
medium null result Training for Technology: Adoption and Productive Use of Gene... Exam score (grade points)
Cross-border coordination is crucial because platform services and data flows often transcend jurisdictions.
Policy analysis and descriptive examples of cross-border platform operations in the reviewed literature; not empirically quantified in the paper.
medium null result Financial Inclusion in the Age of FinTech Platforms: Opportu... need for cross-border regulatory coordination (qualitative importance)
Standardized metrics for 'inclusive outcomes' are needed beyond account ownership—e.g., active usage, quality of credit, stability of access, and welfare effects.
Critical assessment of measurement shortcomings in existing financial inclusion literature; prescriptive recommendation rather than empirical evidence.
medium null result Financial Inclusion in the Age of FinTech Platforms: Opportu... measurement quality of inclusion metrics (active usage, credit quality, access s...
Realizing AI’s potential for circular-economy and energy-efficiency goals requires coordinated interventions across environmental regulation, digital infrastructure, and workforce skill formation.
Policy interpretation drawn from heterogeneity results (regulation and infrastructure amplify AI effects) and the identified labor-market mechanism (skill composition matters); recommendation rather than direct causal estimate.
medium null result Artificial intelligence, greening of occupational structure ... Policy-relevant intermediate outcomes (regulation strength, infrastructure level...