The Commonplace
Home Dashboard Papers Evidence Digests 🎲

Evidence (2432 claims)

Adoption
5126 claims
Productivity
4409 claims
Governance
4049 claims
Human-AI Collaboration
2954 claims
Labor Markets
2432 claims
Org Design
2273 claims
Innovation
2215 claims
Skills & Training
1902 claims
Inequality
1286 claims

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome Positive Negative Mixed Null Total
Other 369 105 58 432 972
Governance & Regulation 365 171 113 54 713
Research Productivity 229 95 33 294 655
Organizational Efficiency 354 82 58 34 531
Technology Adoption Rate 277 115 63 27 486
Firm Productivity 273 33 68 10 389
AI Safety & Ethics 112 177 43 24 358
Output Quality 228 61 23 25 337
Market Structure 105 118 81 14 323
Decision Quality 154 68 33 17 275
Employment Level 68 32 74 8 184
Fiscal & Macroeconomic 74 52 32 21 183
Skill Acquisition 85 31 38 9 163
Firm Revenue 96 30 22 148
Innovation Output 100 11 20 11 143
Consumer Welfare 66 29 35 7 137
Regulatory Compliance 51 61 13 3 128
Inequality Measures 24 66 31 4 125
Task Allocation 64 6 28 6 104
Error Rate 42 47 6 95
Training Effectiveness 55 12 10 16 93
Worker Satisfaction 42 32 11 6 91
Task Completion Time 71 5 3 1 80
Wages & Compensation 38 13 19 4 74
Team Performance 41 8 15 7 72
Hiring & Recruitment 39 4 6 3 52
Automation Exposure 17 15 9 5 46
Job Displacement 5 28 12 45
Social Protection 18 8 6 1 33
Developer Productivity 25 1 2 1 29
Worker Turnover 10 12 3 25
Creative Output 15 5 3 1 24
Skill Obsolescence 3 18 2 23
Labor Share of Income 7 4 9 20
Clear
Labor Markets Remove filter
Rapid deployment of autonomous learners could accelerate displacement in affected sectors and widen inequality if gains concentrate among capital owners or platform providers.
Socioeconomic risk assessment and projection; conceptual and not empirically quantified in the paper.
speculative negative Why AI systems don't learn and what to do about it: Lessons ... displacement rates; inequality measures (e.g., Gini); concentration of gains
Faster, more generalist embodied AI could substitute for routine physical and social tasks, shifting human labor toward oversight, high-level planning, creativity, and flexible social cognition roles.
Labor-market impact hypothesis derived from automation literature; conceptual projection only.
speculative negative Why AI systems don't learn and what to do about it: Lessons ... occupational substitution rates; changes in labor demand composition
Unclear liability frameworks increase perceived and real costs and can slow adoption by hospitals and insurers.
Policy analyses and procurement narratives noting liability uncertainty cited as a barrier to procurement and deployment.
medium_high negative Human-AI interaction and collaboration in radiology: from co... time-to-adoption, procurement decisions citing liability concerns, insurance/cov...
Up-front implementation costs commonly include procurement, integration with PACS/EMR, UI/UX development, regulatory compliance, and staff training; recurring costs include monitoring, data labeling, software updates, and cybersecurity.
Implementation reports, vendor and hospital accounts, and qualitative studies documenting cost categories (specific dollar amounts vary across settings and are rarely published in detail).
medium_high negative Human-AI interaction and collaboration in radiology: from co... implementation capital expenditures, annual operating expenditures
These trends (job polarization and differential wage/mobility outcomes) may exacerbate economic disparities across regions.
Interpretation and projection based on the observed trends in the reviewed literature and reports; presented as a risk/implication rather than an empirically tested causal finding in the summary.
speculative negative Job Polarization in Solar Power Plants: A Systematic Literat... regional economic disparities (income inequality, regional employment quality di...
Without continuous support for upskilling/reskilling and inclusive policies, AI risks becoming a source of exclusion rather than an enabler of human advancement.
Normative conclusion derived from reviewed literature and thematic interpretation in the qualitative study (literature-based; evidence is secondary and not quantified).
speculative negative THE IMPACT OF ARTIFICIAL INTELLIGENCE IN THE WORKPLACE: OPPO... social inclusion versus exclusion related to AI adoption
Research literature synthesis demonstrates 70-75% automation potential.
Quantitative estimate offered by the authors (70-75%) as part of function-by-function analysis; no described empirical evaluation or sample supporting the figure.
speculative negative Are Universities Becoming Obsolete in the Age of Artificial ... percent automation potential for research literature synthesis
Knowledge transmission (teaching/lecturing) shows 75-80% AI substitutability.
Authors' quantitative estimate presented in the analysis (75-80%); the paper does not detail empirical methods or validation samples for this percentage.
speculative negative Are Universities Becoming Obsolete in the Age of Artificial ... percent substitutability/automation potential of knowledge transmission
Administrative tasks face 75-80% disruption risk from AI.
Paper provides a quantitative estimate (75-80%) as part of its functional disruption assessment; no empirical methodology, dataset, or sample size is described to support the numeric range.
speculative negative Are Universities Becoming Obsolete in the Age of Artificial ... percent disruption/substitutability of administrative tasks
Over 400,000 [individuals] are projected to die before obtaining permanent residency.
Mortality projection applied to the estimated backlog and projected wait times (authors' projection); exact demographic assumptions (age distribution, mortality rates) and method are not provided in the excerpt.
speculative negative The United States' Employment-Based Immigration System: An... Number of backlog applicants projected to die before receiving green cards
The remaining difference (roughly 70%) is not explained by the factors observed in the data, indicating additional influences not captured in the survey.
Residual (unexplained) component from decomposition analyses on ESJS data.
medium-high negative Squandered skills? Bridging the digital gender skills gap fo... Unexplained share (%) of the gender gap in advanced digital task use
The United States shows a more market-driven (firm-dominated) patenting profile and comparatively weaker integration between AI and robotics patent trajectories.
Country-level and actor-type decomposition for U.S. patent filings (1980–2019), showing higher firm share of patents and weaker long-run association/cointegration between core AI and AI-enhanced robotics series compared with China (as reported in the paper).
medium-high negative The "Gold Rush" in AI and Robotics Patenting Activity. Do in... share of patents by firms in U.S.; strength of long-run integration between U.S....
Improving photorealism with objective color-fidelity metrics and refinement reduces the need for manual color correction and retouching in downstream workflows.
Paper and summary argue this as an implication: higher-fidelity outputs from CFR/CFM reduce manual editing demand. This is an economic/market implication rather than a directly evidenced experimental result in the paper (no labor-market causal study reported).
speculative negative Too Vivid to Be Real? Benchmarking and Calibrating Generativ... demand for manual color correction / retouching services
If FDI brings capital‑intensive, AI‑enabled production without complementary upskilling, it may exacerbate wage inequality and deepen labor market dualism in SSA.
Theoretical inference and analogy from documented patterns of skill‑biased technological change and FDI-driven inequality in the reviewed literature; empirical evidence specific to AI in SSA is lacking in the review.
speculative negative Foreign Direct Investment, Labor Markets, and Income Distrib... wage inequality, labor market dualism, employment composition
Full replacement of physicians would require breakthroughs in robust generalization, embodied capabilities, and legal/regulatory change—currently lacking.
Conceptual inference based on documented limitations (OOD generalization, lack of embodied/sensorimotor capability, unsettled legal/regulatory environment) summarized in the review.
speculative negative Will AI Replace Physicians in the Near Future? AI Adoption B... feasibility/timeline for physician replacement
Centralized provision of high-quality coding models by a few vendors could produce vendor lock-in and increase platform power in software development inputs.
Market-structure analysis and industry observations synthesized in the paper; the claim is forward-looking and not established by longitudinal market data within the review.
speculative negative ChatGPT as a Tool for Programming Assistance and Code Develo... market concentration measures (e.g., HHI), indicators of vendor lock-in (switchi...
This reversal of the burden of proof creates moral-hazard-like behavior: incentives for speed reduce verification effort.
Theoretical argument built on the micro-coercion mechanism and economic reasoning; no empirical validation provided.
speculative negative Overton Framework v1.0: Cognitive Interlocks for Integrity i... verification effort per artifact (e.g., reviewer time), proportion of unchecked ...
Under time pressure, developers adopt an implicit default of accepting plausible machine outputs unless they can disprove them (the 'micro-coercion of speed'), effectively reversing the burden of proof.
Behavioral mechanism posited from descriptive reasoning and thought experiments; no behavioral experiments, surveys, or observational data reported.
speculative negative Overton Framework v1.0: Cognitive Interlocks for Integrity i... developer acceptance rate of machine-generated outputs under time pressure; rate...
Concentration risks exist because high fixed costs for safe integration and model adaptation may favor larger incumbents or platform providers.
Conceptual economic reasoning and practitioner commentary synthesized in the review; no empirical market-structure analysis or sample-based evidence included here.
speculative negative The Effectiveness of ChatGPT in Customer Service and Communi... market concentration indicators and barriers to entry related to AI integration ...
Imported AI systems may impose foreign values and norms, risking erosion of indigenous knowledge and social cohesion.
Normative and conceptual argument supported by cited case studies and policy analyses; no original anthropological or sociological fieldwork in the paper.
low-medium negative Towards Responsible Artificial Intelligence Adoption: Emergi... indicators of indigenous knowledge retention, measures of cultural alignment of ...
Deployed AI systems can produce algorithmic bias that harms marginalized groups when models are trained on skewed or non‑representative data.
Synthesis of prior empirical findings and case studies on algorithmic bias and fairness in ML systems; paper does not present new empirical tests.
medium-high negative Towards Responsible Artificial Intelligence Adoption: Emergi... fairness metrics, disparate error rates, incidence of discriminatory outcomes fo...
Human reviewers may over-trust machine-generated language and explanations (automation bias), reducing the likelihood of detecting fraudulent outputs.
Reference to automation-bias literature and conceptual examples; threat modeling and illustrative vignettes in the article.
medium-high negative Prompt Engineering or Prompt Fraud? Governance Challenges fo... detection rate of fraudulent outputs by human reviewers when outputs are machine...
Existing internal audit and compliance frameworks focus on access, transaction, and system controls, not on content-generation integrity.
Literature and standards review combined with threat-control mapping demonstrating gaps in content/provenance coverage.
medium-high negative Prompt Engineering or Prompt Fraud? Governance Challenges fo... coverage of content-generation integrity within existing audit/compliance framew...
Using calibrated, employee-level predictions enables marginal-cost analyses and prioritization (micro-targeting) to improve retention-efficiency versus uniform, across-the-board policies.
Methodological argument: calibrated individual probabilities plus counterfactual impact estimates enable ranking employees by expected gain from interventions and thus marginal-cost prioritization (no empirical cost–benefit calculations provided).
speculative null result Explainable AI for Employee Retention in Green Human Resourc... potential efficiency gains in retention resource allocation (theoretical outcome...
There are research opportunities to measure returns to 'teaching' (causal impact of configuring agents on human skill accumulation and earnings) and to model agent-platform ecosystems with network effects, spillovers, and endogenous quality hierarchies.
Author-stated research agenda and proposed empirical questions derived from the observed phenomena; not empirical results but recommended directions.
speculative null result When Openclaw Agents Learn from Each Other: Insights from Em... need for future causal estimates of returns to teaching and formal models of eco...
Regulators and payers will require clinical validation, safety guarantees, and clear liability frameworks for human–AI shared decision-making before widescale deployment.
Policy implication stated in the paper's discussion section based on general regulatory considerations; not an empirical result from the study.
speculative null result Hierarchical Reinforcement Learning Based Human-AI Online Di... regulatory requirements / safety validation (anticipated, not measured)
Future research should explore sector-specific AI adoption challenges and long-term workforce adaptation strategies.
Author recommendation presented in the paper's discussion/future work section of the summary.
speculative null result Artificial intelligence and organisational transformation: t... N/A (recommended future research topics)
Key research priorities include improving measurement of AI usage across countries, causal identification of long-run effects, and sectoral reskilling strategy evaluation.
Identified gaps and methodological limitations in the reviewed empirical literature (measurement heterogeneity, limited long-run panels, sectoral variation) motivating suggested future research agenda.
speculative null result S-TCO: A Sustainable Teacher Context Ontology for Educationa... quality and scope of future empirical evidence on AI economic effects
To measure and monitor these effects, researchers should track firm-level adoption of AI features, fulfillment automation intensity, platform-mediated market entry, and task-level labor shifts.
Author recommendations based on gaps identified in the case-based and multi-modal empirical work and the sensitivity of results to adoption measures; not an empirical finding but a methodological claim.
speculative null result Artificial Intelligence–Enabled E-Commerce Systems and Autom... measurement coverage metrics (availability/quality of adoption and task-shift da...
Policy priorities should differ by national Skill Imbalance: countries with strong demand for new skills should prioritize education and reskilling, while countries with strong supply should prioritize firm absorption (innovation, financing, technology adoption).
Interpretation of cross-country Skill Imbalance Index and its implications; prescriptive recommendation based on the observed demand–supply patterns rather than causal testing of policies.
speculative null result Bridging Skill Gaps for the Future Policy emphasis (education/reskilling versus firm absorption) inferred from Skil...
The threshold for taxing AI may be crossed once AI becomes sufficiently capable in substituting humans across cognitive tasks.
Model-based comparative-static/threshold analysis showing that higher AI substitutability for cognitive tasks increases the likelihood that cognitive workers will consider switching to manual jobs, thereby meeting the model's tax-initiation condition.
speculative positive Workers' Incentives and the Optimal Taxation of AI whether/when the model's tax-initiation threshold is crossed as a function of AI...
Economic and organizational benefits (e.g., cost-effective retention, preserved human capital for environmental innovation) are plausible outcomes of applying the approach, but require further causal and cost analyses.
Paper discusses implications and hypothesizes ROI from reduced turnover (less recruiting/onboarding/productivity loss) and preservation of green capabilities; no empirical cost or productivity data provided in the presented summary.
speculative positive Explainable AI for Employee Retention in Green Human Resourc... organizational outcomes: turnover costs avoided, retained human capital, product...
Realizing net societal gains from AI requires human-centered design, regulatory and control measures, and integration of sustainability indicators into technological development.
Normative conclusion drawn from the narrative review of interdisciplinary evidence and policy recommendations; not an empirically validated claim within this paper.
speculative positive The Evolution and Societal Impact of Artificial Intelligence... net societal welfare/benefits conditional on governance, design, and sustainabil...
If banks operationalize NLP for personalization and acquisition at scale, this could increase differentiation, raise switching costs, and potentially affect market concentration—warranting antitrust monitoring.
Theoretical implication extrapolated from identified capability gaps and economic reasoning about differentiation, switching costs, and scaling advantages; not empirically tested in the reviewed papers.
speculative positive Natural language processing in bank marketing: a systematic ... market structure indicators (differentiation, switching costs, market concentrat...
Limited applied research on NLP for acquisition and personalization implies unrealized value in banking: NLP could enable more efficient, targeted customer acquisition and cross‑sell, potentially lowering customer‑acquisition cost (CAC) and increasing lifetime value (LTV).
Inference drawn from observed topical gaps (low article counts on acquisition/personalization) and standard marketing economics linking targeting/personalization to CAC and LTV; no direct causal evidence provided in the reviewed literature.
speculative positive Natural language processing in bank marketing: a systematic ... customer‑acquisition cost (CAC), customer lifetime value (LTV), acquisition effi...
A concrete empirical test recommended by the paper is to run controlled comparisons of distribution-shift generalization between negative-only, preference-only, and hybrid-trained models across safety and usefulness metrics.
Methodological recommendation given in the paper; it is not an empirical result but an explicitly proposed verifiable experiment for future work.
speculative positive Via Negativa for AI Alignment: Why Negative Constraints Are ... relative generalization performance (safety and usefulness) under distribution s...
Regulators could feasibly focus on certifying constraint datasets and testing model adherence to explicit prohibitions, since constraint compliance is empirically testable and verifiable.
Policy recommendation derived from the paper's epistemic argument about constraints being verifiable; presented as a plausible regulatory strategy rather than one already validated by policy experiments.
speculative positive Via Negativa for AI Alignment: Why Negative Constraints Are ... feasibility and effectiveness of regulatory certification schemes for constraint...
There is a commercial opportunity for startups and vendors to specialize in 'constraint datasets' and constitutional-rule libraries as tradable assets.
Market/economic inference made from the technical claim that constraints are verifiable and reusable; no empirical industry survey data provided—this is a forward-looking implication.
speculative positive Via Negativa for AI Alignment: Why Negative Constraints Are ... emergence and market size of firms/products supplying constraint datasets and ru...
If negative/safety-focused signals are more sample- and compute-efficient for certain alignment goals, firms may reallocate labeling budgets away from costly preference elicitation toward collecting high-quality negative examples and rule sets.
Economic implication extrapolated from the paper's sample-efficiency claim; the paper reasons from technical sample-efficiency arguments and cited empirical parity but does not present market-level empirical data.
speculative positive Via Negativa for AI Alignment: Why Negative Constraints Are ... organizational allocation of labeling budget and labor-hours (shift in proportio...
Improved alignment can reduce harms from misinterpretation (incorrect decisions, misinformation), lowering downstream liability and reputational risk for vendors and customers.
Paper's safety and externalities discussion argues this as a likely consequence; the claim is theoretical and not supported by empirical incident data in the paper.
speculative positive A Context Alignment Pre-processor for Enhancing the Coherenc... error/externality rates, number of downstream incidents, liability/claims metric...
Providers may charge a premium for alignment-enabled API tiers or incorporate C.A.P. into enterprise plans because of additional compute per interaction, affecting pricing and unit economics.
Paper's pricing and costs discussion predicts potential monetization strategies and pricing experiments (A/B pricing, willingness-to-pay studies) but does not report market data.
speculative positive A Context Alignment Pre-processor for Enhancing the Coherenc... price differentials for alignment features, willingness-to-pay, revenue per user
C.A.P. has potential economic effects: it can reduce time lost to misinterpretation, thereby increasing effective throughput and productivity, though net gains depend on trade-offs with pre-processing overhead.
Economic implications section provides conceptual cost–benefit arguments and recommends pilot measurements (time saved, reduced human review cost) but provides no empirical economic measurement.
speculative positive A Context Alignment Pre-processor for Enhancing the Coherenc... time saved per session, throughput, reduction in correction cycles, net producti...
C.A.P. shifts interactions from one-way command-execution to two-way, partnership-style collaboration, increasing perceived partnerliness.
Theoretical argument drawing on cognitive science and Common Ground theory and proposed human-evaluation measures (satisfaction, perceived collaboration); no empirical human-subject results reported.
speculative positive A Context Alignment Pre-processor for Enhancing the Coherenc... perceived collaboration / user satisfaction / partnerliness ratings
C.A.P. improves long-term and dynamic dialogue alignment and reduces off-topic or mechanically incorrect responses.
Main argument of the paper based on the combined functions (expansion, weighted retrieval, alignment verification, clarification); the paper provides conceptual/theoretical justification but does not report large-scale empirical results.
speculative positive A Context Alignment Pre-processor for Enhancing the Coherenc... dialogue alignment metrics, off-topic response rate, correctness of responses
The benchmark provides a testbed useful for studying strategic behavior, coordination failures, and market-like interactions among agents, which can inform economic research and policy.
Paper claims the benchmark's multi-agent, strategic tasks can be used as experimental environments for economic and policy research; this is a normative claim supported by the benchmark's design rather than by empirical studies in the paper.
speculative positive The PokeAgent Challenge: Competitive and Long-Context Learni... utility of benchmark as a research/testbed for studying strategic/multi-agent ph...
Open-source orchestration lowers entry barriers, broadening participation and potentially compressing rents that would otherwise accrue to well-resourced incumbents.
Paper's discussion section argues that releasing orchestration and evaluation tools publicly reduces the technical overhead for entrants; this is a theoretical/observational claim rather than empirically measured in the paper.
speculative positive The PokeAgent Challenge: Competitive and Long-Context Learni... predicted change in barrier-to-entry and market rents (qualitative)
The clear performance gaps indicate high returns to specialized efforts (RL, domain-specific engineering) relative to generalist LLM-only approaches, shaping where teams invest labor and compute.
Paper links benchmarking results (performance gaps between baselines and humans) to economic implications, arguing specialization yields higher returns; this is an interpretive claim based on reported performance differentials.
speculative positive The PokeAgent Challenge: Competitive and Long-Context Learni... economic return on investment inference based on performance differences between...
Benchmarks like PokeAgent will reallocate researcher and industry attention toward multi-agent, partial-observability, and long-horizon planning problems—likely increasing funding and compute investment in RL and hybrid LLM+RL methods.
Paper offers an economic/implication analysis arguing that introducing such a benchmark changes incentives and investment patterns; this is a reasoned projection rather than an empirical observation.
speculative positive The PokeAgent Challenge: Competitive and Long-Context Learni... predicted shifts in researcher/industry attention and investment (qualitative fo...
Public investment in open environments, robotics testbeds, and safety research can reduce concentration risks and externalities and democratize access to embodied AI research.
Policy recommendation based on anticipated strategic importance of shared infrastructure; not empirically validated here.
speculative positive Why AI systems don't learn and what to do about it: Lessons ... accessibility of research infrastructure; distribution of research capabilities ...
Value in the AI ecosystem may shift from passive text/image corpora toward rich interaction datasets and simulated/real environments; ownership and control of simulation platforms and testbeds could become strategically important assets.
Economic and strategic inference from the proposed technical emphasis on embodied/interaction learning; no supporting market data in the paper.
speculative positive Why AI systems don't learn and what to do about it: Lessons ... asset valuations for simulation/testbed providers; transaction volumes for inter...