Evidence (4175 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	758	199	100	900	2007
Governance & Regulation	826	400	191	122	1563
Organizational Efficiency	777	193	124	84	1189
Technology Adoption Rate	635	233	124	97	1098
Research Productivity	422	128	57	336	954
Output Quality	476	179	59	47	761
Decision Quality	328	177	81	47	640
Firm Productivity	435	57	88	20	606
AI Safety & Ethics	218	277	65	33	599
Market Structure	180	170	123	24	502
Task Allocation	213	64	72	33	387
Skill Acquisition	170	61	61	17	309
Innovation Output	203	27	43	18	292
Employment Level	105	54	107	13	281
Fiscal & Macroeconomic	131	69	43	26	276
Consumer Welfare	117	63	42	11	233
Firm Revenue	153	48	26	3	230
Task Completion Time	173	31	8	12	225
Inequality Measures	44	122	49	6	221
Worker Satisfaction	89	65	22	12	188
Error Rate	69	92	10	2	173
Regulatory Compliance	77	69	14	5	165
Automation Exposure	56	56	26	13	154
Training Effectiveness	94	21	13	19	149
Wages & Compensation	77	36	25	6	144
Team Performance	86	17	27	10	141
Developer Productivity	95	17	14	6	133
Job Displacement	12	80	20	1	113
Hiring & Recruitment	52	7	8	3	70
Creative Output	31	18	8	3	61
Skill Obsolescence	5	46	6	1	58
Social Protection	27	16	8	2	53
Labor Share of Income	17	19	17	—	53
Worker Turnover	11	12	—	3	26
Industry	—	—	—	1	1

Org Design Remove filter

Principal barriers to DT adoption include paper‑based or legacy regulatory/compliance processes that slow digitisation.

Findings from reviewed studies noting regulatory and compliance processes as impediments to digital handover and automated workflows.

medium negative Digital Twins Across the Asset Lifecycle: Technical, Organis... regulatory/compliance digitisation level and its impact on adoption

Principal barriers to DT adoption include misaligned stakeholder incentives and fragmented project delivery models.

Synthesis of conceptual and case literature describing contractual and incentive misalignments that impede lifecycle data continuity.

medium negative Digital Twins Across the Asset Lifecycle: Technical, Organis... stakeholder incentive alignment / project delivery fragmentation

Principal barriers to DT adoption include low digital maturity and uneven capabilities across supply chains.

Recurring observations in the literature review about heterogeneous digital skills and maturity across firms in the supply chain.

medium negative Digital Twins Across the Asset Lifecycle: Technical, Organis... digital maturity/capability distribution across supply chain

Principal barriers to DT adoption include data quality and continuity problems at handover.

Thematic synthesis across reviewed literature reporting frequent issues with data quality and handover continuity between project phases.

medium negative Digital Twins Across the Asset Lifecycle: Technical, Organis... data quality/continuity issues at handover

Principal barriers to DT adoption include interoperability gaps and lack of standards.

Thematic findings from qualitative synthesis of the 160 reviewed studies (recurring theme across conceptual papers, case studies and pilots).

medium negative Digital Twins Across the Asset Lifecycle: Technical, Organis... presence of interoperability/standards barriers affecting adoption

ANN analysis ranks need-for-human-interaction barriers as the most important predictor of GAICS adoption outcome.

ANN feature-importance analysis reported in the paper that ranks predictors for adoption outcome and finds the human-interaction barrier as the top predictor; paper abstract does not include details on ANN implementation or sample characteristics.

medium negative Reimagining Stakeholder Engagement Through Generative AI: A ... GAICS adoption (likelihood/decision to adopt)

Harms from manipulation, harassment, and de‑anonymizing biometric data create negative social externalities (mental health impacts, discrimination); without regulation, platforms may under‑invest in protective measures.

Synthesis of harms and economic externality reasoning from the reviewed studies; claim is theoretical and policy‑oriented rather than empirically quantified in the paper.

medium negative Securing Virtual Reality: Threat Models, Vulnerabilities, an... social harms and degree of private investment in protections absent regulation (...

Ongoing operational costs for safe multi‑user VR services (model updates, policy tuning, user support, human moderators) raise marginal costs relative to less‑protected services.

Qualitative cost components identified in the literature and by the authors; no empirical cost accounting or per‑unit estimates provided.

medium negative Securing Virtual Reality: Threat Models, Vulnerabilities, an... marginal operational costs of providing protected VR services (conceptual)

Implementing TVR‑Sec requires upfront investments in secure hardware, AI monitoring engines, and moderation infrastructure, increasing entry costs for new VR platforms and favoring incumbents or well‑capitalized entrants.

Authors' economic analysis based on component cost categories identified across the reviewed literature; no quantitative cost estimates provided.

medium negative Securing Virtual Reality: Threat Models, Vulnerabilities, an... effect on entry costs and market concentration (proposed effect, not empirically...

At the organizational scale, AI adoption is constrained and shaped by compliance requirements, formal policies, and prevailing norms.

Participants' accounts in workshops (n=15) noting compliance and policy considerations; thematic analysis classified these as organizational-level constraints.

medium negative The Values of Value in AI Adoption: Rethinking Efficiency in... organizational-level constraints on adoption (compliance, policy, norms) and res...

Rapid coherence decay with thread depth suggests collective problem solving or consensus formation among these agents will be shallow and brittle.

Embedding-based coherence metrics demonstrating fast decline in similarity with increasing thread depth across the dataset; inferential claim about effects on deliberation and consensus processes.

medium negative What Do AI Agents Talk About? Emergent Communication Structu... coherence as a function of thread depth and inferred effect on multi-turn delibe...

Low emotional alignment and frequent affective redirection indicate human emotional contagion models may not apply to AI-agent interaction, which could produce unstable or counterintuitive coordination dynamics.

Emotion-classification results showing 32.7% mean self-alignment and 33% fear→joy response rate; theoretical interpretation comparing these patterns to human emotional contagion expectations.

medium negative What Do AI Agents Talk About? Emergent Communication Structu... emotional self-alignment and emotion transition rates; implication for coordinat...

Ritualized signaling could create apparent activity (volume, buzz) without substantive informational content, opening avenues for manipulation or mispriced assets.

Observed high rates of patterned/formulaic replies and concentrated non-informational activity patterns in Moltbook; inferential reasoning about how signal amplification without content could affect market perception and asset pricing.

medium negative What Do AI Agents Talk About? Emergent Communication Structu... volume of formulaic/ritualized activity and potential effect on perceived market...

High prevalence of formulaic comments (≈56%+) implies large volumes of low-information signaling that can degrade signal-to-noise ratio in information environments, harming price discovery and liquidity forecasting.

Empirical observation of >56% formulaic comments via lexical-pattern analysis, combined with theoretical inference about information quality and market microstructure (argument linking high low-information reply volume to degraded signal-to-noise).

medium negative What Do AI Agents Talk About? Emergent Communication Structu... percentage of formulaic replies and inferred effect on information quality metri...

Instability of agent rankings across configurations makes procurement and deployment decisions based on narrow benchmarks risky; firms should evaluate agents under their own scaffolds, datasets, and workflows before committing.

Empirical finding of ranking instability across models, scaffolds, and datasets; methodological recommendation derived from that instability.

medium negative Re-Evaluating EVMBench: Are AI Agents Ready for Smart Contra... robustness_of_benchmark_based_procurement (risk_of_misleading_benchmarks)

Claims that AI will imminently replace human auditors are overstated; real-world economic benefits are more likely to come from complementary automation (breadth + triage) rather than full substitution.

Interpretation based on empirical failures in end-to-end exploitation, instability across configurations, and scaffold sensitivity observed in this study.

medium negative Re-Evaluating EVMBench: Are AI Agents Ready for Smart Contra... economic_value_of_automation (qualitative_assessment_of_substitution_vs_compleme...

Detection and exploitation rankings are unstable: rankings shift across model configurations, tasks, and datasets, so results are not robust to evaluation choices.

Observed variability in detection/exploitation rankings across the expanded matrix of models, scaffolds, and datasets in the study's experiments.

medium negative Re-Evaluating EVMBench: Are AI Agents Ready for Smart Contra... ranking_stability (consistency_of_model_rankings_across_configs_and_datasets)

Standardized platforms and benchmarks may create network effects and lock-in around dominant hardware–software stacks; antitrust and standards policy will matter to preserve competition.

Workshop participants' market-structure analysis and policy discussion included in the summary recommendations (NSF workshop, Sept 26–27, 2024).

medium negative Report for NSF Workshop on Algorithm-Hardware Co-design for ... market concentration metrics, prevalence of platform lock-in, and competition in...

Key technical and organizational risks include model brittleness, privacy and IP concerns in code generation (training-data provenance), and increased governance and QA burdens.

Literature review highlighting known risks and survey responses reporting practitioner concerns; no quantified incident rates provided.

medium negative Artificial Intelligence as a Catalyst for Innovation in Soft... reported incidence or concern levels about risks (qualitative)

Practitioners report barriers to adoption including integration costs, lack of trust/explainability, poor data quality, and skills gaps.

Thematic analysis / coding of open-ended survey responses and literature review identifying common adoption barriers; survey sample size not specified.

medium negative Artificial Intelligence as a Catalyst for Innovation in Soft... prevalence of reported barriers in survey responses

Prior work often conflates feedback source and feedback model; this study isolates them through controlled experiments.

Authors' literature review and the paper's experimental design explicitly constructed to disentangle source and model effects.

medium negative A Systematic Study of Pseudo-Relevance Feedback with LLMs Degree to which prior studies separate PRF design dimensions (methodological ass...

QCSC systems are capital- and skill-intensive, favoring well-resourced incumbents (large tech firms, national labs, major pharma/materials companies), potentially increasing concentration in compute-enabled domains.

Economic and industry-structure reasoning based on anticipated capital costs, specialized skills required, and comparison to existing capital-intensive compute infrastructures; no empirical market-share data.

medium negative Reference Architecture of a Quantum-Centric Supercomputer market concentration and firm advantage in compute-enabled R&D domains

Recent quantum advantage demonstrations for quantum-system simulation show utility, but practical applied research requires hybrid workflows that neither QPUs nor classical HPC can efficiently execute alone.

Review and synthesis of published quantum-simulation demonstrations and known performance/scaling limits of classical HPC; qualitative analysis of hybrid algorithm requirements; no new experiments.

medium negative Reference Architecture of a Quantum-Centric Supercomputer ability of standalone QPUs or classical HPC to execute full applied-research hyb...

Under realistic limitations (distribution shift, very large prompt inventories, or severe cold starts), DPS’s realized rollout savings and performance gains may be reduced.

Authors list these scenarios as potential limitations and caveats in the Discussion/Limitations section; no quantification provided in the summary.

medium negative Dynamics-Predictive Sampling for Active RL Finetuning of Lar... magnitude of rollout savings and performance gains under adverse conditions

Expect diminishing returns from AI investments if parallel investments in organizational change and data governance are not made.

Synthesis of case evidence and theoretical argument: instances where additional AI investment produced limited marginal benefit absent organizational complements.

medium negative Optimizing integrated supply planning in logistics: Bridging... marginal returns to AI (performance per unit AI investment)

Legacy systems and siloed organizational structures produce persistent forecasting inaccuracies, operational disconnects, and constrained responsiveness.

Cross-case interview narratives documenting continued forecasting issues and operational misalignment in firms with legacy IT and functional silos.

medium negative Optimizing integrated supply planning in logistics: Bridging... forecasting accuracy, operational alignment, responsiveness (lead times)

MLOps and governance provisions shift costs from one-off implementation to ongoing maintenance, implying recurring costs that should be captured in economic evaluations.

Analytical/economic argument presented in the paper as an implication of including an MLOps layer (conceptual; no empirical cost accounting provided).

medium negative ALGORITHM FOR IMPLEMENTING AI IN THE MANAGEMENT LOOP OF SMES... cost structure (recurring maintenance costs vs one-off implementation costs)

Adoption complementarities (AI tools + developer skill + organizational processes) favor larger incumbents and well‑funded firms, possibly increasing concentration in tech sectors.

Theoretical argument about complementarities and returns to scale; illustrative examples; lacks firm‑level empirical testing.

medium negative How AI Will Transform the Daily Life of a Techie within 5 Ye... market concentration measures (market share, concentration ratios) and different...

In the near term, displacement risks concentrate on junior or highly routine roles; mobility and retraining will determine realized unemployment impacts.

Task automatability mapping indicating routine tasks more automatable and qualitative reasoning on labor mobility; no empirical unemployment projections.

medium negative How AI Will Transform the Daily Life of a Techie within 5 Ye... employment outcomes for junior/highly routine roles (displacement rates, unemplo...

Adoption will be heterogeneous: larger firms and well‑resourced teams will capture more gains earlier, producing competitive advantages.

Theoretical argument about adoption complementarities (AI tools + developer skill + organizational processes) and illustrative examples; no cross‑firm empirical analysis.

medium negative How AI Will Transform the Daily Life of a Techie within 5 Ye... heterogeneity in productivity gains and market advantage by firm size/resource l...

Differential adoption across firms (due to modular, scalable designs and data advantages) may create winner‑takes‑most effects and increase market concentration, benefiting early adopters with rich data/integration capabilities.

Market-structure claim supported by economic reasoning about scale and data advantages; no cross-firm empirical adoption study or market concentration time‑series is provided.

medium negative Next-Generation Financial Analytics Frameworks for AI-Enable... market concentration metrics (e.g., HHI), firm market shares, adoption timing di...

Organizations will incur additional governance and procurement costs (diversity audits, recalibration of reward models, multi-model infrastructures) to mitigate homogenization, shifting some economic benefits of AI toward governance spending.

Cost implication argued from the need for auditing and multi-model procurement described in recommendations; not supported by quantified cost analyses in the paper.

medium negative The Artificial Hivemind: Rethinking Work Design and Leadersh... governance and procurement costs associated with LLM deployment

Inter-model convergence undermines product differentiation across AI providers and could accelerate commoditization of base LLM outputs.

Market-structure inference built on empirical finding of high cross-model output similarity across 70+ models and theoretical discussion of vendor differentiation; no market-level price or adoption time-series analyzed in the paper.

medium negative The Artificial Hivemind: Rethinking Work Design and Leadersh... vendor product differentiation / commoditization of base outputs

Homogenized AI outputs reduce the value of AI as a source of varied cognitive complements to human labor, potentially lowering productivity gains from human–AI collaboration in tasks requiring creativity and exploration.

Economic argument drawing on measured decreases in model output diversity and theoretical literature on complementarities between diverse AI outputs and human creativity; no direct measured productivity changes reported in field settings within the paper.

medium negative The Artificial Hivemind: Rethinking Work Design and Leadersh... productivity gains from human–AI collaboration (theoretical implication inferred...

Reward-model and evaluation miscalibration can cause organizations to prefer models that maximize apparent evaluation scores at the expense of useful stylistic or cognitive diversity.

Comparative analyses between automated evaluation/reward-model rankings and human preference/diversity assessments reported in the paper; examples where high-scoring models produced more consensus-style outputs.

medium negative The Artificial Hivemind: Rethinking Work Design and Leadersh... model selection bias driven by automated evaluation scores; reduction in diversi...

Homogenized outputs increase organizational susceptibility to groupthink and correlated errors across teams using different models.

Argument based on observed inter-model convergence (high similarity across models) implying correlated outputs and thus correlated mistakes across teams; no randomized organizational field experiment reported, this is an inferred risk from the empirical convergence data.

medium negative The Artificial Hivemind: Rethinking Work Design and Leadersh... risk of correlated errors / susceptibility to groupthink (conceptual risk inferr...

Homogenization of LLM outputs erodes creative diversity in AI-assisted work and reduces the variety of solutions produced.

Inference drawn from measured decreases in response diversity (entropy/distinct-n) and the observed inter-model convergence across real-world queries; argument linking lower measured diversity to fewer distinct solution proposals in AI-augmented workflows.

medium negative The Artificial Hivemind: Rethinking Work Design and Leadersh... creative diversity / number of distinct solution variants produced

Current reward models and automated evaluation metrics are biased toward consensus/high-probability responses, preferring consensus-style outputs even when stylistically diverse alternatives are judged equally high-quality by humans.

Reported human preference assessments and comparisons between human judgments and automated/reward-model scores showing cases where reward models favor higher-probability/consensus outputs despite no human-quality advantage; analyses described comparing reward-model scores to human judgments on stylistically diverse outputs.

medium negative The Artificial Hivemind: Rethinking Work Design and Leadersh... alignment between reward-model/automated evaluation scores and human quality jud...

Introducing ‘agent capital’ (AI that lowers coordination costs) reduces coordination costs inside firms (coordination compression).

Definition and central assumption of the paper's formal task-based model; analytical setup assumes agent capital parametrically reduces coordination frictions.

medium negative AI as Coordination-Compressing Capital: Task Reallocation, O... coordination costs (firm-internal coordination friction parameter)

Uneven inclusion in digital/AI deployments risks exacerbating digital divides and creating distributional harms.

Descriptive and case-based studies report differential access and uptake among demographic groups; limited causal quantification and varying measurement approaches across studies.

medium negative Digital Transformation and AI Adoption in Government: Evalua... service coverage across demographic groups, measures of digital divide (access, ...

Limited auditability and explainability of AI systems increase trust and legitimacy risks.

Technical governance literature and case reports show challenges in model explainability and external audit; evidence is technical and illustrative rather than based on large-sample causal studies.

medium negative Digital Transformation and AI Adoption in Government: Evalua... auditability metrics, transparency indicators, public trust measures

Inadequate regulatory frameworks raise privacy, accountability, and fairness concerns for AI in government.

Governance reviews and risk assessments documented in the literature highlight regulatory gaps and associated incidents/risks; empirical incident counts are not comprehensively tabulated in the review.

medium negative Digital Transformation and AI Adoption in Government: Evalua... privacy breaches, accountability/audit findings, measures of fairness/bias incid...

Procurement, budgeting rules, and siloed incentives discourage cross-cutting transformation and modular iterative deployments.

Policy and institutional analyses in the reviewed literature point to rigid procurement cycles, capital budgeting practices, and siloed funding as obstacles; examples and case narratives are provided but systematic quantification is limited.

medium negative Digital Transformation and AI Adoption in Government: Evalua... frequency of modular/iterative procurements, number of cross-cutting projects fu...

Organizational resistance and fragmented coordination block integrated rollouts of cross-cutting digital reforms.

Qualitative case studies and governance analyses repeatedly identify intra-governmental silos, conflicting incentives, and change-resistance as implementation barriers; evidence is primarily descriptive.

medium negative Digital Transformation and AI Adoption in Government: Evalua... degree of cross-agency integration, completion rates of integrated projects, imp...

Skills shortages (technical, managerial, data literacy) impede adoption and maintenance of digital and AI systems.

Multiple surveys, policy briefs and qualitative studies cited in the review report workforce capacity gaps; often based on targeted assessments or organizational audits rather than representative sampling.

medium negative Digital Transformation and AI Adoption in Government: Evalua... adoption rates, system maintenance capacity, time-to-value for deployments

Infrastructure deficits (connectivity, legacy systems) limit scale and reliability of digital/AI initiatives.

Recurring barrier documented across governance analyses and case studies; evidence includes reports of downtime, integration failures, and limited geographic reach; no unified cross-study sample provided.

medium negative Digital Transformation and AI Adoption in Government: Evalua... system reliability/uptime, scalability, geographic/service coverage

Scalability and rapid model improvements provided by cloud vendors are harder to capture on-premise.

Comparative discussion in TOE analysis about vendor-managed continuous model improvements and cloud scalability versus on-prem constraints; not backed by longitudinal empirical comparison in the summary.

medium negative An Empirical Study on the Feasibility Analysis of On-Premise... ability to capture rapid model improvements and scalability

Loss of control over research data impedes local capture of value (knowledge, IP, downstream services) and can create externalities when data are repurposed or commercialized without equitable benefit sharing.

Conceptual argument grounded in case observations about data flows and provider practices; no quantitative measures of value capture provided.

medium negative Emerging ethical duties in AI-mediated research: A case of d... local value capture; intellectual property and benefit sharing

Dominant AI/cloud providers become de facto gatekeepers of data processing and storage; researchers and institutions, particularly in lower‑capacity jurisdictions, have limited bargaining power to enforce data‑sovereignty or transparency terms.

Mapping of third‑party dependencies and interview/observational evidence of institutional procurement constraints in the Chile case; normative discussion of market power implications.

medium negative Emerging ethical duties in AI-mediated research: A case of d... bargaining power; market gatekeeping

Algorithmic opacity and cross‑border regulatory fragmentation raise monitoring, compliance, and contractual costs for collaborative research, effectively increasing the transaction costs of data‑intensive science.

Analytical inference from qualitative findings (opacity, legal fragmentation) and normative economic reasoning presented in the implications section; no quantitative transaction‑cost measurement reported.

medium negative Emerging ethical duties in AI-mediated research: A case of d... transaction costs; monitoring and compliance costs

« Prev 1 2 3 … 61 62 63 … 83 84 Next »