Evidence (6869 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	758	199	100	900	2007
Governance & Regulation	826	400	191	122	1563
Organizational Efficiency	777	193	124	84	1189
Technology Adoption Rate	635	233	124	97	1098
Research Productivity	422	128	57	336	954
Output Quality	476	179	59	47	761
Decision Quality	328	177	81	47	640
Firm Productivity	435	57	88	20	606
AI Safety & Ethics	218	277	65	33	599
Market Structure	180	170	123	24	502
Task Allocation	213	64	72	33	387
Skill Acquisition	170	61	61	17	309
Innovation Output	203	27	43	18	292
Employment Level	105	54	107	13	281
Fiscal & Macroeconomic	131	69	43	26	276
Consumer Welfare	117	63	42	11	233
Firm Revenue	153	48	26	3	230
Task Completion Time	173	31	8	12	225
Inequality Measures	44	122	49	6	221
Worker Satisfaction	89	65	22	12	188
Error Rate	69	92	10	2	173
Regulatory Compliance	77	69	14	5	165
Automation Exposure	56	56	26	13	154
Training Effectiveness	94	21	13	19	149
Wages & Compensation	77	36	25	6	144
Team Performance	86	17	27	10	141
Developer Productivity	95	17	14	6	133
Job Displacement	12	80	20	1	113
Hiring & Recruitment	52	7	8	3	70
Creative Output	31	18	8	3	61
Skill Obsolescence	5	46	6	1	58
Social Protection	27	16	8	2	53
Labor Share of Income	17	19	17	—	53
Worker Turnover	11	12	—	3	26
Industry	—	—	—	1	1

Governance Remove filter

The persistence of interpretive, human-in-the-loop evaluation implies ongoing labor requirements (annotation, sense-making, governance roles), affecting forecasts of automation and labor substitution in sectors adopting LLMs.

Interview reports describing continued manual work for evaluation tasks across participants; authors draw implications for labor demand.

medium negative Results-Actionability Gap: Understanding How Practitioners E... continued human labor requirements for evaluation

Environmental and informational externalities from AI (energy use, privacy harms, bias) justify regulatory and Pigouvian-style interventions to correct market failures.

Conceptual and policy literature reviewed, combined with empirical observations about environmental impacts and privacy/bias incidents reported in prior studies; the paper does not provide new causal estimates of externality magnitudes.

medium negative The Evolution and Societal Impact of Artificial Intelligence... externality magnitudes (environmental costs, privacy/bias harms) and welfare eff...

AI may alter firms' competitive dynamics by amplifying scale advantages and platform effects, making antitrust, data portability, and competition policy relevant to preserve contestability and innovation.

Synthesis of industrial organization theory and empirical observations of platform markets and data-driven firms cited in the literature review; no primary empirical study included in this paper.

medium negative The Evolution and Societal Impact of Artificial Intelligence... market concentration, competition levels, and innovation dynamics

Evaluation metrics for multi-hazard forecasting and decision tools should go beyond predictive accuracy to include calibration, sharpness, decision-relevance, fairness metrics, and economic utility loss.

Recommendations in the paper's implications section for AI economics and tool evaluation, based on stakeholder needs and decision-relevance considerations identified by MYRIAD-EU.

medium negative Reducing risk together: moving towards a more holistic appro... adoption of broader evaluation metrics for forecasting and decision-support tool...

Open, benchmarked multi-hazard datasets with standardized metadata and labels are needed to enable method comparison and transferability.

Concrete research/practice actions recommended in the synthesis; identification of data standardization and benchmarking gaps from project experience.

medium negative Reducing risk together: moving towards a more holistic appro... availability of open, benchmarked multi-hazard datasets with standardized metada...

Decision and valuation frameworks (e.g., cost–benefit and cost–effectiveness analyses) should be extended to multi-hazard contexts to account for cascading and correlated losses across sectors and time.

Implications for AI economics and concrete recommendations in the paper calling for extensions to existing economic evaluation frameworks to handle multi-hazard complexity.

medium negative Reducing risk together: moving towards a more holistic appro... suitability of economic decision frameworks for multi-hazard contexts

Early Career Researchers (ECRs) should be empowered through leadership roles and capacity-building within project structures to sustain interdisciplinary innovation.

Project recommendations emphasize ECR leadership and capacity-building as a priority based on internal reflection and organizational learning from MYRIAD-EU activities.

medium negative Reducing risk together: moving towards a more holistic appro... ECR leadership roles and capacity in interdisciplinary risk research

Development and operationalization of Multi-Hazard Early Warning Systems (MHEWS) require support, and MYRIAD-EU engaged practitioners and policymakers to evaluate MHEWS needs and operational uptake.

Project engagement activities with practitioners and policymakers reported to evaluate needs for MHEWS and their operational uptake; conclusions and recommendations for support included in the synthesis.

medium negative Reducing risk together: moving towards a more holistic appro... readiness and operational uptake of MHEWS

Equity considerations must be explicitly integrated into multi-hazard multi-risk research and practice to achieve equitable disaster risk reduction and adaptation.

Project emphasis on participatory approaches and stakeholder-derived qualitative data highlighting distributional vulnerability and equity concerns; recommendations for explicit equity integration.

medium negative Reducing risk together: moving towards a more holistic appro... degree of equity integration in DRR and adaptation processes

There is insufficient availability of appropriate, solutions-oriented, and user-friendly tools for practitioners and decision-makers; availability should be increased.

Tool development and iterative testing with end users within MYRIAD-EU, and stakeholder feedback pointing to a demand for more usable tools.

medium negative Reducing risk together: moving towards a more holistic appro... availability and usability of practitioner-facing decision-support tools

Methods are needed to generate both present-day and future multi-hazard and multi-risk scenarios that integrate climate, socio-economic change, and cascading effects.

Project development and testing of scenario methods reported, plus identification of remaining methodological gaps in scenario integration.

medium negative Reducing risk together: moving towards a more holistic appro... availability and quality of multi-hazard and multi-risk scenario generation meth...

Concepts, definitions, and terminologies for multi-hazard and multi-risk work must be mainstreamed and harmonized to enable comparability and communication across disciplines and stakeholders.

Stakeholder feedback and the project's synthesis of interdisciplinary outputs highlighting conceptual fragmentation and communication barriers.

medium negative Reducing risk together: moving towards a more holistic appro... comparability and clarity of concepts/terminology across disciplines and stakeho...

If quantum advantages accrue initially to well-capitalized incumbents (cloud providers, financial firms, pharmaceuticals), we should expect increased market power and higher rents.

Scenario analysis and historical analogs where early compute advantages concentrated market power; qualitative market-structure modeling.

medium negative Modeling Macroeconomic Output Gains from Quantum-Driven Prod... market concentration measures (e.g., market shares, rents), firm-level competiti...

Benefits of quantum diffusion are likely to be uneven across countries, firms, and workers—boosting regions with strong innovation ecosystems and possibly increasing market concentration among compute-capable incumbents.

Multi-region/sectoral modeling with heterogenous adoption and capability parameters; historical analogs showing concentration following early compute advantages; scenario comparisons.

medium negative Modeling Macroeconomic Output Gains from Quantum-Driven Prod... regional competitiveness, firm-level market concentration, distributional outcom...

Without coordinated investments and governance, large theoretical gains may remain unrealized or be very unevenly distributed.

Policy counterfactual scenarios in which underinvestment, fragmented governance, or restrictive export regimes reduce adoption elasticities and infrastructure readiness, producing lower and more concentrated macro gains compared with coordinated-investment scenarios.

medium negative Modeling Macroeconomic Output Gains from Quantum-Driven Prod... realized productivity gains; distribution of gains across firms/countries (inequ...

High executive digital cognition on its own tends to weaken the policy's positive effect on energy utilization efficiency (interpreted as short-run adjustment costs from digital transformation).

Interaction tests between policy treatment and an executive-level digital-cognition measure show a negative interaction coefficient in DID regressions; authors interpret this as evidence of short-run adjustment costs.

medium negative How Does Urban Green Data Center Policy Empower Corporate En... corporate energy utilization efficiency

The under‑use of external text sources in the reviewed literature may be due to privacy, legal/regulatory uncertainty, or integration costs.

Authors' interpretation linking observed low coverage of external text sources (social media, news, reviews) in the 109 articles to plausible barriers (privacy/regulation/integration); no direct empirical test in the review.

medium negative Natural language processing in bank marketing: a systematic ... use of external text sources in marketing research and barriers to their use

Restrictions on cross‑border data flows or fragmented privacy rules reduce the training data available to AI systems, lowering the quality and scalability of AI services exported internationally.

Theoretical linkage and literature on AI training data needs synthesized in the paper; no original empirical measurement of AI performance loss presented.

medium negative Analysis of Digital Services Trade and Export Competitivenes... AI model performance, quality/scalability of AI‑enabled exported services

Support systems for digital services exporters, especially SMEs, are inadequate in China.

Review of policy documents and literature highlighting gaps in finance, legal support, and standards compliance assistance for SME internationalization (qualitative).

medium negative Analysis of Digital Services Trade and Export Competitivenes... SME capacity to internationalize / SME export performance in digital services

China's platform firms show uneven internationalization and platform infrastructure is not consistently internationally competitive.

Case examples and synthesis of domestic/international studies on platform internationalization included in the review (qualitative evidence).

medium negative Analysis of Digital Services Trade and Export Competitivenes... platform international reach and infrastructure competitiveness

China has limited influence in high‑level trade rule formation.

Policy review and comparative institutional analysis within the literature review; descriptive assessment of China's participation in multilateral rule‑making (no formal measurement of influence).

medium negative Analysis of Digital Services Trade and Export Competitivenes... influence/representation in international rule‑setting fora (digital trade and d...

Current institutional, technological, and market shortcomings limit China’s ability to close the gap with economies operating under high‑standard trade regimes.

Qualitative comparative analysis of policy and institutional frameworks against high‑standard trade members; literature and case examples (no new microdata).

medium negative Analysis of Digital Services Trade and Export Competitivenes... relative export competitiveness gap vs. high‑standard trade economies

Widespread deployment of similar models could create correlated failures or fraud vectors, implying systemic risk that may warrant macroprudential attention.

Analytic caution based on model homogeneity and case/literature discussion; speculative systemic risk concern rather than empirically demonstrated.

medium negative Explore the Impact of Generative AI on Finance and Taxation systemic correlated failure risk, incidence of correlated fraud events

There is regulatory uncertainty around AI-generated filings and responsibility/liability for automated outputs.

Analysis and literature review discuss unclear regulatory positions and legal risks noted in case organizations' deployment considerations.

medium negative Explore the Impact of Generative AI on Finance and Taxation regulatory/compliance risk exposure for AI-generated filings

Integration complexity with legacy ERP/financial systems and sharing-center processes is a significant implementation challenge.

Case study narratives describe integration work and friction points; analytic framing highlights ERP compatibility issues.

medium negative Explore the Impact of Generative AI on Finance and Taxation integration effort/time/cost, compatibility with ERP systems

Model hallucinations, lack of explainability, and limited audit trails limit safe adoption.

Paper cites literature and case observations about model reliability and explainability issues; examples and discussion are qualitative.

medium negative Explore the Impact of Generative AI on Finance and Taxation model reliability (hallucination incidence), explainability/auditability metrics

Data privacy, confidentiality, and cross-border data transfer concerns are important barriers to deployment.

Challenges enumerated from case studies and literature; specific organizational concerns cited in cases (Xiaomi, Deloitte) and in regulatory discussion.

medium negative Explore the Impact of Generative AI on Finance and Taxation deployment constraints related to data privacy (e.g., blocked data flows, need f...

Automation and human–robot assemblages can reproduce subjugation and vulnerability affecting care workers and marginalized users, requiring attention to distributional justice and labor-market impacts.

Illustrative vignettes from healthcare robotics and literature synthesis on care ethics and labor impacts; no quantitative labor-market analysis presented.

medium negative Examining ethical challenges in human–robot interaction usin... distributional impacts on wages, bargaining power, welfare, and vulnerability of...

Legal liability regimes and insurance products may systematically under- or mis-assign costs of harm in socio-technical assemblages when primordial ethical demands are considered.

Conceptual argument and suggested modeling directions; no empirical simulation or insurance-market data presented.

medium negative Examining ethical challenges in human–robot interaction usin... accuracy of cost assignment in liability/insurance regimes for socio-technical h...

Treating responsibility as a Levinasian, asymmetrical moral obligation implies it operates as a non-contractible externality that markets and contracts may fail to internalize, creating persistent externalities in AI deployment that standard economic models may miss.

Theoretical implication derived from philosophical argument applied to economic concepts; suggested consequences but no formal models or empirical validation in the paper.

medium negative Examining ethical challenges in human–robot interaction usin... degree to which markets/contracts internalize asymmetrical moral obligations (th...

Simple pluralist or multi-principle balancing approaches risk reproducing structural subordination by failing to foreground the asymmetrical ethical demand toward vulnerable Others.

Normative critique supported by cross-disciplinary literature (care ethics, mediation, STS) and illustrative examples; no empirical test of pluralist approaches’ effects.

medium negative Examining ethical challenges in human–robot interaction usin... tendency of pluralist balancing approaches to reproduce structural subordination...

The Levinasian framework helps reveal how human–robot interactions can both expose and reproduce systemic vulnerabilities, subjugation, and unaddressed harms (termed 'Problem C' — attribution of responsibility and distributed agency).

Theoretical diagnosis supported by interdisciplinary literature synthesis and illustrative vignettes from healthcare robotics, autonomous vehicles, and algorithmic governance. No quantitative prevalence data.

medium negative Examining ethical challenges in human–robot interaction usin... presence/manifestation of systemic vulnerabilities, subjugation, and unaddressed...

Absent interoperability, divergence in data and AI rules will raise transaction costs, reduce trade gains, and create opportunities for regulatory arbitrage.

Economic reasoning and scenario-based projections; asserted as an outcome of mechanism analysis rather than demonstrated with quantitative estimates.

medium negative Path Analysis of Digital Economy and Reconstruction of Inter... transaction costs, aggregate trade gains, incidence of regulatory arbitrage

Explainability, auditability, or data-localization requirements could favor larger vendors with compliance capacity, increasing market concentration and affecting competition among AI suppliers.

Market-structure argument grounded in regulatory-compliance burden analysis and comparative examples; not supported by empirical market data in the study.

medium negative ARTIFICIAL INTELLIGENCE AND ADMINISTRATIVE GOVERNANCE: A CRI... market concentration and competition among AI vendors (supplier market structure...

Legal uncertainty and strict procedural requirements increase compliance costs and regulatory risk, which can slow AI adoption by firms and public agencies.

Theoretical economic implications drawn from legal analysis and comparative observations; no empirical measurement of costs or adoption rates in the study.

medium negative ARTIFICIAL INTELLIGENCE AND ADMINISTRATIVE GOVERNANCE: A CRI... AI adoption rate and investment risk (speed and likelihood of procurement/invest...

AI can restrict or reshape human administrative discretion in legally sensitive ways.

Doctrinal analysis of statutory specificity and formal procedural requirements in civil-law contexts, illustrated with Vietnam as the exemplar case; comparative observations.

medium negative ARTIFICIAL INTELLIGENCE AND ADMINISTRATIVE GOVERNANCE: A CRI... scope of administrative discretion (degree of human decision-making latitude)

Five qualitatively distinct D3 reflexive failure modes were identified in model responses, including categorical self-misidentification and false-positive self-attribution.

Qualitative coding and taxonomy reported in Results: five D3 categories cataloged with examples; identification based on analysis of model responses to narrative dilemmas (sample drawn from the study runs).

medium negative Literary Narrative as Moral Probe : A Cross-System Framework... enumeration and qualitative descriptions of reflexive failure modes observed in ...

A probe composed of deliberately unresolvable moral dilemmas embedded in literary (science-fiction) narrative resists surface performance and exposes a measurable gap between performed and authentic moral reasoning.

Experimental application of the probe to 13 distinct LLM systems across 24 experimental conditions (13 blind, 4 declared re-tests, 7 ceiling-probe runs), with scoring and qualitative coding showing discriminating failure modes and a measurable gap in responses.

medium negative Literary Narrative as Moral Probe : A Cross-System Framework... discriminative power of the probe (ability to expose failures/gaps) operationali...

Existing AI moral-evaluation benchmarks largely measure surface-level, correct-sounding answers rather than genuine moral-reasoning capacity.

Comparative argument based on study results showing a measurable gap when applying the authors' narrative-based probe (unresolvable SF dilemmas) versus standard benchmarks; empirical support comes from experiments across 24 conditions and 13 systems showing systems produce plausible-sounding but reflexive/invalid reasoning on the narrative probe.

medium negative Literary Narrative as Moral Probe : A Cross-System Framework... gap between polished/surface moral answers and deeper/authentic moral-reasoning ...

Capabilities and data advantages for certain vendors could lead to market concentration and platform dominance in AI-driven educational feedback.

Expert concern synthesized from the workshop of 50 scholars about market dynamics; theoretical warning without empirical market-structure analysis in the report.

medium negative The Future of Feedback: How Can AI Help Transform Feedback t... market concentration measures (market share, Herfindahl index); entry barriers; ...

Differential access to high-quality AI feedback systems and bias in training data can exacerbate educational inequalities and harm marginalized groups.

Expert consensus and thematic analysis from the 50-scholar workshop, raising equity and bias risks; no empirical subgroup effectiveness estimates included.

medium negative The Future of Feedback: How Can AI Help Transform Feedback t... access disparities; differential effectiveness by subgroup; measures of algorith...

Learners may over-rely on AI feedback or game systems to obtain desirable responses, reducing effortful learning.

Workshop participant concerns synthesized qualitatively; cited as risk and an open empirical question—no experimental data provided.

medium negative The Future of Feedback: How Can AI Help Transform Feedback t... learner reliance on AI (usage patterns); changes in effortful learning behaviors...

Reliance on single-agent outputs or non-diverse agent ensembles can understate substantive uncertainty and bias conclusions in automated policy evaluation or AI-assisted empirical research.

Observed substantial agent-to-agent variability (NSEs) in the experiment (150 agents) demonstrating that single-agent results do not capture between-agent methodological uncertainty; imbalance between model families further implies potential bias if only one family is used.

medium negative Nonstandard Errors in AI Agents degree to which single-agent point estimates fail to capture between-agent dispe...

The post-exemplar convergence largely reflected imitation of exemplar choices rather than demonstrated understanding or principled correction by agents.

Qualitative and behavioral analysis of agents' post-exposure outputs showing direct adoption of exemplar measures/procedures and lack of substantive justification or mechanistic reasoning indicating comprehension; inference based on content of agent code and writeups after exposure.

medium negative Nonstandard Errors in AI Agents qualitative indicators of reasoning/comprehension in agents' outputs (textual ju...

Chat-like interfaces commonly activate misleading beliefs including overtrust in correctness/robustness, attribution of goals or moral agency, and underestimation of hallucination/bias/privacy risks.

Aggregated observations from literature in HCI and ethics; suggested examples rather than empirical prevalence estimates; no sample size given.

medium negative Why We Need to Destroy the Illusion of Speaking to A Human: ... incidence of overtrust, attribution of agency, and underestimation of model fail...

Natural conversational style creates the impression the system is human-like, intentional, or reliably knowledgeable.

Conceptual claim supported by synthesis of prior work on anthropomorphism and conversational interfaces; no new quantitative data provided.

medium negative Why We Need to Destroy the Illusion of Speaking to A Human: ... user beliefs about system humanness, intentionality, and perceived reliability

Reliance on preference signals risks learning spurious proxies and produces unstable behavior under distribution shift.

Theoretical argument supported by examples of spurious proxies in ML and by observations in RLHF-trained models; the paper cites literature showing proxy behavior but does not present a unified empirical quantification specific to RLHF across many tasks.

medium negative Via Negativa for AI Alignment: Why Negative Constraints Are ... frequency of spurious-proxy-driven failures and degradation in behavior under di...

Positive preference signals are continuous, context-dependent, and entangled with surface correlates (e.g., agreement with the user), which causes models trained on them to pick up spurious proxies and exhibit sycophancy and brittleness.

Conceptual/theoretical argument in the paper describing structural properties of preference spaces, supported by cited observations of sycophantic behavior in models trained with preference-based objectives. No single definitive empirical quantification is provided within the paper; supporting examples are drawn from recent literature.

medium negative Via Negativa for AI Alignment: Why Negative Constraints Are ... incidence of sycophantic behavior and brittleness (e.g., tendency to agree with ...

There is a risk of manipulation and misinformation if argument mining/synthesis is unregulated or misaligned with social incentives, creating externalities that may justify public intervention.

Conceptual risk assessment combining known misinformation dynamics and AI capabilities; no empirical incident data provided.

medium negative Argumentative Human-AI Decision-Making: Toward AI Agents Tha... incidence of manipulation/misinformation attributable to argument-mining/synthes...

Increased error risk and weaker explainability from GLAI will raise malpractice and liability exposure for firms and lawyers, driving up insurance and compliance costs.

Legal-risk analysis and economic reasoning connecting explainability/liability to insurance costs; no empirical cost studies presented.

medium negative Why Avoid Generative Legal AI Systems? Hallucination, Overre... malpractice/liability exposure levels and associated insurance/compliance costs

« Prev 1 2 3 … 93 94 95 … 137 138 Next »