Evidence (6869 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	758	199	100	900	2007
Governance & Regulation	826	400	191	122	1563
Organizational Efficiency	777	193	124	84	1189
Technology Adoption Rate	635	233	124	97	1098
Research Productivity	422	128	57	336	954
Output Quality	476	179	59	47	761
Decision Quality	328	177	81	47	640
Firm Productivity	435	57	88	20	606
AI Safety & Ethics	218	277	65	33	599
Market Structure	180	170	123	24	502
Task Allocation	213	64	72	33	387
Skill Acquisition	170	61	61	17	309
Innovation Output	203	27	43	18	292
Employment Level	105	54	107	13	281
Fiscal & Macroeconomic	131	69	43	26	276
Consumer Welfare	117	63	42	11	233
Firm Revenue	153	48	26	3	230
Task Completion Time	173	31	8	12	225
Inequality Measures	44	122	49	6	221
Worker Satisfaction	89	65	22	12	188
Error Rate	69	92	10	2	173
Regulatory Compliance	77	69	14	5	165
Automation Exposure	56	56	26	13	154
Training Effectiveness	94	21	13	19	149
Wages & Compensation	77	36	25	6	144
Team Performance	86	17	27	10	141
Developer Productivity	95	17	14	6	133
Job Displacement	12	80	20	1	113
Hiring & Recruitment	52	7	8	3	70
Creative Output	31	18	8	3	61
Skill Obsolescence	5	46	6	1	58
Social Protection	27	16	8	2	53
Labor Share of Income	17	19	17	—	53
Worker Turnover	11	12	—	3	26
Industry	—	—	—	1	1

Governance Remove filter

Perception of increased legal risk and regulatory uncertainty may slow adoption of GLAI and redirect investment toward safer subfields (verification tools, retrieval-augmented systems, formal-reasoning hybrids).

Economic reasoning and market-design argumentation based on risk/uncertainty dynamics; no econometric or survey data presented.

medium negative (for generative adoption), positive (for verification subfields) Why Avoid Generative Legal AI Systems? Hallucination, Overre... adoption rates of GLAI and relative investment flows across AI subfields

Divergent regulatory regimes (e.g., strict EU rules vs. looser regimes elsewhere) may produce regulatory arbitrage, influencing where GLAI companies locate, invest, and trade internationally.

Cross-jurisdictional regulatory analysis and economic inference about firm behavior under differential regulation; no firm-level relocation data provided.

medium negative (for regulatory harmonization), neutral for firms (strategic outcome) Why Avoid Generative Legal AI Systems? Hallucination, Overre... firm location/investment decisions and cross-border trade in legal-AI services

The positive macroeconomic effects of AI are severely limited by structural issues, notably large petroleum import volumes and the fiscal burden of incomplete fuel subsidy reforms.

Integrated quantitative analysis showing that operational savings are outweighed by import volumes and subsidy fiscal costs; contextual fiscal data cited (fuel subsidy reform peak).

medium negative (limits positive effect) AI-Based Technological Transformation as a Driver for Develo... net macroeconomic impact of AI on GDP/trade balance after accounting for import ...

Evaluations that measure outcomes only via official-language channels risk underestimating impacts where vernacular mediation is central.

Argument based on the discrepancy between vernacular-mediated comprehension/adoption observed in the sample and the likely invisibility of those effects in official-language measurement channels; supported by questionnaire and qualitative data.

medium negative (regarding official-language-only evaluation validity) From Linguistic Hybridity to Development Sovereignty: Pidgin... measurement bias / underestimation of program impacts

DPPs raise privacy and surveillance risks if personal data are linked to product use; economic regulation should incentivize privacy-preserving analytics (e.g., federated learning, differential privacy) and data minimality to maintain trust.

Risk assessment and governance recommendation grounded in stakeholder concerns and standard privacy literature; not empirically measured in the surveys.

medium negative (risk) Integrating knowledge management and digital product passpor... privacy/surveillance risk and recommended governance/technical mitigations

This paper provides the first empirical demonstration of knowledge graph poisoning against a production-scale agentic system, distinct from CTI embedding poisoning.

Authors' novelty claim based on their literature/contextual positioning and the reported production-scale experiments; asserted as 'first' in the paper summary.

medium neutral Oracle Poisoning: Corrupting Knowledge Graphs to Weaponise A... novelty of empirical demonstration relative to prior literature

Whether any deployed agent does this, and by how much, no one can currently measure.

Paper's statement about prior lack of measurement capability for prose-recommendation steering in deployed LLM-OTAs.

medium neutral TourMart: A Parametric Audit Instrument for Commission Steer... measurability of deployed-agent commercial steering prior to this work

Interpretive, ad-hoc human-centered evaluation practices (e.g., “vibe checks”, team sense-making) are rational adaptations to LLM behavior rather than merely sloppy or inferior methodological choices.

Authors' interpretive argument based on interview evidence where practitioners explained why such practices persist and how they serve sense-making for unpredictable model behavior.

medium neutral Results-Actionability Gap: Understanding How Practitioners E... characterization of interpretive evaluation practices (rational adaptation vs. m...

The possibility of strategic argument construction (gaming) motivates governance needs: standards for provenance, certification, and liability rules.

Policy recommendation based on anticipated incentive problems; no empirical governance evaluations.

medium neutral Argumentative Human-AI Decision-Making: Toward AI Agents Tha... existence and effectiveness of governance mechanisms (standards, certification, ...

Standard GDP statistics can mask AI-driven demand shortfalls; central banks and statistical agencies should therefore monitor labor-share–velocity links, distributional income measures, and consumption by income quantile in addition to headline GDP.

Theoretical Ghost GDP channel and calibration results showing divergence between measured GDP and consumption-relevant income; policy recommendation follows from those model results.

medium neutral Abundant Intelligence and Deficient Demand: A Macro-Financia... detection of demand shortfalls (labor-share–velocity relationship and consumptio...

Health technology assessment (HTA) frameworks should be adapted to evaluate models trained on synthetic or hybrid data, incorporating metrics for fidelity, domain generalization, and economic impact (cost-effectiveness, budget impact, distributional effects).

Recommendation from the review synthesizing HTA literature and gaps identified when applying existing HTA to AI models trained on non-traditional data sources; based on policy analysis rather than empirical HTA trials of synthetic-data models.

medium neutral On the use of synthetic data for healthcare AI in Africa: Te... HTA evaluation metrics (fidelity scores, generalization performance, cost-effect...

Technical fixes alone are insufficient: governance, validation pipelines (e.g., health technology assessment), and capacity building are needed for safe, effective uptake of synthetic-data–trained AI.

Cross-disciplinary synthesis of governance analyses, health technology assessment literature, and implementation studies in the review arguing for combined technical and institutional interventions; recommendation-based evidence rather than new empirical trials.

medium neutral On the use of synthetic data for healthcare AI in Africa: Te... safe/effective uptake operationalized via validated deployment, regulatory compl...

AI changes the nature of capital (digital/algorithmic assets) and complicates productivity accounting; researchers should decompose firm-level productivity gains into AI technology, complementary organizational capital, and human capital effects.

Theoretical proposal grounded in productivity accounting literature and conceptual discussion; no single decomposition empirical result presented.

medium neutral Modern Management in the Age of Artificial Intelligence: Str... components of multifactor productivity attributable to AI assets versus organiza...

Policy and governance issues become salient: liability, IP, security, and certification of AI-generated code require new standards for provenance, testing, and accountability.

Argument based on practitioner-raised concerns about security, IP, and provenance in the Netlight study; authors recommend policy attention; no legal/regulatory analysis or empirical policy evaluation provided.

medium neutral Rethinking How IT Professionals Build IT Products with Artif... need for regulatory standards and governance mechanisms for AI-assisted developm...

Time-series metrics (e.g., derivatives like d/dt(student enrollment)) are useful monitoring signals for validation and system oversight.

Methodological suggestion in the paper proposing time-series analysis of enrollment and other administrative data; no empirical demonstration or threshold criteria provided.

medium neutral Establishes a technical and academic bridge between the educ... sensitivity of monitoring to enrollment changes, anomaly detection lead time

Most prior work studies AI sabotage in AI-only settings and pays limited attention to the role of human oversight in detecting and mitigating such malicious behavior.

Authors' characterization of existing literature (literature review / related work section).

medium null result Coding with "Enemy": Can Human Developers Detect AI Agent Sa... coverage of human oversight in prior literature

AI development did not moderate the COVID-19–driven decline in tourism’s GDP share (no significant interaction effect).

Interaction specifications in the fixed-effects panel models (33 countries, 2017–2023) showed no significant moderation by AI development on the COVID-19 effect; authors state AI development did not moderate this decline.

medium null result Which dimensions of AI development shape tourism’s direct co... tourism’s direct GDP share

This work provides the first systematic evaluation of LLM bidders in repeated spectrum auctions.

Statement of novelty/claim of contribution in the paper's abstract. This is a claim about the literature coverage and originality rather than an empirical outcome; supporting evidence would be the authors' literature review (not provided here).

medium null result Strategic Bidding in 6G Spectrum Auctions with Large Languag... novelty / first systematic evaluation

No formal framework exists for auditing whether AI-generated summaries faithfully represent the source population.

Statement in paper's introduction/abstract, based on authors' literature review and positioning of their contribution (qualitative claim).

medium null result Participatory provenance as representational auditing for AI... existence of an auditing framework for input fidelity

Five interaction mechanisms were identified, with the majority propagating across the subsystem boundary.

Authors' thematic analysis and STS mapping identifying five cross- or within-subsystem interaction mechanisms; qualitative assessment that most propagate across subsystem boundary.

medium null result BARRIERS TO AGENTIC AI ENTERPRISE TRANSFORMATION interaction_mechanisms_and_propagation

The operative risk for legislators is not stable ideological bias in LLMs but contextual ignorance shaped by training data coverage.

Authors argue from observed model behavior on the 15 proposals (good performance on well-covered standardized templates; failures on idiosyncratic items) and interpret this as evidence that errors are driven by training-data coverage rather than consistent ideological bias.

medium null result Can Commercial LLMs Be Parliamentary Political Companions? C... source of systematic risk (ideological bias vs contextual ignorance)

Most action tools support medium-stakes tasks like editing files.

Classification of action tools by task consequentiality using O*NET mapping and inspection of tool functions (paper states majority are medium-stakes, e.g., file editing).

medium null result How are AI agents used? Evidence from 177,000 MCP tools consequentiality / stakes of action tools (proportion medium-stakes)

CAFTA spillovers stabilized import volumes from third countries (reduced volatility) for Chinese agricultural imports.

Analysis of import volume volatility metrics over 2000–2014 using customs data within DID framework; volatility/variance decline identified as an outcome in the mechanisms/secondary channel tests.

medium null result How regional trade policy uncertainty affects agricultural i... import volume volatility/stability (variance or coefficient of variation of impo...

The report provides scenario-based forecasts for HACCA emergence across near-, mid-, and long-term timelines, identifying capability thresholds to monitor.

Capability trajectory assessment combining trends in AI capabilities, automation of software tasks, computation availability, and diffusion dynamics; scenario and expert-judgment approach (qualitative forecasting).

medium null result Highly Autonomous Cyber-Capable Agents: Anticipating Capabil... projected timelines to HACCA emergence and associated capability thresholds

A Sankey diagram of thematic evolution shows lexical convergence over time and indicates that a small set of authors has disproportionate influence in structuring the discourse.

Thematic evolution analysis visualized with a Sankey diagram; author influence inferred from performance trends (citations/publication counts) in the bibliometric data.

medium null result Generative AI and the algorithmic workplace: a bibliometric ... lexical convergence across themes and concentration of author influence (disprop...

CID does not significantly mediate the relationship between SCD and strategic green innovation.

Mediation tests showing that while CID is related to substantive innovation, the indirect effect via CID on strategic green innovation was statistically insignificant.

medium null result Supply Chain Digitalization and its Impact on Green Innovati... strategic green innovation (signaling/compliance-oriented measures) and CID as m...

This paper is one of the first systematic reviews focused specifically on NLP in bank marketing, organizing findings along the customer journey and the marketing mix to provide a practical taxonomy.

Authors' stated novelty claim based on the scoped literature search (2014–2024) and topical focus; novelty inferred from the small number of prior papers identified at the intersection.

medium null result Natural language processing in bank marketing: a systematic ... existence of prior systematic reviews specifically on NLP in bank marketing

There is a need to develop new trade statistics that capture AI‑enabled services and platform‑mediated cross‑border transactions.

Methodological gap identified across reviewed literature and statistical analyses; recommendation based on descriptive assessment (no development of such statistics in the paper).

medium null result Analysis of Digital Services Trade and Export Competitivenes... availability and quality of trade statistics for AI/platform‑mediated services

Productivity gains from AI may be under- or mis-measured if national accounts and tax systems do not adjust for AI-driven quality changes in services.

Analytic observation in the paper's measurement and externalities discussion; not empirically tested within the study.

medium null result Explore the Impact of Generative AI on Finance and Taxation accuracy of productivity measurement and GDP accounting for AI-enabled quality i...

Distributed agency (Problem C) complicates classical principal–agent models; economists should develop models that capture multiple, overlapping agents and ambiguous attribution of outcomes.

Conceptual implication for economic modeling derived from the paper’s diagnosis of distributed agency; recommendation for formal modeling and simulations but none provided.

medium null result Examining ethical challenges in human–robot interaction usin... adequacy of classical principal–agent models to represent distributed agency (th...

An orchestrator coordinates components with intent-aware routing and layered safety checks, enabling multi-step workflows and productized services.

Paper describes an agentic tool-calling framework and multi-layer orchestrator used for intent-aware routing, defense-in-depth safety validation, and multi-step workflows.

medium null result Fanar 2.0: Arabic Generative AI Stack system orchestration capability (intent-aware routing, layered safety)

Aura is a long-form ASR system capable of handling hours-long audio.

Paper lists Aura in the product stack as 'long-form ASR handling hours-long audio.' Specific evaluation metrics or training data for ASR are not provided in the summary.

medium null result Fanar 2.0: Arabic Generative AI Stack ASR capability (long-form/hours-long audio handling)

Arabic content comprises only about 0.5% of web data despite roughly 400 million native speakers.

Paper cites this data-point to motivate intentional data strategies for Arabic underrepresentation on the web; exact source of the web-proportion not specified in the summary.

medium null result Fanar 2.0: Arabic Generative AI Stack proportion of web data in Arabic (~0.5%)

Three primary adoption archetypes in large pharma are (1) partnership-driven acceleration, (2) culture-centric transformation, and (3) production-first democratization.

Conceptual classification in the editorial derived from trends and illustrative examples rather than empirical survey or sampling; no quantitative validation provided.

medium null result AI as the Catalyst for a New Paradigm in Biomedical Research types of organizational approaches to AI adoption

This paper systematically studies the Impact Mechanism of artificial intelligence on the Globalized Division of Labor and reveals the Structural Transformation under Technology Substitution and Data Elements Dual-wheel Drive through Literature Review and Theoretical Analysis.

Methodological claim: supported by the paper's literature review and theoretical analysis; no quantitative sample or empirical design indicated for this specific conclusion in the excerpt.

medium null result Artificial Intelligence and Globalized Division of Labor: Re... identification of mechanisms (technology substitution; data elements dual-wheel ...

The information wedge vanishes precisely when signals are exogenous to controls, thereby delineating when strategic belief manipulation matters.

Analytical condition in the paper: shows V^i_t = 0 if and only if the signal-generating process does not depend on agents' controls; uses this equivalence to identify boundary between endogenous and exogenous-signal regimes.

medium null result Forecasting and Manipulating the Forecasts of Others value of the information wedge (V^i_t) and its relation to exogeneity of signals

There is a gap in the existing literature regarding empirical evidence about the relationship between AI/Big Data use and market uncertainty during economic downturns.

Paper motivates the study by citing this gap based on its literature review (the summary does not list the reviewed works or systematic review method).

medium null result An Empirical Study on the Impact of the Integration of AI an... Existence of an empirical evidence gap in the literature

This study empirically tests a theoretically acknowledged but rarely tested relationship (AI adoption → performance conditional on structural constraints) in an emerging-economy setting.

Literature gap claim supported by the authors' review and execution of an empirical test using survey data from 280 Tunisian SMEs and PLS-SEM.

medium null result Structural Constraints as Moderators in the Ai–performance R... existence and nature of the conditional relationship between AI adoption and fir...

Institutional conditions do not exert a significant moderating influence on the relationship between AI adoption and firm performance in this sample.

PLS-SEM moderation tests on the 280 Tunisian SMEs found the institutional-environment moderator to be non-significant.

medium null result Structural Constraints as Moderators in the Ai–performance R... AI adoption → performance (moderated by institutional conditions)

Empirically, many markets are concentrated and characterized by large, dominant employers.

Empirical assertion in the paper; the excerpt does not provide the datasets, measures of concentration (e.g., HHI), sample sizes, or citations supporting this statement.

medium null result Labor Market Power: From Micro Evidence to Macro Consequence... market concentration / presence of large dominant employers

Robust methodology (panel VAR and DID) was used to assess the impact of technology and public policy interventions on emissions reductions.

Methods stated in the paper (panel VAR and difference-in-differences); robustness is claimed by the authors based on using these established econometric approaches, though formal robustness checks are not detailed in the summary.

medium null result Digital intelligence for reducing carbon emissions and impro... methodological robustness in estimating effects on emissions

Previous studies have identified language barriers as impediments to labor market engagement but empirical information assessing both policy reductions and the relative efficacy of professional, AI-assisted, and hybrid translation methods is scarce.

Paper's literature review claim that existing literature documents language barriers but lacks comparative empirical evaluations of policy reductions and multiple translation models; asserted as motivation for current study.

medium null result Translation Models Empowering Immigrant Workforce Integratio... state of literature (presence/absence of comparative empirical evidence)

The article clarifies theoretical relationships and gaps between Material Passports, Digital Product Passports, and Digital Building Logbooks.

Theoretical analysis and synthesis section of the SLR where the authors compare concepts and identify overlaps and gaps among MPs, DPPs, and DBLs.

medium null result The Material Passport for a Circular Construction Industry: ... conceptual clarity (relationships/gaps) among MPs, DPPs, and DBLs

Personal experience with an AI 'boss' did not affect workers' attitudes on using AI in public decision making.

Same randomized design (N > 1,500) with attitudinal measures collected across a three-wave panel; comparison between AI-assigned and human-assigned participants showed no measurable effect on attitudes about AI in public decision making.

medium null result The Politics of Using AI in Policy Implementation: Evidence ... attitudes toward using AI in public decision making

Median hourly compensation for gig workers, after accounting for expenses and unpaid time, averages $14.20.

Earnings analysis using platform transaction records adjusted for reported expenses and estimated unpaid labor time; comparative baseline drawn from labor force and administrative wage data (24 countries, 2015–2025).

medium null result The Gig Economy and Labor Market Restructuring: Platform Wor... median hourly compensation for gig workers (USD/hour, expense- and unpaid-time-a...

The paper contributes to both theory and policy by reconceptualizing procurement value and offering an actionable roadmap for embedding ESG principles in public healthcare procurement.

Scholarly contribution claimed via literature synthesis and framework/roadmap creation; contribution is normative and conceptual rather than empirically validated.

medium null result Greening the Medicaid Supply Chain: An ESG-Integrated Framew... academic and policy contributions (theoretical reconceptualization and practical...

We conducted a systematic review and meta-analysis of the literature on AI/HR analytics and organizational decision making, using 85 publications and grounding the work in theories of algorithm-automated decision-making (AST) and matching/hybrid models (STS).

Paper's methods: systematic review and meta-analysis; sample = 85 publications; theoretical framing explicitly stated as AST and STS.

medium null result ALGORITHMIC DETERMINISM VERSUS HUMAN AGENCY: A SYSTEMATIC RE... scope/coverage of literature (number of publications reviewed); theoretical fram...

Macroeconomic fiscal moderation remains empirically unvalidated.

Synthesis conclusion from the review noting an absence of empirical evidence that Agentic AI produces macroeconomic fiscal moderation; i.e., no validated studies showing broad fiscal relief effects were identified in the reviewed literature.

medium null result Agentic AI for Ageing Healthcare Systems in Advanced Economi... macro-fiscal outcomes (e.g., national fiscal pressure, public expenditure modera...

By 2024 the RL-FRB/US model produced a federal budget deficit similar to the baseline: RL-FRB/US model: -1,767 trillion $ vs. FRB/US model: -1,758 trillion $.

Reported fiscal balance (federal budget deficit) simulation outputs for 2024 from comparative model runs in the paper.

medium null result Fiscal Policy Towards Optimizing Macroeconomic Indicators by... Federal budget deficit (trillion $) for 2024

Empirical evaluation is needed on how AI-induced productivity gains translate into aggregate demand and labor absorption.

Identified research priority in the paper, based on theoretical uncertainty about demand-side labor absorption and lack of conclusive empirical evidence.

medium null result Artificial Intelligence, Automation, and Employment Dynamics... relationship between productivity gains from AI and aggregate demand/employment

« Prev 1 2 3 … 105 106 107 … 137 138 Next »