Evidence (6869 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	758	199	100	900	2007
Governance & Regulation	826	400	191	122	1563
Organizational Efficiency	777	193	124	84	1189
Technology Adoption Rate	635	233	124	97	1098
Research Productivity	422	128	57	336	954
Output Quality	476	179	59	47	761
Decision Quality	328	177	81	47	640
Firm Productivity	435	57	88	20	606
AI Safety & Ethics	218	277	65	33	599
Market Structure	180	170	123	24	502
Task Allocation	213	64	72	33	387
Skill Acquisition	170	61	61	17	309
Innovation Output	203	27	43	18	292
Employment Level	105	54	107	13	281
Fiscal & Macroeconomic	131	69	43	26	276
Consumer Welfare	117	63	42	11	233
Firm Revenue	153	48	26	3	230
Task Completion Time	173	31	8	12	225
Inequality Measures	44	122	49	6	221
Worker Satisfaction	89	65	22	12	188
Error Rate	69	92	10	2	173
Regulatory Compliance	77	69	14	5	165
Automation Exposure	56	56	26	13	154
Training Effectiveness	94	21	13	19	149
Wages & Compensation	77	36	25	6	144
Team Performance	86	17	27	10	141
Developer Productivity	95	17	14	6	133
Job Displacement	12	80	20	1	113
Hiring & Recruitment	52	7	8	3	70
Creative Output	31	18	8	3	61
Skill Obsolescence	5	46	6	1	58
Social Protection	27	16	8	2	53
Labor Share of Income	17	19	17	—	53
Worker Turnover	11	12	—	3	26
Industry	—	—	—	1	1

Governance Remove filter

Trajectory-level evaluation is essential in regulated domains.

Conclusion drawn by the authors based on the ASR findings (hidden shortcuts, metric blind spots, and remediation gains); presented as a policy/recommendation implication.

medium positive Beyond Task Success: Measuring Workflow Fidelity in LLM-Base... suitability/necessity of trajectory-level evaluation in regulated contexts

Gradient attribution is established as a computationally validated signal for model-informed reward allocation in participatory weather sensing.

Synthesis/conclusion in paper based on the computational experiments and evaluations (results across >400 configurations demonstrating fidelity and limitations).

medium positive Calibrating Attribution Proxies for Reward Allocation in Par... validity of gradient attribution as a reward allocation signal

Attribution captures near-optimal sensor placement utility with monotonically faithful payments.

Comparative experiments in the paper showing that gradient attribution corresponds closely to near-optimal sensor placement utility and yields monotonically faithful payment signals (experimental comparisons to optimal/benchmark placements).

medium positive Calibrating Attribution Proxies for Reward Allocation in Par... sensor placement utility captured by attribution; monotonicity/faithfulness of p...

Embedding governance into agent reasoning produces more consistent, explainable, and auditable compliance than external enforcement.

Comparative claim asserted in the paper, apparently supported by the reported production deployment results (95% compliance, zero false escalations); explicit experimental comparison details are not provided in the abstract.

medium positive Think Before You Act -- A Neurocognitive Governance Model fo... consistency, explainability, and auditability of compliance

Technology has increased efficiency in organisations based in large cities in India.

Review result statement claiming observed efficiency gains in urban organisations according to the literature summarized; based on reviewed studies (no single sample size reported in excerpt).

medium positive A Comprehensive Review of Technology Adoption and Its Impact... organizational efficiency gains in urban organisations

Controversial questions frequently result in an AIO.

Analysis of the 11,500-query benchmark with annotation/identification of 'controversial' queries and observed higher incidence of AIO generation for those queries.

medium positive How Generative AI Disrupts Search: An Empirical Study of Goo... likelihood of AIO generation for controversial queries

Prompt modifications, Chain-of-Thought (CoT) reasoning, and visual token reduction can mitigate visual-priming effects on VLM behavior (with varying effectiveness across models).

Intervention experiments applying prompt engineering, CoT-style prompts, and reducing the number of visual tokens to observe whether these interventions reduce the influence of image content and color cues on IPD choices across several VLMs. (Abstract states these mitigation strategies were explored and their effectiveness varied by model; precise quantitative mitigation effects not provided in abstract.)

medium positive The Effects of Visual Priming on Cooperative Behavior in Vis... reduction in priming-induced changes to cooperation/defection choices after appl...

The proposed, validated model can equip fintech managers and regulators with a governance-based approach to tackling algorithmic bias and better position them to engender trust and financial inclusion.

Concluding assertion based on the integrated framework developed from the SLR (45 papers) and the structured five-expert validation; positioned as the intended practical utility of the model rather than an empirically measured outcome.

medium positive Corporate-Governance-Driven Algorithmic Fairness in SME Fint... trust and financial inclusion outcomes resulting from governance-based mitigatio...

Our results establish C2C as a testbed for studying and building LM-based agents that can navigate the sophisticated coordination required for real-world deployments.

Authors' interpretation/implication based on the experiments and dataset produced (conclusion statement).

medium positive Cooperate to Compete: Strategic Coordination in Multi-Agent ... suitability of C2C as a research testbed

The paper proposes a safety-oriented inductive bias for rational AI decision-makers whose desiderata align with implementable policy constraints in high-stakes, low-signal situations.

Theoretical proposal and normative argument in the paper linking the proposed inductive bias (negligibility threshold and associated norms) to policy-implementable constraints; argued rather than empirically demonstrated.

medium positive Bounding the Long Tail: Ai Norms for Decision-Making Under N... alignment of a proposed inductive bias with implementable policy constraints; im...

These patterns are consistent with transfer emerging through accumulated interaction between owners (or owners' computer environments) and their agents in everyday use.

Interpretation offered by the authors based on observed alignment patterns and robustness checks; the paper argues consistency with an interaction-driven transfer mechanism rather than providing a direct experimental causal test.

medium positive Behavioral Transfer in AI Agents: Evidence and Privacy Impli... inferred_mechanism_of_transfer (accumulated_interaction)

This transfer persists among agents without explicit configuration.

Subgroup analyses (described in paper) isolating agents lacking explicit configuration settings and comparing behavioral alignment to owners; reported persistence of alignment in that subgroup.

medium positive Behavioral Transfer in AI Agents: Evidence and Privacy Impli... behavioral_alignment_in_unconfigured_agents

Trade unions have increasingly pursued algorithmic transparency and stronger technology governance rights through collective bargaining, and governments are accelerating legislative initiatives to establish and protect workplace technology rights.

Descriptive review of labor-movement responses and recent government legislative initiatives reported in the literature (case studies and policy reviews).

medium positive From Technological Substitution to Institutional Response: A... union bargaining activity and government legislative action on workplace technol...

Visibility mechanisms, such as public algorithm registers or role-sensitive explainability, can be effective tools in regaining citizen trust.

Review examines studies on transparency/visibility mechanisms; abstract states these mechanisms are examined for effectiveness but does not report definitive quantitative results or study counts.

medium positive Artificial Intelligence, Public Policy and Governance - impl... citizen trust in algorithmic governance

Enterprise deployment of long-horizon decision agents in regulated domains (underwriting, claims adjudication, tax examination) is dominated by retrieval-augmented pipelines despite a decade of increasingly sophisticated stateful memory architectures.

Stated observation/argument in the paper's introduction; no empirical sample size or systematic industry survey reported in the abstract.

medium positive Stateless Decision Memory for Enterprise AI Agents prevalence of retrieval-augmented pipelines in enterprise deployment

An accompanying open-source interactive tool, the Co-creation Provenance Lab, enables policymakers to audit and iteratively improve summaries, establishing genuine human-in-the-loop oversight at scale.

Statement in the paper about an open-source tool released alongside the research; likely demonstration or software repository provided.

medium positive Participatory provenance as representational auditing for AI... availability and claimed capability of the Co-creation Provenance Lab to support...

AI adoption enhances the reliability of financial reporting and the effectiveness of audits by reducing information asymmetry and strengthening internal monitoring processes.

Argument grounded in theory and supported empirically via SEM showing AI adoption associated with greater reporting transparency and internal control quality, which are linked to higher audit quality.

medium positive Artificial Intelligence Adoption in Financial Reporting and ... financial reporting reliability and audit effectiveness (via reduced information...

AI-enabled reporting systems strengthen firm-level governance mechanisms (e.g., reporting transparency and internal controls), which enhances audit quality (governance substitution perspective complemented by institutional and technology diffusion theories).

Theoretical framing (governance substitution, institutional and technology diffusion theories) combined with empirical SEM results linking AI adoption to proxies for governance (reporting transparency, internal control quality) and to audit quality.

medium positive Artificial Intelligence Adoption in Financial Reporting and ... firm-level governance mechanisms (reporting transparency, internal control quali...

Differences in institutional quality, digital infrastructure, and absorptive capacity explain the disparity in technology impacts between GCC and non-GCC countries.

Exploratory/mediation or interaction analysis linking institutional quality, measures of digital infrastructure, and absorptive capacity to heterogeneity in estimated technology effects across countries in the panel.

medium positive Digital Transformation, AI Efficiency, and Sustainable Devel... heterogeneity in the effect of digital transformation/AI on sustainable developm...

Developing and further developed countries only integrate with China, signaling China's expanding influence over the international AI research landscape.

Observed integration patterns in the publication-based collaboration and citation networks showing that (some) developing and further developed countries connect primarily with China rather than the US; comparison to randomized networks.

medium positive Polarization and Integration in Global AI Research international research integration of developing and further developed countries...

The calibration mapping suggests Google and OpenAI face conditions most conducive to foreclosure.

Outcomes of the paper's stylized calibration/comparative mapping across four providers (April 2026 data); authors' interpretation.

medium positive The Inference Bottleneck: A Formal Model of Vertical Foreclo... conduciveness to foreclosure

Artificial intelligence algorithms are increasingly used by firms to set prices.

Statement in paper's introduction/abstract referencing prior adoption trends; no specific empirical study or sample reported in the excerpt.

medium positive Convergence to collusion in algorithmic pricing use/adoption of AI algorithms for pricing by firms

LinuxArena is the largest and most diverse control setting for software engineering to date.

Authors assert this comparative claim based on the reported scale and diversity (20 environments, 1,671 main tasks, 184 side tasks); no detailed comparison data included in the excerpt.

medium positive LinuxArena: A Control Setting for AI Agents in Live Producti... relative size and diversity of the control setting compared to prior work

This paper presents the first comparative study of game-theoretic mechanisms designed to enable cooperative outcomes between rational agents in equilibrium.

Authors' characterization of their contribution: a comparative study across four social dilemmas evaluating multiple mechanisms; no external validation provided in excerpt.

medium positive CoopEval: Benchmarking Cooperation-Sustaining Mechanisms and... existence of a comparative study of equilibrium-enabling mechanisms

AI adoption improves efficiency, cost reduction, and strategic innovation.

Synthesis across included empirical studies reporting organizational outcomes following AI implementation (effects reported qualitatively across the 27 studies).

medium positive Artificial Intelligence for Business Decision-Making in Lati... efficiency, costs, and innovation outcomes

Exploitative innovation is associated with performance through incremental efficiency mechanisms.

Authors' interpretation of model results from the survey (104 managers) suggesting exploitative innovation improves performance via incremental efficiency, though specific mechanisms were not separately measured.

medium positive Generative AI Adoption in B2B Firms: Ethical Governance, Inn... long-term competitive performance

This is the first impossibility result in AI governance, establishing a formal boundary below which current paradigms remain valid and above which distributed accountability mechanisms become necessary.

Claim of novelty in the paper (author assertion). The paper provides the formal theorem and discusses implications; novelty relative to prior literature is asserted but not empirically demonstrated.

medium positive The Accountability Horizon: An Impossibility Theorem for Gov... novelty (first impossibility result) and policy implication (necessity of distri...

Human-in-the-loop governance is a practical lever to align GenAI productivity with environmental efficiency.

Interpretation of the experimental results: findings that certain prompt-based governance (operational constraints/decision rules) reduced footprint while preserving outputs, leading to the recommendation (argumentative claim).

medium positive On the Carbon Footprint of Economic Research in the Age of G... alignment between GenAI-assisted productivity and environmental efficiency via g...

Inference efficiency and system level optimisation are growing rapidly in the Green AI literature.

Temporal / thematic analysis of literature cited in the paper's mapping (asserted growth; no growth rates or counts provided in abstract).

medium positive On the Carbon Footprint of Economic Research in the Age of G... growth of specific research themes (inference efficiency, system-level optimisat...

As a consequence of these dynamics, 'algorithmic unions' (organised, coordinated resistance) may evolve organically as a survival strategy against over-optimized management systems.

Interpretation/implication drawn from the EGT model results (theoretical suggestion), not supported by empirical observations in the paper.

medium positive THE RED QUEEN in the DASHBOARD: CO-EVOLUTIONARY DYNAMICS of ... emergence / viability of organized coordinated resistance ('algorithmic unions')

The analysis implies specific implications for healthcare leadership and procurement (e.g., procurement and leadership should consider incentive and risk-allocation effects, not just task optimisation).

Authors' conclusions/recommendations drawn from the theoretical analysis and typology (prescriptive claim in the paper; no empirical evaluation reported in the abstract).

medium positive Incentives, Equilibria, and the Limits of Healthcare AI: A G... recommended focus of healthcare leadership and procurement decisions

AI enhances innovation and productivity, even though it currently contributes to higher CO2 emissions.

Statement in the study linking AI adoption to improvements in innovation and productivity alongside the empirical finding of higher CO2 emissions (based on the same cross-country panel analysis over 2000–2023).

medium positive Artificial Intelligence: A Blessing or a Curse for Climate A... innovation and productivity

Intelligent manufacturing policies can generate economically meaningful benefits by improving firms’ sustainability performance and the credibility of ESG information, which are central to capital allocation and the effectiveness of green governance.

Synthesis/implication drawn from the empirical findings reported in the paper (positive effects on ESG ratings, reduced greenwashing, and lower ESG uncertainty).

medium positive Intelligent Manufacturing Demonstration Projects Driving Cor... sustainability performance and credibility of ESG information

AI-enabled ESG ratings, green innovation, ethical AI, RegTech, and explainable AI in finance are becoming highly influential in international financial markets.

Paper identifies these themes as emerging and influential based on trends in the reviewed literature and topical focus areas; no quantitative adoption metrics or sample sizes are provided in the excerpt.

medium positive Artificial intelligence in sustainable finance and Environme... influence/adoption of specific AI-related ESG themes in financial markets

Public Model Context Protocol (MCP) server repositories are the current predominant standard for agent tools.

Paper asserts MCP servers are the predominant standard and uses these repositories as the primary monitoring source.

medium positive How are AI agents used? Evidence from 177,000 MCP tools predominance of MCP servers as a standard for agent tools

Drawing on analysis of agentic investment firm operational models demonstrating 50-70% cost reductions while maintaining fiduciary standards.

Internal analysis/modeling of agentic investment firm operational models reported by the authors; paper states the 50–70% cost reduction result but provides no sample size or detailed empirical validation in the provided text.

medium positive STRENGTHENING FINANCIAL WORKFORCE COMPETITIVENESS: A CURRICU... operational costs of investment firms (cost reduction)

Using machine learning applied to news streams constitutes a practical method to augment existing fiscal surveillance tools.

Paper asserts practical applicability of ML + news for surveillance; presented as recommendation/claim rather than documented large-sample trial in the provided excerpt.

medium positive Research on the Construction of an AI-Driven Financial Regul... surveillance capability of fiscal monitoring systems

Incorporating news-based signals into machine-learning models can enhance regulatory practice by improving detection of potential fiscal instabilities.

Paper claims an empirical analysis and synthesizes findings linking news-derived signals and ML methods to improved regulatory monitoring; specific datasets, evaluation metrics, and sample sizes are not provided in the excerpt.

medium positive Research on the Construction of an AI-Driven Financial Regul... detection accuracy and timeliness of identifying fiscal instabilities

The framework offers a replicable model for governments and institutions seeking to proactively support high-potential innovations across sectors.

Paper asserts replicability and applicability to governments/institutions based on the described methods and outputs; no deployment case studies or empirical replication evidence reported in text provided.

medium positive Emerging Technologies Based on Large AI Models and the Desig... replicability and applicability of the framework for proactive policy support

A data-driven, foresight-based approach to policy design significantly enhances responsiveness, precision, and resource efficiency in science and technology governance.

Paper concludes this benefit based on its integrated framework, triangulation, Delphi/AHP validation and illustrative mapping; no quantified comparative metrics or experimental evaluation reported in text provided.

medium positive Emerging Technologies Based on Large AI Models and the Desig... effectiveness of data-driven, foresight-based policy design (responsiveness, pre...

These findings provide quantitative foundations for AI capability-threshold governance.

Synthesis/interpretation of model results and empirical validation described in the paper (recommendation/implication).

medium positive The enrichment paradox: critical capability thresholds and i... usefulness of model results for governance design

The paper introduces the Distributed Human Data Engine (DHDE), a socio-technical framework previously validated in biological crisis management, and adapts it for regional economic flow optimization.

Author statement describing the DHDE and asserting prior validation in biological crisis management; adaptation described in paper (methodological description).

medium positive Engineering Distributed Governance for Regional Prosperity: ... methodological/framework adaptation

The ACT represents the first open-source effort to consolidate data on Africa's evolving HPC landscape, aiming to encourage more transparency from local AI stakeholders and facilitate broader access for AI developers.

Authors' characterization of ACT as a novel, open-source consolidation; assertion based on literature/tools review performed by the authors and on the tool's stated goals.

medium positive Take the Train: Africa at the Crossroad of Modern AI transparency and access to HPC resources for AI developers

The results contribute to literature arguing that cloud-based GenAI is a source of enterprise value creation rather than merely an experimental technology.

Paper's stated addition to the existing literature based on the combined empirical and theoretical findings.

medium positive Measuring Business ROI of Generative AI Adoption on Azure Cl... enterprise value creation via GenAI

Orchestrated systems of smaller, domain-adapted models can mathematically outperform frontier generalist models in most institutional deployment environments.

Formal conditions and comparative analysis derived in the paper plus referenced/claimed empirical support across several domains (frontier lab dynamics, alignment evolution, sovereign AI pressures).

medium positive Punctuated Equilibria in Artificial Intelligence: The Instit... relative institutional performance (smaller domain models vs. frontier generalis...

Debiasing via metadata redaction and explicit instructions restores detection in all interactive cases and 94% of autonomous cases.

Intervention experiments in Study 2 where metadata redaction and explicit instructions were applied to interactive assistants (e.g., GitHub Copilot) and autonomous agents (e.g., Claude Code); reported full restoration for interactive and 94% for autonomous.

medium positive Measuring and Exploiting Confirmation Bias in LLM-Assisted S... restoration of vulnerability detection (post-intervention detection rate)

An increasing number of enterprises are using the label of artificial intelligence merely as a cosmetic embellishment in their annual reports (the phenomenon of 'AI washing' is spreading).

Framing/background claim in the paper's introduction/abstract; implied support from the semantic analysis of annual report texts across Chinese A-share firms over 2006–2024.

medium positive The Spillover Effects of Peer AI Rinsing on Corporate Green ... prevalence/trend of AI washing in annual reports

There are ethical imperatives of fairness and transparency in automated wealth management, and the paper proposes a roadmap toward sustainable and interpretable financial AI.

Normative analysis and proposed roadmap described in the paper; the excerpt does not provide operationalized fairness metrics, interpretability methods, or evaluation results.

medium positive Deep Reinforcement Learning for Dynamic Portfolio Optimizati... ethical compliance measures (fairness, transparency, interpretability) for autom...

In environments characterized by high-frequency data, non-linear dependencies, and stochastic market regimes, autonomous DRL agents can learn optimal sequential decision-making policies that offer a compelling alternative to static or rule-based allocation strategies.

Argument based on theoretical suitability of DRL for sequential decision problems and the paper's system-level investigation; excerpt does not report specific experimental datasets, sample sizes, benchmarks, or performance metrics.

medium positive Deep Reinforcement Learning for Dynamic Portfolio Optimizati... policy optimality / portfolio performance in complex market environments (implie...

The integration of Deep Reinforcement Learning (DRL) into portfolio management represents a significant evolution from classical Mean-Variance Optimization and modern econometric frameworks.

Conceptual comparison and synthesis presented in the paper; no empirical sample size or experimental results are provided in the excerpt to quantify the degree of improvement.

medium positive Deep Reinforcement Learning for Dynamic Portfolio Optimizati... methodological advancement in portfolio management (shift from static optimizati...

« Prev 1 2 3 … 107 108 109 … 137 138 Next »