Evidence (13661 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	740	192	95	871	1945
Governance & Regulation	796	388	185	119	1512
Organizational Efficiency	765	186	123	82	1166
Technology Adoption Rate	610	227	121	95	1061
Research Productivity	409	121	56	331	928
Output Quality	464	174	58	47	743
Decision Quality	318	173	75	42	615
Firm Productivity	432	55	88	20	601
AI Safety & Ethics	214	273	65	33	589
Market Structure	175	165	120	24	489
Task Allocation	206	64	70	31	376
Skill Acquisition	161	57	57	16	291
Innovation Output	201	27	41	18	288
Fiscal & Macroeconomic	130	69	43	26	275
Employment Level	104	50	105	13	274
Consumer Welfare	116	62	42	11	231
Firm Revenue	149	45	26	3	223
Inequality Measures	43	120	49	6	218
Task Completion Time	164	29	8	12	214
Worker Satisfaction	89	60	20	12	181
Error Rate	69	89	9	2	169
Regulatory Compliance	74	67	14	4	159
Training Effectiveness	91	19	13	19	144
Wages & Compensation	77	33	25	6	141
Team Performance	86	17	27	9	140
Automation Exposure	49	50	22	12	136
Developer Productivity	91	17	14	5	128
Job Displacement	12	80	19	1	112
Hiring & Recruitment	51	7	8	3	69
Creative Output	31	16	7	2	57
Skill Obsolescence	5	43	6	1	55
Social Protection	27	16	8	2	53
Labor Share of Income	17	17	17	—	51
Worker Turnover	11	12	—	3	26
Industry	—	—	—	1	1

SPEC achieves 89% overall accuracy, while appropriately deferring when evidence is insufficient.

Empirical evaluation of SPEC reported in paper: overall accuracy reported as 89% and behavior of proper deferral on insufficient-evidence cases.

high positive Learning When Not to Decide: A Framework for Overcoming Fact... overall accuracy and appropriate deferral on insufficient-evidence cases

We introduce SPEC (Structured Prompting for Evidence Checklists), a structured framework requiring explicit identification of missing information before any determination.

Methodological contribution described in paper: new prompting/framework (SPEC) that enforces explicit missing-information identification prior to decision.

high positive Learning When Not to Decide: A Framework for Overcoming Fact... framework implementation that forces evidence-checklist and missing-information ...

Through a collaboration with the Colorado Department of Labor and Employment, we secured access to official training materials and guidance to design a novel benchmark that systematically varies information completeness.

Methodological description in paper: collaboration with state agency and dataset/benchmark construction using official training materials and guidance.

high positive Learning When Not to Decide: A Framework for Overcoming Fact... creation of a benchmark varying information completeness

Long-term prospects of agentic AI include catalyzing accelerated innovation in physical design via autonomous algorithm discovery, continuous tool improvement, and closed-loop learning from large design corpora.

Forward-looking conclusion in the paper; framed as the authors' projection based on survey synthesis rather than as an empirically demonstrated outcome in the abstract.

high positive Invited: Agentic AI for Physical Design R&D: Status and Pros... autonomous algorithm discovery, continuous tool improvement, closed-loop learnin...

Interfaces between agentic systems and traditional EDA frameworks are a key area of focus and enable tighter integration of agent capabilities into existing design workflows.

Survey highlights interfaces between agents and EDA frameworks as a focus area; claim is descriptive of research direction rather than reporting empirical outcomes.

high positive Invited: Agentic AI for Physical Design R&D: Status and Pros... development and importance of interfaces between agents and EDA frameworks

Autonomous agents can explore heuristic spaces for placement, routing, and partitioning, enabling autonomous exploration of design heuristics.

Presented as an emphasized capability/area of research in the survey; the abstract asserts this possibility but does not report empirical benchmarks or sample sizes.

high positive Invited: Agentic AI for Physical Design R&D: Status and Pros... autonomous exploration of heuristic spaces (placement, routing, partitioning)

Tool-integrated agents can be used for algorithm evolution, debugging, and workflow automation in physical design R&D.

Paper emphasizes this as a primary area of application in the survey; rationale and examples are discussed but no quantitative trial sizes are given in the abstract.

high positive Invited: Agentic AI for Physical Design R&D: Status and Pros... use of agents for algorithm evolution, debugging, and workflow automation

Agentic AI systems can comprehend user specifications, modify code, run EDA tools, analyze results, perform multi-step reasoning, and iteratively refine design heuristics—unlike earlier ML uses that focused narrowly on prediction or optimization subroutines.

Descriptive claim in the paper contrasting agentic AI capabilities with earlier ML approaches; presented as an overview of functional capabilities rather than empirical measurement.

high positive Invited: Agentic AI for Physical Design R&D: Status and Pros... breadth of tasks agentic AI systems can perform (spec comprehension, code modifi...

Recent advances in large language models (LLMs) and tool-using autonomous agents present new opportunities for accelerating research and development in physical design.

Stated as a central thesis in the paper's abstract/survey; based on the authors' synthesis of recent advances and emerging applications (no empirical sample or quantified evaluation reported in the abstract).

high positive Invited: Agentic AI for Physical Design R&D: Status and Pros... acceleration of research and development in physical design

The framework is applied to Canada's 2025-2026 national AI Strategy consultation with n = 5,253 respondents across two independent policy topics.

Empirical application reported in the paper; dataset description gives sample size and two policy topics.

high positive Participatory provenance as representational auditing for AI... sample and context for empirical evaluation

This paper introduces 'participatory provenance': a measurement framework grounded in optimal transport theory, causal inference and semantic analysis that tracks how individual public submissions are transformed, filtered or lost through AI-mediated summarization.

Methodological contribution described in the paper (framework design combining optimal transport, causal inference, semantic analysis).

high positive Participatory provenance as representational auditing for AI... ability to track transformations/filtration/loss of individual submissions

Local governments should develop coordinated AI policy mixes, align differentiated policy pathways with regional conditions, and prioritize technology R&D support, talent cultivation and collaboration, and application demonstration and promotion to sustain long-term regional competitiveness.

Authors' policy recommendations derived from the fsQCA findings and interpretation of which conditions are recurrent/core across configurations.

high positive How Can Artificial Intelligence Policies Promote the Sustain... regional science and technology industrial competitiveness (policy recommendatio...

Technology R&D support, talent cultivation and collaboration, and application demonstration and promotion are the most recurrent core policy conditions across the identified configurations.

Frequency/core-condition analysis within the fsQCA configurations reported by the authors showing these three policy instruments repeatedly appear as core conditions.

high positive How Can Artificial Intelligence Policies Promote the Sustain... regional science and technology industrial competitiveness

The study identifies three driving pathways to sustained competitiveness: (supply and demand)-environmental resonance; demand-driven (supply-environmental) assurance; and supply–demand complementarity, which together cover five specific configurations.

Reported fsQCA solution paths (three aggregated driving pathways and five specific configurations) derived from the analysis of provincial AI policy instruments.

high positive How Can Artificial Intelligence Policies Promote the Sustain... regional science and technology industrial competitiveness

Sustained competitiveness is achieved through multiple equivalent configurations of policy instruments (i.e., policy instrument combinations rather than single instruments).

fsQCA results reported in the paper showing multiple configurations (solution paths) that are associated with high regional competitiveness.

high positive How Can Artificial Intelligence Policies Promote the Sustain... regional science and technology industrial competitiveness

AI systems currently provide more consistent fraud warnings than lay humans in an identical advisory role.

Aggregate comparison from the preregistered experiment showing humans had nonzero endorsement and higher suppression rates while all tested LLMs showed 0% endorsements and lower suppression under pressure (human n=1,201; AI conversations n=3,360).

high positive Large Language Models Outperform Humans in Fraud Detection a... consistency of fraud warnings between advisors (LLMs vs. lay humans)

Human advisors endorsed fraudulent investments at baseline rates of 13-14%.

Human benchmark of 1,201 participants run in the preregistered experiment; reported baseline endorsement rates for fraudulent scenarios.

high positive Large Language Models Outperform Humans in Fraud Detection a... baseline endorsement rate of fraudulent investments by human advisors

Motivated investor framing did not suppress AI fraud warnings; if anything, it marginally increased them.

Preregistered experiment across seven leading LLMs and twelve investment scenarios; 3,360 AI advisory conversations analyzed comparing motivated vs. baseline investor framings.

high positive Large Language Models Outperform Humans in Fraud Detection a... frequency of AI fraud warnings under motivated investor framing

Future research should prioritize hybrid human-AI decision frameworks, robust evaluation in diverse emerging market contexts, and development of regulatory technology solutions that balance innovation with systemic stability.

Recommendations and Conclusion section derived from identified gaps and themes in the scoping review.

high positive AI-Driven Financial Risk Management and Decision Intelligenc... recommended research and policy priorities

AI-driven approaches show substantial promise for enhancing financial risk management in emerging markets, particularly in credit scoring, fraud detection, and market forecasting.

Overall conclusion synthesizing reported improvements and application areas across the 64 studies; qualitative and quantitative findings summarized by authors.

high positive AI-Driven Financial Risk Management and Decision Intelligenc... effectiveness/promise of AI in specific financial risk domains (credit scoring, ...

Neural networks and ensemble methods demonstrate superior predictive accuracy compared to traditional methods.

Synthesis of comparative results across included studies indicating better predictive performance of neural networks and ensemble methods in market prediction, credit scoring, and related tasks.

high positive AI-Driven Financial Risk Management and Decision Intelligenc... predictive accuracy of neural networks and ensemble methods

Performance improvements (of AI methods) range from 15% to 35% over traditional methods.

Aggregate statement in Results summarizing reported performance improvements across reviewed studies (no single-trial RCT; based on comparative performance metrics reported by included studies).

high positive AI-Driven Financial Risk Management and Decision Intelligenc... predictive/performance improvement (accuracy/performance metrics) of AI methods ...

This work provides a replicable methodology for auditing institutional ML systems and highlights the importance of evaluating construct validity alongside statistical fairness.

Paper presents the ASP-HEI Cycle-informed replica-based audit method and argues for assessing construct validity in addition to statistical fairness metrics.

high positive Fairness Audits of Institutional Risk Models in Deployed ML ... availability/replicability of audit methodology and emphasis on construct validi...

We evaluate disparities by gender, age, and residency status across the full pipeline (training data, model predictions, and post-processing) using standard fairness metrics.

Paper reports conducting evaluation across the full ML pipeline using standard fairness metrics disaggregated by gender, age, and residency status.

high positive Fairness Audits of Institutional Risk Models in Deployed ML ... fairness metrics (disparities) across pipeline stages

We present a replica-based audit of a deployed Early Warning System (EWS), replicating its model using institutional training data and design specifications.

Statement in paper describing a replica-based audit using Centennial College's institutional training data and the system's design specifications; multi-year collaboration and prior ethnographic work informing approach.

high positive Fairness Audits of Institutional Risk Models in Deployed ML ... successful replication of the deployed EWS

Under these conditions (alignment of forces and AI-driven ideation cost reductions), PIM offers a framework for organising governed discovery in real time and provides the methodological foundation for later applied work.

The paper presents PIM as a proposed framework and positioning statement for future applied research and implementations (theoretical proposal; no applied trials reported).

high positive Probabilistic Innovation Methodology: A Scientific Methodolo... feasibility of using PIM to organise real-time governed discovery

Organised attacks on complex problems can generate an epistemic mode transition: a shift from predominantly Knightian uncertainty toward probabilistically characterisable innovation dynamics as relevant structures become more visible, decomposed, coordinated, and testable.

The paper states and formalises this methodological claim within PIM as a central proposition (theoretical argumentation; no empirical validation reported).

high positive Probabilistic Innovation Methodology: A Scientific Methodolo... degree of uncertainty characterization (Knightian vs probabilistic)

When problem-relevant causal, informational, and coordinative forces become sufficiently aligned, the epistemic character of search changes and open-ended uncertainty can be progressively transformed into structured probabilistic search.

The claim is presented as the central theoretical argument and formalised within the PIM conceptual framework (theoretical/model-based argumentation; no empirical sample).

high positive Probabilistic Innovation Methodology: A Scientific Methodolo... epistemic character of search (shift from Knightian uncertainty to probabilistic...

The same user study (n=32) reports improvements in subjective measures including fluency and user preference for RAPIDDS over non-adaptive systems.

User study (n=32) reporting subjective questionnaire/ratings (fluency, preference) comparing RAPIDDS vs non-adaptive baselines.

high positive Multi-Cycle Spatio-Temporal Adaptation in Human-Robot Teamin... subjective fluency and user preference

A user study (n=32) shows significant plan improvement compared to non-adaptive systems across objective metrics such as efficiency and proximity.

User study reported in paper with sample size n=32 comparing RAPIDDS to non-adaptive systems on objective metrics (efficiency, proximity); significance claimed.

high positive Multi-Cycle Spatio-Temporal Adaptation in Human-Robot Teamin... efficiency and proximity (objective plan metrics)

An ablation study in simulation and a physical robot scenario demonstrates the importance of dual (task + motion) adaptation.

Ablation experiments reported in paper (simulation and physical robot experiments comparing full RAPIDDS to ablated variants).

high positive Multi-Cycle Spatio-Temporal Adaptation in Human-Robot Teamin... plan performance when removing components (effect of dual adaptation)

RAPIDDS jointly adapts task schedules and steers diffusion models of robot motions to maximize efficiency and minimize proximity accounting for individualized models.

Algorithmic method described in paper combining schedule optimization with motion steering (method section).

high positive Multi-Cycle Spatio-Temporal Adaptation in Human-Robot Teamin... efficiency and proximity of joint plans

In the AI era, digital sovereignty is more plausibly pursued through institutionally governed interdependence than through technological autonomy.

Normative/conclusive argument presented by the paper (theoretical recommendation). This is an argumentative conclusion rather than an empirically demonstrated finding in the provided text.

high positive Digital Sovereignty in the Global Cognitive-Informational Or... preferred strategy for pursuing digital sovereignty (governed interdependence vs...

The sovereign SLM+RAG configuration is discussed as one possible operational pathway through which the Governance Membrane architecture may be instantiated in contexts where embedded-mode governance is feasible.

Specific implementation pathway proposed/discussed by the authors (design suggestion). No empirical testing or sample information provided in the supplied text.

high positive Digital Sovereignty in the Global Cognitive-Informational Or... feasibility and instantiation of an SLM+RAG sovereign configuration for embedded...

As a secondary, design-oriented contribution, the paper proposes the Governance Membrane as a reference architecture for operationalizing the Governed Interdependence paradigm, and introduces the Normative Compliance Model, the Infrastructure Status Index, and the Cognitive Dependence Index as complementary instruments for normative alignment and governance calibration.

Design-oriented conceptual proposal described in the paper (framework/instrument design). No empirical evaluation or sample details reported in the provided text.

high positive Digital Sovereignty in the Global Cognitive-Informational Or... existence of reference architecture and governance instruments for aligning and ...

The paper develops the Governed Interdependence paradigm, which reconceptualizes digital sovereignty as the institutional capacity to govern structured participation in globally distributed AI infrastructures rather than to achieve full technological autonomy.

Primary theoretical contribution described in the paper (conceptual/model development). This is a proposed framework introduced by the authors rather than an empirically validated result.

high positive Digital Sovereignty in the Global Cognitive-Informational Or... conceptualization of digital sovereignty and institutional governance capacity

The paper provides firm-level empirical evidence from an underexplored emerging market context (Nigerian listed firms) on the relationship between AI adoption in financial reporting and audit quality.

Study sample and context are Nigerian listed firms; empirical analyses (content analysis, archival audit data, SEM) reported in the paper.

high positive Artificial Intelligence Adoption in Financial Reporting and ... contextual evidence (country-level / sample scope) for AI adoption effects

The study operationalizes AI adoption using a disclosure-based AI adoption index, representing a methodological advancement for measuring firm-level AI adoption in financial reporting.

Content analysis of corporate annual reports used to construct a disclosure-based AI adoption index; index applied in SEM analysis.

high positive Artificial Intelligence Adoption in Financial Reporting and ... AI adoption (measurement / adoption index)

The positive relationship between AI adoption and audit quality is partially mediated by improvements in internal control quality.

SEM mediation analysis including internal control quality as a mediator; internal control quality measured through disclosure/content analysis and related archival indicators; audit quality captured via restatements and audit fees.

high positive Artificial Intelligence Adoption in Financial Reporting and ... audit quality (mediated by internal control quality)

The positive relationship between AI adoption and audit quality is partially mediated by improvements in reporting transparency.

SEM mediation analysis including a reporting transparency measure derived from content analysis of annual reports; archival audit data used for audit quality indicators.

high positive Artificial Intelligence Adoption in Financial Reporting and ... audit quality (mediated by reporting transparency)

AI adoption is positively associated with audit quality in Nigerian listed firms.

Mixed-method quantitative design combining content analysis of corporate annual reports (to construct a disclosure-based AI adoption index) and archival audit data; Structural Equation Modeling (SEM) used to test the direct relationship. Audit quality modeled as a latent construct reflected by financial restatements and audit fees.

high positive Artificial Intelligence Adoption in Financial Reporting and ... audit quality (latent construct reflected by financial restatements and audit fe...

Sustainable development outcomes in MENA economies are driven not only by technology adoption but by the interaction between digital infrastructure, AI, and institutional readiness.

Regression models including interaction terms between digital transformation, AI measures, and indicators of institutional readiness within the System GMM analysis.

high positive Digital Transformation, AI Efficiency, and Sustainable Devel... sustainable development

There is significant regional heterogeneity: Gulf Cooperation Council (GCC) countries exhibit stronger effects of digital transformation and AI on sustainable development than non-GCC MENA economies.

Subgroup/interaction analyses by region (GCC vs non-GCC) within the System GMM framework reported differential coefficients.

high positive Digital Transformation, AI Efficiency, and Sustainable Devel... sustainable development

Artificial intelligence (AI) has a positive but weaker impact on sustainable development relative to digital transformation, reflecting its complementary and maturity-dependent role within the digital ecosystem.

Same System GMM regressions on panel of MENA economies (2010–2023) that include measures of AI and digital transformation; reported positive but smaller coefficient for AI.

high positive Digital Transformation, AI Efficiency, and Sustainable Devel... sustainable development

Digital transformation is the primary driver of sustainable development in MENA economies, exerting a stronger and more consistent effect than AI.

Dynamic panel data analysis of MENA economies (2010–2023) using System GMM; reported comparative effect sizes of digital transformation vs. AI in regression results.

high positive Digital Transformation, AI Efficiency, and Sustainable Devel... sustainable development

In the ICT industry, Tobin's Q significantly increased following AI adoption (heterogeneous positive effect).

Subgroup/heterogeneity analysis within the main sample (KOSDAQ firms 2018–2025), estimating the post-adoption effect of AI on Tobin's Q in firms classified as ICT.

high positive The Dynamic Causal Effects of Corporate AI Adoption on Profi... Tobin's Q (market value) in ICT-industry firms

The authors propose corresponding analytical extensions to the framework to address the three structural breaks in agentic systems.

Paper presents proposed analytical extensions (methodological proposals) tied to each identified structural break.

high positive Governed Auditable Decisioning Under Uncertainty: Synthesis ... availability of proposed analytical extensions for governance framework

Cross-architecture comparison reveals a governance coverage gradient: deterministic rule engines achieve full DES-property fillability.

Analytic cross-architecture comparison reported in the paper (comparative analysis across four architectures); deterministic rule engines identified as achieving 'full' fillability of DES-properties.

high positive Governed Auditable Decisioning Under Uncertainty: Synthesis ... DES-property fillability (completeness of governance evidence coverage)

The paper synthesizes an operational governance evidence framework composed of: structural accountability collapse diagnostics, decision trace schemas, evidence sufficiency measurement, and label-free monitoring, integrated into a chain.

Methodological contribution: authors construct and present a synthesized framework from those four components (conceptual/analytical synthesis).

high positive Governed Auditable Decisioning Under Uncertainty: Synthesis ... presence and structure of an operational governance evidence framework

The Barcelona Declaration offers a promising forum for boundary governance.

Policy recommendation pointing to an existing initiative (Barcelona Declaration) as a suitable forum; stated without empirical evaluation in the excerpt.

high positive Market Dynamics, Governance and Open Research Metadata in th... suitability of the Barcelona Declaration as a forum for boundary governance

« Prev 1 2 3 … 134 135 136 … 273 274 Next »