Evidence (4793 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	402	112	67	480	1076
Governance & Regulation	402	192	122	62	790
Research Productivity	249	98	34	311	697
Organizational Efficiency	395	95	70	40	603
Technology Adoption Rate	321	126	73	39	564
Firm Productivity	306	39	70	12	432
Output Quality	256	66	25	28	375
AI Safety & Ethics	116	177	44	24	363
Market Structure	107	128	85	14	339
Decision Quality	177	76	38	20	315
Fiscal & Macroeconomic	89	58	33	22	209
Employment Level	77	34	80	9	202
Skill Acquisition	92	33	40	9	174
Innovation Output	120	12	23	12	168
Firm Revenue	98	34	22	—	154
Consumer Welfare	73	31	37	7	148
Task Allocation	84	16	33	7	140
Inequality Measures	25	77	32	5	139
Regulatory Compliance	54	63	13	3	133
Error Rate	44	51	6	—	101
Task Completion Time	88	5	4	3	100
Training Effectiveness	58	12	12	16	99
Worker Satisfaction	47	32	11	7	97
Wages & Compensation	53	15	20	5	93
Team Performance	47	12	15	7	82
Automation Exposure	24	22	9	6	62
Job Displacement	6	38	13	—	57
Hiring & Recruitment	41	4	6	3	54
Developer Productivity	34	4	3	1	42
Social Protection	22	10	6	2	40
Creative Output	16	7	5	1	29
Labor Share of Income	12	5	9	—	26
Skill Obsolescence	3	20	2	—	25
Worker Turnover	10	12	—	3	25

Productivity Remove filter

GUIDE consists of 67.5 hours of screen recordings from 120 novice user demonstrations with think-aloud narrations, across 10 software.

Paper's dataset description: dataset construction of screen recordings, number of demonstrations, duration, participant expertise (novice), and inclusion of think-aloud narrations across 10 software.

high positive GUIDE: A Benchmark for Understanding and Assisting Users in ... dataset size and composition (hours, number of demonstrations, software covered)

Automatic speech recognition (ASR) has shown increasing potential to assist in the transcription of endangered language data.

Background claim in the paper, referring to advances in ASR and prior work suggesting utility for endangered-language transcription; stated as motivation rather than a novel empirical finding in this paper.

high positive Automatic Speech Recognition for Documenting Endangered Lang... utility/potential of ASR for endangered-language transcription

We train an ASR model that achieves a character error rate as low as 15%.

Reported quantitative evaluation of the trained ASR model on the constructed Ikema dataset (character error rate = 15%). Exact evaluation protocol, test set size, and train/test split not provided in the abstract.

high positive Automatic Speech Recognition for Documenting Endangered Lang... character error rate

We construct a {\totaldatasethours}-hour speech corpus from field recordings.

Stated in paper as an outcome of the authors' data-collection and corpus-construction effort from field recordings; no numeric value resolved in the provided text (placeholder present).

high positive Automatic Speech Recognition for Documenting Endangered Lang... size of speech corpus (hours)

With calibrated oversight that aligns accountability to real-world risks, AI can secure the profession’s future.

Normative/prognostic claim in the Article (argument that appropriate governance will preserve or strengthen the legal profession).

high positive Rewired: Reconceptualizing Legal Services for the AI Age long-term resilience/stability of the legal profession

With calibrated oversight that aligns accountability to real-world risks, AI can improve service quality in legal services.

Normative/prognostic claim in the Article (argument that governance plus AI yields quality improvements). No empirical effect sizes reported in the excerpt.

high positive Rewired: Reconceptualizing Legal Services for the AI Age service quality of legal services

While the risks of AI are real, they must not eclipse the opportunity: with calibrated oversight that aligns accountability to real-world risks, AI can expand access to legal services.

Normative claim and projected benefit argued by the authors (theoretical/argumentative; no empirical evidence in excerpt).

high positive Rewired: Reconceptualizing Legal Services for the AI Age expansion of access to legal services

The framework provides a roadmap for coordinated response across educational institutions, government agencies, and industry to ensure workforce resilience and domestic leadership in the emerging agentic finance era.

Authors' proposed integrated roadmap (prescriptive recommendation; no empirical testing or outcome measurement reported in the provided text).

high positive STRENGTHENING FINANCIAL WORKFORCE COMPETITIVENESS: A CURRICU... workforce resilience and domestic leadership in agentic finance

We develop a comprehensive government policy framework including: 1) Federal AI literacy mandates for post-secondary business education; 2) Department of Labor workforce retraining programs with income support for displaced financial professionals; 3) SEC and Treasury regulatory innovations creating market incentives for workforce development; 4) State-level workforce partnerships implementing regional transition support; and 5) Enhanced social safety nets for workers navigating career transitions during the estimated 5-15 year transformation period.

Author-presented policy framework and recommendations (policy design proposals and an asserted 5–15 year transformation timeframe; no empirical evaluation reported).

high positive STRENGTHENING FINANCIAL WORKFORCE COMPETITIVENESS: A CURRICU... policy adoption and worker support measures during technological transition

We propose a multi-layered integration strategy for higher education encompassing: 1) Foundational AI literacy modules for all business students; 2) A specialized "Agentic Financial Planning" course with hands-on labs; 3) AI-augmented redesign of core courses (Investments, Portfolio Management, Ethics); 4) Interdisciplinary project-based learning with Computer Science; and 5) A governance and policy module addressing regulatory compliance (NIST AI RMF, SEC regulations).

Proposed curricular framework presented by the authors (recommendation/proposal, not empirically tested within the paper).

high positive STRENGTHENING FINANCIAL WORKFORCE COMPETITIVENESS: A CURRICU... student AI-related skills and preparedness for agentic finance roles

The ultimate competitive edge lies in an organization's ability to treat AI not as a standalone tool, but as a core component of sustainable, long-term corporate strategy.

Concluding normative claim in the paper; presented as an interpretation/synthesis rather than supported by cited empirical evidence in the abstract.

high positive The AI Advantage: Strategic Innovation and Global Expansion ... competitive advantage derived from integrating AI into corporate strategy

Successful global expansion is no longer predicated solely on physical presence but on the deployment of scalable, localized AI models that navigate diverse regulatory, linguistic, and cultural landscapes.

Argumentative claim in the paper describing a strategic determinant for global expansion; no empirical sample or quantified outcomes presented in the abstract.

high positive The AI Advantage: Strategic Innovation and Global Expansion ... drivers of successful global expansion (physical presence vs. localized AI deplo...

AI hyper-personalizes customer engagement.

Declarative claim in the paper about AI's effect on customer engagement personalization; no experimental or observational data reported in the abstract.

high positive The AI Advantage: Strategic Innovation and Global Expansion ... degree of personalization in customer engagement

AI acts as an internal engine for operational agility by compressing R&D cycles.

Claim made in the paper asserting R&D cycle compression due to AI; no empirical data, sample size or quantitative measures provided in the abstract.

high positive The AI Advantage: Strategic Innovation and Global Expansion ... length/duration of R&D cycles (time-to-iteration)

The strategic focus has transitioned from mere process automation to autonomous orchestration, where multi-agent systems independently manage complex, cross-border operations and real-time decision-making.

Analytic statement from the paper describing an observed/argued shift in strategic focus; no empirical methodology or sample reported.

high positive The AI Advantage: Strategic Innovation and Global Expansion ... shift in strategic focus from automation to autonomous orchestration via multi-a...

Organizations leverage agentic workflows and domain-specific intelligence to catalyse strategic innovation and facilitate global expansion in the digital era.

Conceptual claim in the paper describing how organizations use specific AI capabilities; no empirical design or sample described in the abstract.

high positive The AI Advantage: Strategic Innovation and Global Expansion ... use of agentic workflows and domain-specific models to drive innovation and glob...

The rapid evolution of Artificial Intelligence (AI) has shifted from a disruptive trend to the fundamental operating layer of the modern enterprise.

Statement/assertion in the paper (conceptual/positioning claim); no empirical method, sample size, or statistical analysis reported in the abstract.

high positive The AI Advantage: Strategic Innovation and Global Expansion ... role of AI in enterprise operations (from peripheral/disruptive to core/operatin...

A Metacognitive Co-Regulation Agent (in CRDAL) assists the Design Agent in metacognition to mitigate design fixation, thereby improving system performance for engineering design tasks.

Mechanistic claim supported by the paper's experimental results on the battery pack design problem showing CRDAL outperforming SRL and RWL; detailed measures of fixation reduction not provided in the excerpt.

high positive Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regul... reduction in design fixation / improvement in performance due to co-regulation

The CRDAL system navigated through the latent design space more effectively than both SRL and RWL.

Empirical analysis on the battery pack design task comparing latent-space trajectories/exploration between CRDAL, SRL, and RWL; details on how 'more effectively' was quantified and sample size are not provided in the excerpt.

high positive Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regul... quality/coverage of exploration in latent design space

The CRDAL system achieves better design performance without significantly increasing the computational cost compared to SRL and RWL.

Empirical claim based on experiments on the battery pack design problem comparing computational cost across CRDAL, SRL, and RWL; exact computational metrics and sample size not provided in the excerpt.

high positive Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regul... computational cost (efficiency/resource usage) of design-generation process

In the battery pack design problem examined here, the CRDAL system generates designs with better performance compared to a plain Ralph Wiggum Loop (RWL) and the metacognitively self-assessing Self-Regulation Loop (SRL).

Empirical comparison on a battery pack design task between CRDAL, SRL, and RWL reported in the paper; exact number of test instances or runs not stated in the excerpt.

high positive Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regul... design performance (battery pack designs)

We propose a novel Co-Regulation Design Agentic Loop (CRDAL), in which a Metacognitive Co-Regulation Agent assists the Design Agent in metacognition to mitigate design fixation.

Methodological contribution presented in the paper (proposed system architecture). No empirical sample size reported for the proposal itself.

high positive Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regul... proposed agent architecture (Co-Regulation Design Agentic Loop)

We propose a novel Self-Regulation Loop (SRL), in which the Design Agent self-regulates and explicitly monitors its own metacognition.

Methodological contribution presented in the paper (proposed system architecture). No empirical sample size reported for the proposal itself.

high positive Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regul... proposed agent architecture (Self-Regulation Loop)

Policy efficacy varies significantly across corporate profiles, with the strongest effects observed in non-state-owned enterprises, high-tech firms, and firms located in eastern regions.

Heterogeneity analyses reported in the study (subgroup analysis by ownership, technology intensity, and geographic region).

high positive The Impact of Digital Economy Pilot Zones on Corporate New Q... heterogeneous policy impact on corporate NQPF across firm subgroups

The estimated positive effect of the pilot zones on corporate NQPF is robust across a comprehensive battery of robustness and endogeneity tests.

Paper reports multiple robustness and endogeneity checks (details not provided in abstract) that reportedly do not overturn main findings.

high positive The Impact of Digital Economy Pilot Zones on Corporate New Q... robustness of estimated policy effect on NQPF

Mechanism analysis identifies three systemic transmission pathways for the policy: optimizing factor allocation, deepening digital technology empowerment, and promoting green innovation and sustainability.

Mechanism analysis reported in the study (methods not detailed in abstract) attributing the policy effect to three pathways.

high positive The Impact of Digital Economy Pilot Zones on Corporate New Q... mechanistic channels: factor allocation, digital technology empowerment, green i...

The pilot zones create an optimized 'digital environment' that underlies the positive impact on corporate NQPF.

Empirical analysis in the paper attributes improved corporate NQPF to an optimized digital environment created by the policy intervention; mechanism analysis referenced.

high positive The Impact of Digital Economy Pilot Zones on Corporate New Q... presence/quality of digital environment / organizational digital infrastructure

The DML approach flexibly controls for high-dimensional confounding variables and functional form misspecification, enabling highly rigorous causal inference compared with traditional linear models.

Methodological claim based on use of Double Machine Learning in the study (described as addressing high-dimensional confounders and misspecification).

high positive The Impact of Digital Economy Pilot Zones on Corporate New Q... quality of causal inference / methodological rigor

Establishment of China’s National Digital Economy Innovation and Development Pilot Zones significantly enhances corporate New Quality Productive Forces (NQPF).

Quasi-natural experiment using Double Machine Learning (DML) framework applied to A-share listed companies over 2015–2023; empirical results reported as statistically significant.

high positive The Impact of Digital Economy Pilot Zones on Corporate New Q... corporate New Quality Productive Forces (NQPF)

AlphaFold represents an 'oracle' breakthrough in AI for scientific discovery.

Cited as an example of an algorithmic breakthrough that changed a specific scientific subtask (protein structure prediction). The paper frames AlphaFold as a milestone in the history reviewed; no new experimental data presented.

high positive A Brief History of AI for Scientific Discovery: Open Researc... impact of AlphaFold on a scientific subtask (protein structure prediction)

Phase Three employs AI for comprehensive sensitivity analysis while humans provide strategic interpretation.

Descriptive claim about the third phase of the framework and its use in the paper's applied test; presented as the intended role split between AI (computational sensitivity tasks) and humans (interpretation).

high positive AI-Augmented Real Estate Underwriting: A Practical Framework... task_allocation

Phase One leverages AI for rapid market research aggregation and preliminary pro forma generation.

Descriptive claim about the first phase of the proposed three-phase framework as presented in the paper; conceptual rather than a separate empirical finding.

high positive AI-Augmented Real Estate Underwriting: A Practical Framework... task_allocation

The framework achieved seventy-one to ninety percent time reduction while maintaining analytical quality comparable to traditional methods.

Empirical result reported from the controlled ChatGPT-4 test on the single 150-unit scenario comparing time to complete underwriting tasks versus traditional methods.

high positive AI-Augmented Real Estate Underwriting: A Practical Framework... task_completion_time

This research develops and empirically validates a three-phase framework for AI-augmented multifamily underwriting through controlled testing with ChatGPT-4 using a standardized 150-unit development scenario in Seattle's Greenwood neighborhood.

Controlled testing described in paper: use of ChatGPT-4 on a single standardized 150-unit development scenario in Seattle Greenwood to evaluate the proposed three-phase framework.

high positive AI-Augmented Real Estate Underwriting: A Practical Framework... task_completion_time

Generative artificial intelligence demonstrates significant promise for efficiency gains across financial services.

Introductory assertion in paper; general statement about the potential of generative AI, not directly derived from the paper's controlled test.

high positive AI-Augmented Real Estate Underwriting: A Practical Framework... organizational_efficiency

Empirical findings demonstrate that digitalization significantly boosts efficiency and competitiveness of industrial production.

Correlation and regression analyses reported in the study linking digitalization measures to indicators of efficiency and competitiveness across levels of analysis.

high positive Digitalization and labor costs: efficiency of industrial ent... production efficiency and competitiveness

Digital technologies (automation, IIoT, ERP systems, AI applications) reduce nonproductive costs, increase per-worker output, and improve the cost-efficiency of production in Kazakhstani enterprises.

Case studies and real examples from named enterprises (Asia Auto, Karaganda Foundry and Engineering Plant, Eurasian Resources Group) presented in the article.

high positive Digitalization and labor costs: efficiency of industrial ent... per-worker output (and labor costs per unit of production / nonproductive costs)

The number of employees and working time have a positive but limited effect on labor productivity.

Results from the study's correlation and regression analysis comparing labor input measures (employee count and working time) with productivity outcomes.

high positive Digitalization and labor costs: efficiency of industrial ent... labor productivity

Digitalization is the key driver of labor productivity growth in Kazakhstan.

Empirical correlation and regression analysis reported in the study across enterprise, industry, and national economy levels.

high positive Digitalization and labor costs: efficiency of industrial ent... labor productivity

Opportunities arising from cyborg workflows include hyper-personalized narratives, democratized production, and ethical augmentation of underrepresented voices.

Forward-looking/interpretive claim in the paper describing potential benefits and opportunities; conceptual rather than empirically demonstrated in the excerpt.

high positive Cyborg Workflows Merging Human Judgment and Agentic AI for D... personalization, access to production, representation

Scalability is addressed via edge computing to support cyborg workflows.

Design/architectural claim in the paper mentioning edge computing as a scalability mechanism; no deployment-scale measurements reported in the excerpt.

high positive Cyborg Workflows Merging Human Judgment and Agentic AI for D... scalability/adoption feasibility

The proposed workflows include robust bias mitigation strategies.

Paper asserts bias mitigation approaches are included and demonstrated in case studies; no quantitative fairness metrics or evaluation details provided in the excerpt.

high positive Cyborg Workflows Merging Human Judgment and Agentic AI for D... bias reduction / fairness

Cyborg workflows produce enhanced creative output via iterative human–AI refinement.

Qualitative claim supported by case studies and examples presented in the paper (no quantitative creativity metrics or sample sizes reported in the excerpt).

high positive Cyborg Workflows Merging Human Judgment and Agentic AI for D... creative output

Empirical evaluations validate 25-60% improvements in key metrics.

Paper states empirical evaluation results with a 25–60% improvement range; specific metrics, methods, and sample sizes are not provided in the excerpt.

high positive Cyborg Workflows Merging Human Judgment and Agentic AI for D... key metrics (unspecified)

Case studies in content generation, news curation, and immersive production demonstrate efficiency gains of up to 3x in throughput.

Reported results from unspecified case studies described in the paper; numeric claim provided but case study sample sizes and methodological details are not reported in the excerpt.

high positive Cyborg Workflows Merging Human Judgment and Agentic AI for D... throughput

The paper proposes a comprehensive framework encompassing modular architectures, hybrid protocols, and real-time collaboration interfaces informed by cognitive science, AI engineering, and media studies.

Architectural and methodological proposal described in the paper (the claim is descriptive of the proposed system; no quantitative evaluation of the framework components provided).

high positive Cyborg Workflows Merging Human Judgment and Agentic AI for D... framework components (architecture, protocols, interfaces)

Cyborg workflows fuse human judgment with agentic AI autonomous systems capable of goal-directed planning and execution.

Conceptual description and framework proposed in the paper (no empirical sample or trial details reported).

high positive Cyborg Workflows Merging Human Judgment and Agentic AI for D... human-AI task coordination

RL-based AVs improve average fuel efficiency by about 1.86% at lower speeds (below 50 km/h) compared to the IDM.

Macroscopic-level fuel efficiency comparison between RL-based AV model and IDM in simulation, stratified by speed (<50 km/h). Number of simulation runs not stated.

high positive Macroscopic Characteristics of Mixed Traffic Flow with Deep ... average fuel efficiency at speeds < 50 km/h

RL-based AVs improve average fuel efficiency by about 28.98% at higher speeds (above 50 km/h) compared to the IDM.

Macroscopic-level fuel efficiency comparison between RL-based AV model and IDM in simulation, stratified by speed (>50 km/h). Number of simulation runs not stated.

high positive Macroscopic Characteristics of Mixed Traffic Flow with Deep ... average fuel efficiency at speeds > 50 km/h

Transitioning from fully human-driven to fully RL-controlled traffic can increase road capacity by approximately 7.52%.

Macroscopic simulation experiments producing Fundamental Diagrams comparing fully human-driven traffic to fully RL-controlled traffic. Exact number of simulation scenarios or replicates not provided in the claim text.

high positive Macroscopic Characteristics of Mixed Traffic Flow with Deep ... road capacity

« Prev 1 2 3 … 27 28 29 … 95 96 Next »