Evidence (2954 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	369	105	58	432	972
Governance & Regulation	365	171	113	54	713
Research Productivity	229	95	33	294	655
Organizational Efficiency	354	82	58	34	531
Technology Adoption Rate	277	115	63	27	486
Firm Productivity	273	33	68	10	389
AI Safety & Ethics	112	177	43	24	358
Output Quality	228	61	23	25	337
Market Structure	105	118	81	14	323
Decision Quality	154	68	33	17	275
Employment Level	68	32	74	8	184
Fiscal & Macroeconomic	74	52	32	21	183
Skill Acquisition	85	31	38	9	163
Firm Revenue	96	30	22	—	148
Innovation Output	100	11	20	11	143
Consumer Welfare	66	29	35	7	137
Regulatory Compliance	51	61	13	3	128
Inequality Measures	24	66	31	4	125
Task Allocation	64	6	28	6	104
Error Rate	42	47	6	—	95
Training Effectiveness	55	12	10	16	93
Worker Satisfaction	42	32	11	6	91
Task Completion Time	71	5	3	1	80
Wages & Compensation	38	13	19	4	74
Team Performance	41	8	15	7	72
Hiring & Recruitment	39	4	6	3	52
Automation Exposure	17	15	9	5	46
Job Displacement	5	28	12	—	45
Social Protection	18	8	6	1	33
Developer Productivity	25	1	2	1	29
Worker Turnover	10	12	—	3	25
Creative Output	15	5	3	1	24
Skill Obsolescence	3	18	2	—	23
Labor Share of Income	7	4	9	—	20

Human Ai Collab Remove filter

Better contestability may reduce litigation and regulatory frictions if decisions are transparently defensible.

Speculative legal-economic claim; no case studies or empirical legal analysis provided.

low positive Argumentative Human-AI Decision-Making: Toward AI Agents Tha... frequency/cost of litigation and regulatory disputes post-adoption of contestabl...

New service layers may emerge (argumentation-as-a-service, audit firms, explanation certification, human-in-the-loop orchestration platforms).

Speculative market/industry evolution claim based on analogous tech-service cretions; no empirical evidence.

low positive Argumentative Human-AI Decision-Making: Toward AI Agents Tha... emergence and market size of new service verticals around argumentative AI

New metrics are needed to value resilience (robustness to out-of-distribution events, graceful degradation) in procurement and contracting; performance-based contracts and regulated minimums for oversight mode selection can help align incentives.

Prescriptive recommendation based on gaps identified in procurement and contracting practice; conceptual proposal without empirical testing.

low positive Resilience Meets Autonomy: Governing Embodied AI in Critical... existence and use of resilience metrics in procurement/contracts and resulting a...

Demand will grow for tools and services that enable oversight (auditability, explainability, safe fallbacks), creating markets for verification, certification, safety middleware, and human-in-the-loop platforms.

Market-structure and demand-side reasoning based on the proposed governance needs; forecast-style projection without empirical market-data analysis.

low positive Resilience Meets Autonomy: Governing Embodied AI in Critical... market growth for oversight-enabling products and services (demand, number of ve...

Allocation decisions should be explicit, auditable, and adaptive — with provisions for overriding, fallbacks, and graceful degradation during unanticipated conditions.

Normative recommendation based on safety and accountability principles combined with crisis-management practices; argued via conceptual analysis and illustrative design features.

low positive Resilience Meets Autonomy: Governing Embodied AI in Critical... auditability, adaptability, and existence of override/fallback mechanisms in dep...

Collaborative VR features can change team workflows (remote, synchronous inspection sessions), potentially lowering coordination costs across geographically distributed teams.

Paper lists collaborative multi-user sessions as a planned capability and posits organizational effects; no user studies or measurements of coordination cost savings presented.

low positive iDaVIE v1.0: A virtual reality tool for interactive analysis... coordination costs / team workflow efficiency in distributed teams

Public funding for shared VR-capable data-exploration infrastructure could yield high leverage by improving returns on large observational investments.

Policy recommendation deriving from the platform and ROI arguments in the paper; no cost-benefit analysis or quantified ROI provided.

low positive iDaVIE v1.0: A virtual reality tool for interactive analysis... policy leverage (ROI) from funding shared VR infrastructure

Using iDaVIE increases the usable fraction of large observational datasets by improving QC and annotation throughput, thereby raising returns to telescope investments and downstream AI efforts.

This is an inferred implication in the paper (returns-to-scale/platform effects) based on improved QC/annotation throughput; no empirical measurement of usable-fraction increases provided.

low positive iDaVIE v1.0: A virtual reality tool for interactive analysis... usable fraction of observational datasets and downstream value for AI/modeling

Higher-quality labels produced via immersive inspection can reduce label noise and lower required training-data sizes for a target ML performance level.

Paper presents this as an implication/expected outcome based on improved annotation quality from immersive inspection; no empirical ML training experiments or quantitative reductions reported.

low positive iDaVIE v1.0: A virtual reality tool for interactive analysis... label noise level and required training-data size for target model performance

iDaVIE demonstrably reduces cognitive load for multidimensional-data tasks compared with 2D-slice inspection.

Paper asserts reduced cognitive load and faster, more intuitive exploration as an aim and reported outcome; no formal user-study metrics, sample size, or statistical analysis provided.

low positive iDaVIE v1.0: A virtual reality tool for interactive analysis... cognitive load (mental effort) for multidimensional-data inspection

The inverse-specification reward offers a domain-agnostic, holistic metric for fidelity to user intent and is recommended for measurement of model value/service quality.

Method introduces inverse-specification reward and asserts domain-agnostic applicability; recommendation based on its conceptual ability to recover briefs as fidelity measure (not necessarily validated across many domains).

low positive Learning to Present: Inverse Specification Rewards for Agent... Utility of inverse-specification recovery accuracy as a fidelity metric (concept...

High-quality automated slide generation has potential to reduce time spent on business presentation creation and produce productivity gains with partial substitution of routine creative/knowledge-worker tasks.

Empirical demonstration of near-SOTA automated slide generation capability on 48 briefs; domain-level economic implication extrapolated from performance improvements.

low positive Learning to Present: Inverse Specification Rewards for Agent... Potential time savings/productivity gains (not directly measured in the study)

Economic agents and risk models that integrate LLM outputs should weight inferences more heavily in structured domains (capacity estimates, trade flows, sanctions impact) and downweight or cross-validate politically ambiguous predictions.

Implication drawn from domain heterogeneity in model performance observed in the study (better structured-domain performance, weaker political forecasting).

low positive When AI Navigates the Fog of War recommended weighting/usage strategy for LLM-derived inputs in economic risk mod...

Investment in data quality and feature engineering yields tangible predictive gains for workforce performance models.

Paper emphasizes use of engineered features capturing engagement dynamics and learning trends and reports better model performance relative to baseline; however, no isolated ablation study quantifying the sole contribution of data-quality investments is reported in the summary.

low positive Adoption of AI-Based HR Analytics and Its Impact on Firm Pro... Predictive performance gains attributable to data quality/feature engineering (i...

Tools that improve detection or quantification may reduce downstream costs from missed diagnoses or unnecessary follow-ups, improving cost-effectiveness in some scenarios.

Economic modeling and limited observational analyses that extrapolate diagnostic improvements to downstream resource use; direct empirical cost-effectiveness studies are scarce.

low positive Human-AI interaction and collaboration in radiology: from co... downstream healthcare utilization (additional tests, treatments), cost per diagn...

The metacognitive reliability metric can reduce adoption risk for purchasers by providing transparent error-risk assessments and enabling performance-based autonomy thresholds.

Conceptual claim supported by the existence of an empirical confidence metric from the recursive meta-model and discussion of procurement/decision-making implications; not empirically tested with purchasers or procurement outcomes.

low positive Human Autonomy Teaming and AI Metacognition in Maritime Thre... adoption risk (qualitative or procurement decision proxies)

HACL/CS supports human trust and situational awareness.

Human factors measured with trust and situational awareness questionnaires in the simulation; summary reports supportive effects on trust and situational awareness but lacks sample-size/statistical detail.

low positive Human Autonomy Teaming and AI Metacognition in Maritime Thre... self-reported trust and situational awareness scores

Intelligent turn-level assignment can reduce costly human attention to only high-value moments, improving overall system productivity.

Conceptual implication from the assignment-layer design and empirical trade-offs reported; presented as an advantage in the paper rather than a directly measured economic productivity study.

low positive Hierarchical Reinforcement Learning Based Human-AI Online Di... distribution of human attention / system productivity (conceptual, not directly ...

HADT demonstrates a concrete way to substitute expensive human diagnostic labor with AI assistance while preserving high accuracy, implying reductions in marginal cost per consultation.

Inference drawn in the paper's implications section based on reported reductions in required human effort and maintained diagnostic accuracy (economic claim extrapolating from experimental results; not directly measured as cost in experiments).

low positive Hierarchical Reinforcement Learning Based Human-AI Online Di... implied marginal cost per consultation (not directly measured)

Organizational norms and UX influence adoption rates and diffusion of AI: social calibration processes at the team level matter for adoption beyond individual cost–benefit calculations.

Reported by interviewees (N=40) as factors shaping whether and how teams incorporated AI into routines; integrated into theoretical implications for diffusion modeling.

low positive AI in project teams: how trust calibration reconfigures team... AI adoption/diffusion rates at team/organization level

Well-calibrated trust tends to encourage AI being used as a complement to human labor (augmentation), increasing effective productivity; miscalibration (over- or under-trust) can lead to productivity losses.

Inferential claim drawn from interviewees' accounts of when teams appropriately relied on AI (augmentation) versus when inappropriate reliance or avoidance occurred; supported by thematic interpretation rather than quantitative measurement.

low positive AI in project teams: how trust calibration reconfigures team... productive use of AI (complementarity vs substitution) and effective productivit...

Policymakers should support standards for auditability, human‑in‑the‑loop thresholds and training subsidies to reduce coordination failures and make the social benefits of AI adoption more widely shared.

Normative policy recommendation derived from the paper’s analysis of risks, governance needs and distributional concerns; not empirically validated within the paper.

low positive Symbiarchic leadership: leading integrated human and AI cybe... adoption of standards; breadth of social benefits; coordination failure reductio...

Organisations will invest more in training for AI‑related sensemaking, trust calibration and governance competencies; returns to such training should be evaluated relative to investments in model quality.

Prescriptive inference from the framework and human‑capital theory; supported by referenced literature but not empirically tested in this paper.

low positive Symbiarchic leadership: leading integrated human and AI cybe... training investment levels; returns on training; comparative returns vs model in...

Explicit comparative‑advantage allocation will shift the composition of tasks across humans and AI, altering demand for routine versus non‑routine skills and potentially increasing demand for high‑level judgement, oversight and sensemaking skills.

Projected labour‑market implication based on theoretical reasoning and prior literature on task‑based skill demand; not empirically estimated in the paper.

low positive Symbiarchic leadership: leading integrated human and AI cybe... task composition; demand for routine vs non‑routine skills; demand for oversight...

Operationalising the four symbiarchic practices through updated HR systems lets firms capture AI‑enabled productivity gains without eroding trust, ethics or employee well‑being.

Normative claim based on theoretical synthesis and managerial prescription; no empirical testing or field data presented in the paper.

low positive Symbiarchic leadership: leading integrated human and AI cybe... AI‑enabled productivity gains; employee trust; ethical outcomes; employee well‑b...

AI tools complement sensory expertise and design thinking, shifting skill demand toward interdisciplinary competencies (e.g., computational rheology, psychophysics, cultural analytics).

Reasoned inference from technology literature and skill-complementarity theory; literature synthesis but no labor-market empirical analysis provided.

low positive At the table with Wittgenstein: How language shapes taste an... demand for interdisciplinary skills in food R&D and complementarity between AI t...

The research establishes the theory of performance management by developing operational measurement solutions for companies going through workplace redesign due to AI.

Authors claim theoretical contribution and provision of operational measurement solutions based on the proposed three-dimensional model and the empirical patterns observed in the 2022–2024 LinkedIn and Indeed datasets; no external validation or implementation evidence reported in the summary.

low positive Reconstruction of knowledge worker performance evaluation sy... operational performance-measurement solutions and theoretical framing for perfor...

By integrating psychological trust factors with cognitive capability optimisation, this model offers actionable insights for knowledge management practitioners implementing AI‑augmented decision systems while advancing theoretical understanding of human–AI collaboration effectiveness.

Integrative theoretical claim based on combining constructs from psychological trust research and cognitive/capability literature via systematic synthesis; no empirical evaluation reported in the abstract.

low positive Optimising Human– AI Decision Performance: A Trust and Cap... actionability for practitioners / advancement of theoretical understanding / ove...

The framework provides practical guidance for executives designing human–AI teams, developing trust calibration training, and establishing performance metrics.

Prescriptive recommendations derived from the proposed model and literature synthesis; the abstract does not report empirical testing of the recommended interventions or their effects.

low positive Optimising Human– AI Decision Performance: A Trust and Cap... practical outcomes (team design quality, training effectiveness, performance mea...

The practical value of the study lies in outlining an analytical framework that can support the design of adaptive workforce strategies, reduce vulnerability to technological disruption, and strengthen the capacity of economies to respond to ongoing digital change.

Claim about the paper's contribution based on the produced analytical framework; the paper presents the framework but does not report empirical validation or outcome measures from real-world implementations.

low positive EDUCATIONAL AND PROFESSIONAL STRATEGIES FOR PREPARING HUMAN ... utility of analytical framework for adaptive workforce strategy design, vulnerab...

Integration of data-driven and AI-supported training tools is a critical component for effective reskilling and upskilling.

Argument based on theoretical analysis and review of practices; the paper recommends integration but does not present empirical performance metrics or randomized evaluations of such tools.

low positive EDUCATIONAL AND PROFESSIONAL STRATEGIES FOR PREPARING HUMAN ... effectiveness of training/reskilling when using data-driven and AI-supported too...

Evidence-based interventions—communication strategies, workload design, capability development, and sustainable human-AI collaboration models—can enhance rather than deplete human cognitive resources.

Paper claims these interventions are identified through synthesis of research; the excerpt does not present direct trial results or quantified effectiveness for these interventions.

low positive When AI Assistance Becomes Cognitive Overload: Understanding... human cognitive resource outcomes (reduced fatigue, improved sustained attention...

Cultural, structural, and decision-making elements co-evolve through recursive feedback loops in human–AI collaboration, advancing process-theoretical understandings of such collaboration.

Analytic interpretation of interview data indicating recursive feedback between cultural norms, structures, and decision routines in AI-integrated startups; presented as an advance to process theory (qualitative evidence; no quantitative test reported).

low positive Hybrid decision architectures: exploring how facilitated AI ... co-evolution dynamics of cultural, structural, and decision-making elements in o...

The study introduces 'hybrid decision architectures' as a dual-level construct that explains how AI triggers systematic organizational change in startups.

Conceptual/theoretical contribution based on synthesis of qualitative interview findings and process-theoretical reasoning (theoretical claim supported by interview data; empirical generalizability not established in excerpt).

low positive Hybrid decision architectures: exploring how facilitated AI ... explanatory power of the 'hybrid decision architectures' construct for organizat...

The future of success will not depend on outpacing machines but on cultivating distinctly human capacities: empathy, discernment, imagination and moral reasoning.

Central argumentative claim of the conceptual essay, derived from cross-disciplinary theory (leadership, emotional intelligence, ethics); no empirical validation or sample provided.

low positive Deconstructing success: why being human still matters future success (as determined by cultivation of specific human capacities)

Productivity-based definitions of success should be dismantled and reconstructed into a framework centered on adaptability and purpose.

Prescriptive recommendation based on synthesis of leadership theory, emotional intelligence research and AI ethics; presented as theoretical proposal rather than empirically tested intervention.

low positive Deconstructing success: why being human still matters formulation of success frameworks emphasizing adaptability and purpose (conceptu...

Digitalization strengthens data security and enhances stakeholder trust in audits.

Findings reported from literature synthesis and empirical analysis in the study; specific security measures, metrics, and sample sizes are not reported in the abstract.

low positive Audit 5.0 and the Digital Transformation of Auditing: The Ro... data security posture and stakeholder trust levels (perceived or measured trust ...

AI would have operated as a cognitive and organizational stabilizer in past industrial contexts, reducing inefficiencies and reinforcing the firm's capacity to adapt, coordinate, and perform.

Interpretation of overall simulation results showing reductions in inefficiencies and improvements across multiple performance measures in the counterfactual AI-HRM scenarios.

low positive Artificial Intelligence and Human Resource Management: A Cou... inefficiency measures; adaptability; coordination; overall firm performance

AI could optimize coordination between human and technological resources, improving operational coordination.

Model includes workforce allocation and coordination-related variables and uses regression-based simulations to project coordination improvements under AI-driven HR processes.

low positive Artificial Intelligence and Human Resource Management: A Cou... coordination metrics between human and technological resources; operational coor...

AI could reduce information asymmetries in performance evaluation.

The paper posits mechanisms and encodes performance-evaluation indicators in the counterfactual model; simulations indicate reduced evaluation-related asymmetries under AI-HRM. (Evidence is model-based; direct empirical measurement of information asymmetry reduction not detailed.)

low positive Artificial Intelligence and Human Resource Management: A Cou... information asymmetry in performance evaluation (evaluation bias/accuracy)

AI could enhance precision in staffing decisions and improve skill–task matching.

Model specification includes staffing and workforce-allocation variables; simulations portray improved staffing precision and skill–task alignment when HR processes are AI-supported. (This is primarily inferred from modeled mechanisms rather than direct experimental manipulation.)

low positive Artificial Intelligence and Human Resource Management: A Cou... staffing precision; quality of skill–task matching

Policy implications emphasize the importance of well-being-centered education, workforce development, and sustainable growth strategies aligned with the Sustainable Development Goals.

Authors recommend these policy directions based on the study's findings linking emotional/psychological factors to productivity and resilience. This is a prescriptive implication rather than an empirical finding; the excerpt does not provide policy evaluation data.

low positive Emotional Intelligence as Human Capital: A Behavioral Econom... policy recommendations (education and workforce development aligned with SDGs)

The helicoid regime is tractable: identifying it, naming it, and understanding its boundary conditions are necessary first steps toward LLMs that remain trustworthy partners in hardest, highest-stakes decisions.

Authors' prescriptive/conceptual claim based on the study's findings and proposed hypotheses; not an empirical result but a recommendation.

low positive AI Knows What's Wrong But Cannot Fix It: Helicoid Dynamics i... not an empirical outcome—this is a proposed strategy/roadmap (qualitative assess...

The study contributes to research emphasizing the importance of prompt design in AI governance, multi-agent coordination, and autonomous system reliability.

Stated contribution based on the experimental results and discussion sections; framed as adding to existing literature rather than a discrete empirical finding. (Contribution scope and bibliometric support not provided in the excerpt.)

low positive Prompt Engineering for Autonomous AI Agents: Enhancing Decis... perceived importance of prompt design in AI governance, multi-agent coordination...

Prompt engineering is not a peripheral technique but a foundational mechanism for optimizing autonomous AI functionality.

Interpretive claim grounded in the study's cumulative experimental findings and discussion; presented as a conceptual conclusion rather than a single measured outcome. (No direct experimental metric labeled 'foundationalness' reported.)

low positive Prompt Engineering for Autonomous AI Agents: Enhancing Decis... conceptual/operational importance of prompt engineering for autonomous AI functi...

The paper contains sufficient detail (representative prompts, verification methodology, complete results) that a coding agent could reproduce the translations directly from the manuscript.

Authors assert inclusion of representative prompts, verification methodology, and comprehensive results in the manuscript to enable direct reproduction by a coding agent.

low positive Automatic Generation of High-Performance RL Environments reproducibility by an automated coding agent (qualitative claim about sufficienc...

TCGJax was synthesized from a private reference absent from public repositories, serving as a contamination control for agent pretraining data concerns.

Statement in the paper that TCGJax was derived from a private, non-public reference (i.e., not in public repos), intended to ensure the environment was not present in agent pretraining data.

low positive Automatic Generation of High-Performance RL Environments availability/uniqueness of reference (private vs public) as contamination contro...

Puffer Pong sees a 42x PPO improvement.

Reported PPO throughput/speed comparison for Puffer Pong between the paper's translated implementation and a baseline (implicit reference), yielding a 42x factor.

low positive Automatic Generation of High-Performance RL Environments PPO throughput / speedup factor

Addressing concerns about job security and skill obsolescence contributes to a more sustainable AI integration approach that promotes workforce adaptability, inclusion, and ethical decision-making.

Framed as a concluding implication of the study's socio-technical perspective; based on theoretical synthesis and empirical observations from Scopus-derived case material but without detailed longitudinal data provided in the summary.

low positive Artificial intelligence and organisational transformation: t... sustainability of AI integration; workforce adaptability; inclusion; ethical dec...

Structured skill enhancement programs, transparent communication, and ethical AI governance frameworks reduce workforce resistance, enhance innovation, and facilitate equitable AI-driven transformation.

Recommendation and finding derived from the study's analysis and case-based insights; the summary frames this as actionable insight but does not cite measured effect sizes or how these interventions were tested empirically.

low positive Artificial intelligence and organisational transformation: t... workforce resistance; organisational innovation; equity of AI-driven transformat...

« Prev 1 2 3 … 53 54 55 … 59 60 Next »