Evidence (7953 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	402	112	67	480	1076
Governance & Regulation	402	192	122	62	790
Research Productivity	249	98	34	311	697
Organizational Efficiency	395	95	70	40	603
Technology Adoption Rate	321	126	73	39	564
Firm Productivity	306	39	70	12	432
Output Quality	256	66	25	28	375
AI Safety & Ethics	116	177	44	24	363
Market Structure	107	128	85	14	339
Decision Quality	177	76	38	20	315
Fiscal & Macroeconomic	89	58	33	22	209
Employment Level	77	34	80	9	202
Skill Acquisition	92	33	40	9	174
Innovation Output	120	12	23	12	168
Firm Revenue	98	34	22	—	154
Consumer Welfare	73	31	37	7	148
Task Allocation	84	16	33	7	140
Inequality Measures	25	77	32	5	139
Regulatory Compliance	54	63	13	3	133
Error Rate	44	51	6	—	101
Task Completion Time	88	5	4	3	100
Training Effectiveness	58	12	12	16	99
Worker Satisfaction	47	32	11	7	97
Wages & Compensation	53	15	20	5	93
Team Performance	47	12	15	7	82
Automation Exposure	24	22	9	6	62
Job Displacement	6	38	13	—	57
Hiring & Recruitment	41	4	6	3	54
Developer Productivity	34	4	3	1	42
Social Protection	22	10	6	2	40
Creative Output	16	7	5	1	29
Labor Share of Income	12	5	9	—	26
Skill Obsolescence	3	20	2	—	25
Worker Turnover	10	12	—	3	25

In the sentiment-analysis task, those individual differences do not produce human–AI complementarity: the joint performance of humans and AI did not exceed that of either alone.

Empirical finding reported from the preregistered sentiment-analysis experiment showing no complementarity effect (joint human-AI performance ≤ best individual performance). (Statistical tests and sample size not included in the excerpt.)

medium null result Who Needs What Explanation? How User Traits Affect Explanati... human–AI joint performance compared to human-alone and AI-alone performance (e.g...
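The complementarity criterion in this entry (joint performance must exceed the better of the two solo conditions) can be sketched as a one-line check. This is a minimal illustration, assuming a single higher-is-better score per condition; the function and argument names are hypothetical and not taken from the paper.

```python
def shows_complementarity(human_alone: float, ai_alone: float, joint: float) -> bool:
    """Return True only if the human+AI team beats BOTH solo conditions.

    Assumes a higher-is-better metric (e.g. accuracy). A joint score that
    merely ties the best solo score does not count as complementarity.
    """
    best_solo = max(human_alone, ai_alone)
    return joint > best_solo
```

A result such as the one reported here corresponds to this check returning False, that is, joint performance at or below the best individual performance.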

We conducted a systematic review and meta-analysis of the literature on AI/HR analytics and organizational decision making, using 85 publications and grounding the work in theories of algorithm-automated decision-making (AST) and matching/hybrid models (STS).

Paper's methods: systematic review and meta-analysis; sample = 85 publications; theoretical framing explicitly stated as AST and STS.

medium null result ALGORITHMIC DETERMINISM VERSUS HUMAN AGENCY: A SYSTEMATIC RE... scope/coverage of literature (number of publications reviewed); theoretical fram...

Macroeconomic fiscal moderation remains empirically unvalidated.

Synthesis conclusion from the review noting an absence of empirical evidence that Agentic AI produces macroeconomic fiscal moderation; i.e., no validated studies showing broad fiscal relief effects were identified in the reviewed literature.

medium null result Agentic AI for Ageing Healthcare Systems in Advanced Economi... macro-fiscal outcomes (e.g., national fiscal pressure, public expenditure modera...

For 2024, the RL-FRB/US model produced a federal budget deficit similar to the baseline: -1.767 trillion $ (RL-FRB/US) vs. -1.758 trillion $ (FRB/US).

Reported fiscal balance (federal budget deficit) simulation outputs for 2024 from comparative model runs in the paper.

medium null result Fiscal Policy Towards Optimizing Macroeconomic Indicators by... Federal budget deficit (trillion $) for 2024

No significant differences emerged in job titles and industry suggested by GPT-5 across genders.

Empirical finding from analysis of GPT-5 outputs comparing suggested job titles and industries for the 24 profiles; exact statistical tests not specified in the summary.

medium null result Gender Bias in Generative AI-assisted Recruitment Processes suggested job titles and industry assignments by GPT-5 across male and female pr...

Self-generated (model-authored) Skills provide no average benefit.

Comparison of three evaluation conditions (no Skills, curated Skills, self-authored Skills) across SkillsBench. Averaged pass-rate deltas show that model-authored Skills do not increase average pass rate relative to baseline; analysis used 7,308 trajectories over 86 tasks and 7 agent–model configurations.

medium null result SkillsBench: Benchmarking How Well Agent Skills Work Across ... task pass rate (average delta for self-authored Skills vs. baseline)

AI will not cause permanent mass unemployment at the aggregate level.

Analytical argument and literature synthesis using labor-economics theory (Skill-Biased Technological Change and structural transformation). No primary microdata, no stated empirical identification strategy or sample size in the paper (methodology appears to be theoretical and sectoral synthesis).

medium null result Artificial Intelligence, Automation, and Employment Dynamics... aggregate employment / unemployment

Empirical evaluation is needed on how AI-induced productivity gains translate into aggregate demand and labor absorption.

Identified research priority in the paper, based on theoretical uncertainty about demand-side labor absorption and lack of conclusive empirical evidence.

medium null result Artificial Intelligence, Automation, and Employment Dynamics... relationship between productivity gains from AI and aggregate demand/employment

AI will not mechanically cause permanent mass unemployment at the aggregate level.

Theoretical framing and synthesis of existing empirical findings across task-based and macro studies; no single new dataset provided (paper draws on literature and conceptual models).

medium null result Artificial Intelligence, Automation, and Employment Dynamics... aggregate employment / unemployment (long-run)

Occupation-level analyses (e.g., BLS OEWS cross-occupation wage regressions) risk producing misleading conclusions about AI’s distributional effects because they aggregate over the task- and firm-level heterogeneity that drives the mechanism.

Theoretical argument and empirical illustration in the paper showing how aggregation masks within-task compression and firm-level rent capture; example regressions on OEWS used to demonstrate the limitation.

medium null result When AI Levels the Playing Field: Skill Homogenization, Asse... accuracy of occupation-level analyses in capturing task-level mechanism (qualita...

Testing the model requires within-occupation, within-task panel data on task-level performance and wages linked to firm-level AI adoption, ownership of complementary assets, and measures of rent-sharing; such data are not available at scale.

Author statement about data requirements and current data limitations; empirical illustration and discussion note absence of large-scale linked microdata meeting these criteria.

medium null result When AI Levels the Playing Field: Skill Homogenization, Asse... availability of suitable microdata for empirical testing (data coverage / scale)

Occupation-level regressions using BLS OEWS (2019–2023) are insufficient for testing the model’s task-level predictions because aggregation across tasks and firms hides the mechanism.

Empirical illustration in the paper using occupation-level regressions on BLS OEWS 2019–2023 showing that such aggregates do not reveal within-occupation, within-task dispersion or firm-level rent concentration effects; paper argues this is a data-adequacy limitation.

medium null result When AI Levels the Playing Field: Skill Homogenization, Asse... ability of occupation-level regressions to detect task-level mechanism (qualitat...

A sensitivity decomposition shows five of the moments (the non‑ΔGini moments) identify internal mechanism rates (how AI changes task production, education responses, screening intensity) but do not determine the aggregate sign of inequality change.

Local identification / sensitivity decomposition performed on the calibrated model; decomposition results reported in the paper attribute mechanism-rate identification to five moments and show they leave the sign of ΔGini indeterminate.

medium null result When AI Levels the Playing Field: Skill Homogenization, Asse... identification of mechanism parameters versus determination of aggregate ΔGini s...

The paper introduces a novel taxonomy that separates patenting into three domains: core AI, traditional robotics, and AI-enhanced robotics.

Methodological contribution of the paper: construction and application of a classification scheme that assigns patent filings (1980–2019) into three domains (core AI, traditional robotics, AI-enhanced robotics). Data source: patent filings 1980–2019 (aggregate counts by domain and country). Exact number of patents not provided in the summary.

medium null result The "Gold Rush" in AI and Robotics Patenting Activity. Do in... categorization/classification of patent filings into three domains

The proposed uncertainty measure connects to classical value-of-information concepts, bridging security mechanism analysis and economic theories of information, signaling, and screening.

Analytical comparison and discussion in the paper linking the entropy-style residual uncertainty metric to value-of-information literature (theoretical linkage).

medium null result Evaluating Synthetic Cyber Deception Strategies Under Uncert... conceptual/analytical alignment between residual uncertainty metric and value-of...

AI did not significantly moderate the relationship between workplace stress and job performance.

Moderation test in PLS-SEM (SmartPLS 4.0) on N = 350; reported non-significant AI × Stress → Performance moderator (paper reports no significant moderating effect).

medium null result AI-driven stress management and performance optimization: A ... job performance
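As a simplified illustration only, a moderation test of this kind is often written as a regression with an interaction term. The paper estimates a PLS-SEM path model rather than this plain regression, and the symbols below are assumptions introduced here, not the paper's notation.

```latex
\[
\text{Performance}_i = \beta_0 + \beta_1\,\text{Stress}_i + \beta_2\,\text{AI}_i
  + \beta_3\,(\text{AI}_i \times \text{Stress}_i) + \varepsilon_i
\]
```

In this notation, the reported null result corresponds to failing to reject $\beta_3 = 0$ for the AI × Stress term.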

Use of AI raises needs for traceability, explainability, and continuous validation to maintain compliance and avoid error propagation in curricular decisions.

Paper's AI governance recommendations (prescriptive), referencing general AI risk principles rather than empirical study.

medium null result Curriculum engineering: organisation, orientation, and manag... traceability/explainability measures, validation frequency, incidence of propaga...

There is no accepted integrative digital model that maps measured or perceived value to algorithmic pricing.

Absence of such a model in the SLR sample of 30 articles and thematic coding that identified this gap explicitly.

medium null result Pricing Strategy in Digital Marketing: A Systematic Review o... Existence of integrative digital VBP model (mapping perceived value to algorithm...

There is no evidence of nonlinearities in the relationship between digital trade and urban house prices (the effect is linear across the sample).

Explicit tests for nonlinearity reported in the econometric analysis (details of test specification not provided in the summary).

medium null result Is digital trade affecting city house prices? An artificial ... city-level house prices

When green-technology innovation is low (below the threshold), the main measurable effect of DE is on improving carbon emission efficiency (CEE), but DE does not yet reduce per capita emissions (PCE).

Results from the threshold-regression models on the 278-city panel (2011–2022) show that in the low-green-innovation regime DE coefficients are significant for CEE but not for PCE; mediating-effect models corroborate the efficiency channel in low-innovation contexts.

medium null result Digital Economy, Green Technology Innovation and Urban Carbo... Carbon emission efficiency (CEE) and Per capita carbon emissions (PCE)
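For orientation, a generic single-threshold panel specification can be written as below. This is a sketch assuming a standard fixed-effects threshold setup; the summary does not give the paper's exact specification or control set, so every symbol here is an assumption.

```latex
\[
y_{it} = \mu_i
       + \beta_1\,\mathrm{DE}_{it}\,\mathbf{1}(q_{it} \le \gamma)
       + \beta_2\,\mathrm{DE}_{it}\,\mathbf{1}(q_{it} > \gamma)
       + \theta^{\top} X_{it} + \varepsilon_{it}
\]
```

Here $y_{it}$ stands for either CEE or PCE in city $i$ and year $t$, $q_{it}$ is green-technology innovation, $\gamma$ is the estimated threshold, and $X_{it}$ collects controls. In these terms, the entry reports that $\beta_1$ (the low-innovation regime) is significant when $y$ is CEE but not when $y$ is PCE.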

Realising DT value requires upfront investment in sensors, integration, standards, and skills; economic viability depends on contract structures and how gains are allocated between investors, owners, contractors, and operators.

Synthesis of cost/benefit discussions and case descriptions in the reviewed literature; policy and procurement examples referenced.

medium null result Digital Twins Across the Asset Lifecycle: Technical, Organis... investment requirements and determinants of economic viability

HCI has explored usable consent, but there is no systematic framework for consent in the AI era.

Literature synthesis and gap identification from workshop participants and solicited position papers; no systematic review or meta-analysis with counted studies reported in the summary.

medium null result Moving Beyond Clicks: Rethinking Consent and User Control in... existence of a systematic AI-era consent framework

Privacy-leak framing (risk vs ambiguity or privacy-threatening vs neutral) did not change participants' subsequent bargaining behavior with pricing algorithms.

The experiment measured downstream bargaining behavior with algorithms after the adoption/label tasks (N = 610) and reports no detectable effect of the privacy/leak framing on those bargaining outcomes.

medium null result The Data-Dollars Tradeoff: Privacy Harms vs. Economic Risk i... Bargaining behavior with pricing algorithms (choices/offer responses in the down...

Under truthful bidding, the decentralised price-based market matches a centralised value-optimal benchmark (i.e., decentralised allocation equals centralised value-optimal allocation).

Paper presents both a theoretical argument (mechanism properties under quasilinear utilities and discrete slices) and empirical validation in simulation by comparing decentralised outcomes to a centralised value-optimal baseline across configurations in the ablation study.

medium null result Real-Time AI Service Economy: A Framework for Agentic Comput... allocation value (total value/throughput) relative to a centralised value-optima...

No clear evidence that project phase systematically shifts sentiment perception.

Project-phase indicators were collected each round and included in correlation and repeated-measures analyses; no consistent, systematic association between project phase and sentiment labeling was found.

medium null result Exploring Indicators of Developers' Sentiment Perceptions in... sentiment label distribution across project phases

Predictors of negative labeling are weak and at best trend-level (e.g., task conflict shows only weak/trend-level association with negative labels).

Correlation analyses and GEE models testing multiple predictors (mood states, life circumstances, team dynamics including task conflict) on negative vs other labels; effects for negative labeling were small and lacked robustness.

medium null result Exploring Indicators of Developers' Sentiment Perceptions in... probability of labeling a statement as negative

Experiments used realistic channel and beamforming datasets reflecting varying elevation angles and dynamic LEO link conditions.

Dataset description in the paper states use of realistic channel and beamforming data including varying elevation angles and dynamic links; no dataset size or public dataset identifiers provided in the summary.

medium null result Federated Learning-driven Beam Management in LEO 6G Non-Terr... data realism and coverage of elevation-angle dynamics (qualitative)

There is a need for causal studies (randomized pilots, phased rollouts) to quantify net welfare effects including patient trust, equity, legal risk, and long-run labor impacts.

Authors' recommendation based on gaps identified in the mixed-methods evidence and acknowledged limitations around causal identification and long-term measurement.

medium null result The Role of Artificial Intelligence in Healthcare Complaint ... recommended outcomes for future causal evaluation (patient trust, equity metrics...

Under the current estimated parameters, dynamics converge toward equilibria—implying convergent, policy-mediated adjustment rather than endogenous cyclical instability.

Inference from stability classification (stable-node equilibria) and model dynamics simulated or linearized around equilibria using parameters estimated for 2016–2023.

medium null result Governance of Technological Transition: A Predator-Prey Anal... convergence behavior of model trajectories (toward equilibrium)

Equilibrium points of the estimated three-stock system are classified as stable nodes (no persistent endogenous cycles under the estimated parameters).

Stability analysis: equilibria computed from estimated parameters and local stability assessed via Jacobian eigenvalues; eigenvalues indicate stable nodes.

medium null result Governance of Technological Transition: A Predator-Prey Anal... equilibrium stability classification (eigenvalues of Jacobian)
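The classification step for a three-variable system can be sketched without a numerical eigenvalue solver. The paper reports inspecting Jacobian eigenvalues directly; the sketch below swaps in an equivalent test on the characteristic polynomial (Routh-Hurwitz conditions plus the cubic discriminant), and the function name and example matrices are hypothetical.

```python
def classify_equilibrium_3x3(J):
    """Classify an equilibrium from its 3x3 Jacobian J (a list of three rows).

    Returns 'stable node' (all eigenvalues real and negative),
    'stable focus' (a complex pair, all real parts negative),
    or 'not asymptotically stable' otherwise.
    """
    (a, b, c), (d, e, f), (g, h, i) = J
    trace = a + e + i
    # Sum of the three principal 2x2 minors.
    minors = (a * e - b * d) + (a * i - c * g) + (e * i - f * h)
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

    # Characteristic polynomial: x^3 + p*x^2 + q*x + r.
    p, q, r = -trace, minors, -det

    # Routh-Hurwitz: every root has a negative real part iff these hold.
    if not (p > 0 and r > 0 and p * q > r):
        return "not asymptotically stable"

    # Cubic discriminant: >= 0 means all roots are real (no oscillation).
    disc = 18 * p * q * r - 4 * p**3 * r + p**2 * q**2 - 4 * q**3 - 27 * r**2
    return "stable node" if disc >= 0 else "stable focus"
```

A "stable node" result matches the entry's reading: trajectories converge without persistent endogenous cycles. A zero discriminant (repeated eigenvalues) is grouped with nodes here, which is a simplifying choice of this sketch.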

Results are robust across alternative AI index specifications, occupational classifications, and standard controls (country and year fixed effects, macroeconomic covariates).

Paper reports robustness checks across different index constructions and occupational taxonomies, with standard controls included in regressions.

medium null result Artificial Intelligence and Labor Market Transformation: Emp... Stability of estimated effects (robustness of employment and wage estimates)

Liability for harm from AI remains unresolved; current regulatory frameworks (notably in the EU) continue to emphasize human responsibility and require conformity and clinical validation.

Regulatory and legal analyses, with emphasis on European Union device regulation and liability principles, as reviewed in the paper.

medium null result Will AI Replace Physicians in the Near Future? AI Adoption B... legal liability allocation; regulatory requirements for conformity and clinical ...

On-Premise RAG matches commercial (cloud) RAG on standard quantitative retrieval and generation metrics.

Empirical comparative analysis using standard retrieval/generation benchmarks comparing three systems (zero-shot baseline, GPT RAG cloud, Open-source On-Prem RAG) under representative SME workloads; specific metric names and sample sizes not reported in the summary.

medium null result An Empirical Study on the Feasibility Analysis of On-Premise... standard retrieval and generation metrics (quantitative performance of retrieval...

State-level advances in worker-protective AI measures exist but are uneven, and many proposed state bills aimed at strengthening workers’ rights related to AI have stalled.

Review of state legislative proposals and enacted laws as compiled in the commentary (state-level policy scan); no systematic quantitative legislative count or sample reported.

medium null result AI governance under the second Trump administration: implica... status of state-level legislation regarding AI and worker protections (enacted v...

Domain adaptation techniques (transfer learning, fine-tuning on local data) are underutilized in low-resource African contexts despite their potential to improve generalization to local populations and care processes.

Thematic coding of methodological sections across the reviewed literature showed relatively few studies employing transfer learning or local fine-tuning approaches in African or other low-resource settings; evidence comes from counts/qualitative summaries within the literature review rather than a formal meta-analysis.

medium null result On the use of synthetic data for healthcare AI in Africa: Te... use of domain adaptation methods and resulting generalization/performance improv...

Research priorities include causal studies on productivity gains from AI, firm‑level adoption dynamics, sectoral labor reallocation, long‑run general equilibrium effects, and heterogeneous impacts across regions and demographic groups.

Set of empirical research recommendations drawn from gaps identified in the literature review and limitations section; not an empirical claim but a prioritized research agenda based on secondary evidence.

medium null result AI and Robotics Redefine Output and Growth: The New Producti... knowledge gaps to be addressed (research outcomes)

Growth‑accounting frameworks and measurement approaches must be updated to capture AI/robotics as intangible and embodied capital, including quality improvements and spillovers.

Methodological argument grounded in literature on measurement challenges and examples of intangible capital; no new measurement exercise or empirical re‑estimation is provided in the paper.

medium null result AI and Robotics Redefine Output and Growth: The New Producti... measurement accuracy of productivity accounts, capture of intangible capital and...

Backtesting the proposed models against historical technological transitions (e.g., ATMs, robotics) and recent AI adoption episodes can validate model performance.

Recommended validation strategy; paper does not report backtest results but prescribes holdout/pseudo‑counterfactual experiments and calibration with administrative outcomes.

medium null result Enhancing BLS Methodologies for Projecting AI's Impact on Em... backtest performance metrics (forecast errors, calibration statistics) when appl...

Scenario modelling in the reviewed literature typically uses counterfactual simulations with different adoption speeds, policy responses, and initial conditions to bound possible employment, wage, and productivity trajectories.

Description and citations of scenario-modelling practices by think tanks and organisations (TBI, IPPR, IMF) and academic work referenced; evidence is methodological and report-based.

medium null result Recent Methodologies on AI and Labour - a Desk Review range of projected employment/wage/productivity trajectories across scenarios

NLP/LLM pipelines are used to extract tasks and skills from free-text job ads and to map those tasks to AI capabilities.

Described methods and citations (Xu et al., 2025; Hampole et al., 2025); evidence is methodological application of transformer-based models to job-ad text in recent studies.

medium null result Recent Methodologies on AI and Labour - a Desk Review task/skill extraction performance and task-to-capability mapping

Methods increasingly apply advanced NLP and large language models (BERT, LSTM, GPT-4) to parse job descriptions, map skills/tasks, and predict automation risk.

Cited methodological examples in the paper (Xu et al., 2025; Hampole et al., 2025) and discussion of common pipelines using transformer-based models to extract tasks from free-text job ads and to map tasks to AI capabilities; evidence is methodological and based on recent studies rather than a single benchmarked dataset.

medium null result Recent Methodologies on AI and Labour - a Desk Review task/skill extraction and AI-exposure prediction accuracy from free-text job des...

Functional domains vary in maturity: for example, procurement has more applied work than other functions.

Reviewer observation from the systematic search and screening across 2020–2025 literature noting uneven distribution of empirical/applied studies across functions.

medium null result Integrating Artificial Intelligence and Enterprise Resource ... relative maturity (volume of applied studies or case evidence per functional dom...

A centralized policy engine for access control, data handling rules, and change management is a necessary control point in the reference pattern.

Prescriptive recommendation in the paper supported by best-practice synthesis and case anecdotes; no direct empirical comparison of centralized vs federated policy engines provided.

medium null result Governed Hyperautomation for CRM and ERP: A Reference Patter... effectiveness of access control and change management (e.g., policy violations, ...

Research gaps include the need for standardized evaluation metrics, robustness- and consistency-focused XAI methods, domain-informed explanation frameworks, and longitudinal/clinical impact studies.

Recommendations section of the review synthesizing recurring deficits across papers and proposing priorities.

medium null result Explainable Artificial Intelligence (XAI) for EEG Analysis: ... recommended research directions / missing evaluation components

Recommendation for research and modeling: economic models of AI markets should incorporate institutional regime types (centralized vs decentralized), enforcement uncertainty, and legitimacy effects as parameters affecting data access costs, R&D productivity, and market concentration.

Normative recommendation based on the comparative typology and inferred mechanisms from the document analysis; not empirically validated within the study.

medium null result Balancing openness and security in scientific data governanc... modeling parameters (regime type, enforcement uncertainty, legitimacy effects) a...

Theoretical contribution: the paper extends modular coordination theory by treating openness–security trade‑offs as layered, adaptive institutional processes embedded in political regimes and 'legitimacy economies.'

Argumentative/theoretical development in the paper grounded in document analysis and literature on coordination and legitimacy.

medium null result Balancing openness and security in scientific data governanc... theoretical framing / extension of modular coordination theory

Providing optional LLM access without training did not increase average exam scores versus no LLM access.

Intent-to-treat comparisons across randomized arms reported in the study: comparison of optional-access-without-training arm to no-access arm showed no average score improvement (sample n = 164).

medium null result Training for Technology: Adoption and Productive Use of Gene... Exam score (grade points)
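An intent-to-treat comparison groups participants by the arm they were assigned to, whether or not they actually used the tool. A minimal sketch follows, assuming one record per participant; the field names and arm labels are hypothetical, and the study's own estimator (covariates, standard errors) is not described in the summary.

```python
from statistics import mean

def itt_effect(records, treatment_arm, control_arm):
    """Difference in mean score between two ASSIGNED arms.

    Each record is a dict with an 'arm' and a 'score' key (hypothetical
    field names). Actual usage of the tool is deliberately ignored:
    that is what makes the comparison intent-to-treat.
    """
    scores = {treatment_arm: [], control_arm: []}
    for rec in records:
        if rec["arm"] in scores:
            scores[rec["arm"]].append(rec["score"])
    return mean(scores[treatment_arm]) - mean(scores[control_arm])
```

An estimate near zero from a comparison like this, with the optional-access arm as treatment and the no-access arm as control, is what the entry describes as no average score improvement.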

Cross-border coordination is crucial because platform services and data flows often transcend jurisdictions.

Policy analysis and descriptive examples of cross-border platform operations in the reviewed literature; not empirically quantified in the paper.

medium null result Financial Inclusion in the Age of FinTech Platforms: Opportu... need for cross-border regulatory coordination (qualitative importance)

Standardized metrics for 'inclusive outcomes' are needed beyond account ownership—e.g., active usage, quality of credit, stability of access, and welfare effects.

Critical assessment of measurement shortcomings in existing financial inclusion literature; prescriptive recommendation rather than empirical evidence.

medium null result Financial Inclusion in the Age of FinTech Platforms: Opportu... measurement quality of inclusion metrics (active usage, credit quality, access s...

Realizing AI’s potential for circular-economy and energy-efficiency goals requires coordinated interventions across environmental regulation, digital infrastructure, and workforce skill formation.

Policy interpretation drawn from heterogeneity results (regulation and infrastructure amplify AI effects) and the identified labor-market mechanism (skill composition matters); recommendation rather than direct causal estimate.

medium null result Artificial intelligence, greening of occupational structure ... Policy-relevant intermediate outcomes (regulation strength, infrastructure level...
