Evidence (7953 claims)
Claim counts by category (a single claim may be tagged with multiple categories, so counts overlap):

| Category | Claims |
|---|---|
| Adoption | 5539 |
| Productivity | 4793 |
| Governance | 4333 |
| Human-AI Collaboration | 3326 |
| Labor Markets | 2657 |
| Innovation | 2510 |
| Org Design | 2469 |
| Skills & Training | 2017 |
| Inequality | 1378 |
Evidence Matrix
Claim counts by outcome category and direction of finding. Row totals can exceed the sum of the four listed directions where some claims carry other or unclassified directions.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 402 | 112 | 67 | 480 | 1076 |
| Governance & Regulation | 402 | 192 | 122 | 62 | 790 |
| Research Productivity | 249 | 98 | 34 | 311 | 697 |
| Organizational Efficiency | 395 | 95 | 70 | 40 | 603 |
| Technology Adoption Rate | 321 | 126 | 73 | 39 | 564 |
| Firm Productivity | 306 | 39 | 70 | 12 | 432 |
| Output Quality | 256 | 66 | 25 | 28 | 375 |
| AI Safety & Ethics | 116 | 177 | 44 | 24 | 363 |
| Market Structure | 107 | 128 | 85 | 14 | 339 |
| Decision Quality | 177 | 76 | 38 | 20 | 315 |
| Fiscal & Macroeconomic | 89 | 58 | 33 | 22 | 209 |
| Employment Level | 77 | 34 | 80 | 9 | 202 |
| Skill Acquisition | 92 | 33 | 40 | 9 | 174 |
| Innovation Output | 120 | 12 | 23 | 12 | 168 |
| Firm Revenue | 98 | 34 | 22 | — | 154 |
| Consumer Welfare | 73 | 31 | 37 | 7 | 148 |
| Task Allocation | 84 | 16 | 33 | 7 | 140 |
| Inequality Measures | 25 | 77 | 32 | 5 | 139 |
| Regulatory Compliance | 54 | 63 | 13 | 3 | 133 |
| Error Rate | 44 | 51 | 6 | — | 101 |
| Task Completion Time | 88 | 5 | 4 | 3 | 100 |
| Training Effectiveness | 58 | 12 | 12 | 16 | 99 |
| Worker Satisfaction | 47 | 32 | 11 | 7 | 97 |
| Wages & Compensation | 53 | 15 | 20 | 5 | 93 |
| Team Performance | 47 | 12 | 15 | 7 | 82 |
| Automation Exposure | 24 | 22 | 9 | 6 | 62 |
| Job Displacement | 6 | 38 | 13 | — | 57 |
| Hiring & Recruitment | 41 | 4 | 6 | 3 | 54 |
| Developer Productivity | 34 | 4 | 3 | 1 | 42 |
| Social Protection | 22 | 10 | 6 | 2 | 40 |
| Creative Output | 16 | 7 | 5 | 1 | 29 |
| Labor Share of Income | 12 | 5 | 9 | — | 26 |
| Skill Obsolescence | 3 | 20 | 2 | — | 25 |
| Worker Turnover | 10 | 12 | — | 3 | 25 |
Study limitations: cross-sectional design, self-reported intentions, potential unobserved confounders, and generalizability limited to three cities (Beijing, Guangzhou, Lanzhou).
Explicit methodological statements in the paper describing data and design: cross-sectional survey of 889 respondents from three cities and reliance on self-reported employment intentions.
Because the study is cross-sectional and relies on self-report, causal claims are limited and generalizability is restricted to Generation Z (a limitation noted in the paper).
Authors' limitations: cross-sectional/self-report design and sample restricted to Generation Z; these constraints are reported in the paper.
Study design: cross-sectional self-report survey of 450 Generation Z consumers analyzed with Structural Equation Modeling (SPSS AMOS).
Methods section reporting sample size (n = 450), target population (Generation Z), cross-sectional survey design, and analysis technique (SEM using SPSS AMOS).
The measurement and structural models show good-to-excellent fit and reliable constructs (CFI = 0.980, TLI = 0.974, RMSEA = 0.062, SRMR = 0.031).
Reported psychometric/model-fit indices from SEM analysis (SPSS AMOS) on sample of 450 respondents.
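For context, these indices are deterministic functions of the model and baseline chi-square statistics, which the summary does not report. A minimal Python sketch with hypothetical chi-square values (chosen only to land near the reported RMSEA, not taken from the paper) shows the conventional formulas:

```python
import math

def fit_indices(chi2_m, df_m, chi2_b, df_b, n):
    """Conventional SEM fit indices from model (m) and baseline (b) chi-squares."""
    rmsea = math.sqrt(max(chi2_m - df_m, 0) / (df_m * (n - 1)))
    cfi = 1 - max(chi2_m - df_m, 0) / max(chi2_m - df_m, chi2_b - df_b, 0)
    tli = ((chi2_b / df_b) - (chi2_m / df_m)) / ((chi2_b / df_b) - 1)
    return {"RMSEA": round(rmsea, 3), "CFI": round(cfi, 3), "TLI": round(tli, 3)}

# Hypothetical values for illustration only (n matches the reported sample of 450).
print(fit_indices(chi2_m=180.0, df_m=65, chi2_b=4500.0, df_b=78, n=450))
```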
Outcomes reported are primarily self-reported psychological measures rather than objective productivity metrics.
Paper reports measurement instruments focused on self-reported self-efficacy, psychological ownership, meaningfulness, and enjoyment/satisfaction; no primary objective productivity metrics reported.
The experiment was pre-registered, used occupation-specific writing tasks, and employed a between-subjects design with three conditions (No-AI, Passive AI, Active collaboration).
Study design reported in the paper: pre-registration statement, N = 269, between-subjects assignment to three conditions using occupation-specific writing tasks.
Active, collaborative AI use preserves perceived meaningfulness of work at levels comparable to independent work and does not produce the lasting psychological costs seen with passive use.
Pre-registered experiment (N = 269) with post-manipulation and post-return measures; Active-collaboration condition matched No-AI on meaningfulness and showed no persistent declines after returning to manual tasks.
Active, collaborative AI use preserves psychological ownership of outputs at levels comparable to independent work.
Pre-registered experiment (N = 269); Active-collaboration condition reported ownership levels similar to No-AI condition on self-report scales.
Active, collaborative AI use (the human drafts first, then uses AI to refine) preserves self-efficacy at levels comparable to independent (no-AI) work.
Pre-registered experiment (N = 269) comparing Active-collaboration and No-AI conditions; no statistically meaningful differences in self-efficacy between them (self-reported measures).
The paper identifies future research directions, including empirical causal studies of how DPP+AI interventions (digital product passports combined with AI) change recycling rates, second‑hand market prices, and firm investment in circular processes, and modeling of firm strategy around proprietary versus shared DPP data.
Stated research agenda and gaps in the paper informed by the study's findings and limitations; these are recommendations rather than empirical claims.
The study used a mixed-methods design focused on the Italian fashion and cosmetics industries, employing two online surveys, k‑means clustering (consumer segmentation), principal component analysis (to identify underlying dimensions of DPP functionalities and sustainability practices), and logistic regression (to identify adoption drivers).
Methods section summary provided in the paper; explicit statement of methods and industry context. Note: sample sizes and survey instrument details are not provided in the summary.
Two consumer segments were identified: 'aware' consumers (environmentally attuned and receptive to digital innovation and sustainability information) and 'unaware' consumers (prioritize immediate, tangible benefits like price and convenience over sustainability information).
K‑means cluster analysis applied to consumer responses from one of the online surveys in the Italian fashion and cosmetics context; summary identifies two clusters; sample sizes not reported.
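A minimal sketch of this segmentation-and-drivers pipeline using scikit-learn; the data matrix, item counts, component count, and outcome variable below are synthetic stand-ins, since the paper's survey details are not reported in the summary:

```python
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from sklearn.cluster import KMeans
from sklearn.linear_model import LogisticRegression

# Hypothetical survey matrix: rows = respondents, columns = Likert items on
# DPP functionalities and sustainability practices (not the paper's data).
rng = np.random.default_rng(0)
X = rng.integers(1, 6, size=(300, 12)).astype(float)

X_std = StandardScaler().fit_transform(X)

# PCA to identify underlying dimensions of the survey items.
components = PCA(n_components=3).fit_transform(X_std)

# k-means with k=2 mirrors the two-segment ('aware'/'unaware') solution.
segments = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(components)

# Logistic regression on a hypothetical binary adoption outcome
# to identify adoption drivers from the component scores.
adopt = rng.integers(0, 2, size=300)
drivers = LogisticRegression().fit(components, adopt)
print(drivers.coef_)
```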
This work is a conceptual/policy analysis rather than an original empirical study.
Explicit statement in the paper's Data & Methods section.
Study limitations include single-country (China) listed‑firm sample and reliance on secondary/administrative proxies for digitalization and innovation, which may miss internal qualitative aspects and introduce measurement error.
Authors’ stated limitations: sample restricted to Chinese A-share listed firms (2012–2022) and measures of digitalization/innovation derived from administrative/secondary data rather than direct observation/survey of internal practices.
No new primary empirical tests were performed in this paper; conclusions rest on secondary analysis and are broad and diagnostic rather than demonstrations of causal mechanisms.
Explicit methodological statement in the Data & Methods and Limitations sections of the paper describing it as a qualitative literature review and synthesis.
Research should prioritize causal identification (IV, difference‑in‑differences, regression discontinuity) to disentangle whether ESG causes better financial outcomes or instead proxies for unobserved firm quality.
Methodological recommendation based on limitations in the reviewed literature (many observational/correlational studies); the paper argues for stronger causal designs going forward.
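As a concrete illustration of one recommended design, a difference-in-differences regression takes only a few lines; the panel below is synthetic and the variable names (`roa`, `esg`, `post`) are placeholders, not drawn from any reviewed study:

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# Hypothetical firm-year panel: 'esg' marks firms adopting an ESG practice,
# 'post' marks years after adoption.
rng = np.random.default_rng(1)
n = 400
df = pd.DataFrame({
    "roa": rng.normal(0.05, 0.02, n),
    "esg": rng.integers(0, 2, n),
    "post": rng.integers(0, 2, n),
})

# Difference-in-differences: the esg:post interaction is the causal estimand,
# valid under the parallel-trends assumption.
model = smf.ols("roa ~ esg * post", data=df).fit()
print(model.params["esg:post"])
```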
The authors propose research priorities for economists: quantify productivity gains from closing the actionability gap; estimate firm-level heterogeneity in evaluation capability and its effect on adoption; and model investment trade-offs between building evaluation-to-action pipelines versus accepting reduced LLM performance.
Paper's concluding recommendations for future research directions (explicitly listed by the authors).
The paper produces as primary outcomes a taxonomy of ten evaluation practices, the articulation of the results-actionability gap, and recommended strategies observed among successful teams.
Authors report these as the main outcomes of their thematic analysis and syntheses from the 19 interviews.
The study method consisted of semi-structured qualitative interviews with 19 practitioners across multiple industries and roles, analyzed via thematic coding.
Explicit methods section of the paper stating sample size (n=19), participant diversity, interview approach, and coding/analysis procedure.
AI-economics research should treat quantum capability as a distinct, gradually diffusing, sector-specific factor of production, and should model complementarities and policy counterfactuals endogenously.
Modeling recommendations grounded in sensitivity of macro outcomes to diffusion patterns, complementarities, and policy choices observed in the scenario and counterfactual analyses.
Model parameters are calibrated using historical diffusion of enabling technologies (cloud computing, GPUs, AI toolchains), industry case studies, and expert elicitation where hard data are lacking.
Empirical grounding section describing calibration sources: historical diffusion, case studies (materials discovery, optimization), and expert elicitation.
Uncertainty quantification is performed by running Monte Carlo or scenario ensembles and conducting sensitivity and robustness checks.
Methodological claim in the uncertainty quantification section describing Monte Carlo/scenario ensemble approach.
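A minimal sketch of a Monte Carlo ensemble of this kind, with a stub response function and illustrative parameter ranges standing in for the paper's calibrated pipeline:

```python
import numpy as np

rng = np.random.default_rng(42)
N_RUNS = 10_000

def gdp_effect(p, q, advantage):
    """Stub standing in for the full diffusion -> TFP -> CGE pipeline."""
    return advantage * (p + q)  # placeholder response surface

# Parameter ranges are illustrative assumptions, not the paper's calibration.
p = rng.uniform(0.01, 0.05, N_RUNS)    # innovation coefficient
q = rng.uniform(0.2, 0.5, N_RUNS)      # imitation coefficient
adv = rng.normal(0.02, 0.005, N_RUNS)  # sectoral quantum advantage

outcomes = gdp_effect(p, q, adv)
print(np.percentile(outcomes, [5, 50, 95]))  # uncertainty band across the ensemble
```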
Sectoral TFP shocks are integrated into computational general equilibrium (CGE) or multi-sector growth models (and optionally DSGE variants) to simulate GDP, sector output, trade impacts, and labor reallocation.
Method section stating integration of sectoral TFP shocks into CGE/multi-sector growth models with optional DSGE short-run dynamics.
Sectoral adoption is translated into total factor productivity (TFP) shocks or sector-specific Hicks-neutral productivity improvements based on micro evidence of quantum advantages.
Methodological description of productivity mapping linking adoption to TFP shocks using micro evidence and case studies.
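One simple illustrative functional form for this mapping (the paper's exact specification is not given in the summary) scales the shock by adoption share, task exposure, and the measured advantage:

```python
def tfp_shock(adoption_share, quantum_advantage, task_exposure):
    """Map sectoral adoption to a Hicks-neutral TFP shock.
    Illustrative assumption: the shock is the product of the adoption share,
    the fraction of the sector's tasks that benefit, and the measured
    advantage on those tasks."""
    return adoption_share * quantum_advantage * task_exposure

# e.g., 30% adoption, 5% advantage, 10% of tasks exposed -> 0.15% TFP shock
print(tfp_shock(0.30, 0.05, 0.10))
```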
The paper uses empirical diffusion functions (logistic/S-curve, Bass model) calibrated to analogous technologies to project uptake over time.
Methodological description: diffusion modeling section explicitly states use of logistic/S-curve and Bass models and calibration to past technologies (cloud, GPUs).
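For reference, the Bass model's cumulative adoption share is F(t) = (1 - e^{-(p+q)t}) / (1 + (q/p) e^{-(p+q)t}), with innovation coefficient p and imitation coefficient q. A short sketch with illustrative parameter values (the paper calibrates to cloud/GPU diffusion; these numbers are not its estimates):

```python
import numpy as np

def bass_cumulative(t, p, q):
    """Bass cumulative adoption share F(t), innovation p and imitation q."""
    e = np.exp(-(p + q) * t)
    return (1 - e) / (1 + (q / p) * e)

# Illustrative parameters in the range often estimated for enabling technologies.
years = np.arange(0, 21)
print(bass_cumulative(years, p=0.03, q=0.38))  # S-curve from 0 toward 1
```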
The analysis used sentence‑transformer models to produce dense vector representations of article text and UMAP to project those embeddings into a low‑dimensional thematic map for cluster identification and gap detection.
Methods section specifying use of sentence‑transformer embeddings and UMAP for dimensionality reduction/visualization of article text.
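A minimal sketch of that pipeline using the sentence-transformers and umap-learn libraries; the model name and parameters below are assumptions, not those reported in the paper:

```python
from sentence_transformers import SentenceTransformer
import umap

# Placeholder corpus sized to the review's final sample (n = 109).
texts = [f"Abstract of article {i} ..." for i in range(109)]

model = SentenceTransformer("all-MiniLM-L6-v2")
embeddings = model.encode(texts)            # one dense vector per article

reducer = umap.UMAP(n_components=2, random_state=0)
coords = reducer.fit_transform(embeddings)  # 2-D thematic map for clustering
print(coords.shape)                         # (109, 2)
```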
The study followed a PRISMA protocol for literature selection and included peer‑reviewed journal articles published between 2014 and 2024, with a final sample size of n = 109.
Explicit methodological statement in the paper describing the literature search, inclusion/exclusion criteria, and final sample.
Twenty‑seven papers study marketing in banking without using NLP methods.
PRISMA systematic review; categorization of the 109 selected articles into the three coverage groups (8, 74, 27).
Seventy‑four papers study NLP in marketing more broadly (not specifically banking).
Same PRISMA‑based systematic review and manual categorization of the final sample n = 109 into topical buckets (NLP in marketing vs. NLP in bank marketing vs. marketing in banking without NLP).
Only 8 peer‑reviewed papers directly examine NLP in bank marketing (out of a final sample of 109 articles published 2014–2024).
Systematic review following PRISMA protocol; final sample n = 109 peer‑reviewed journal articles published 2014–2024; manual screening and categorization yielding counts by topic.
The study's findings are qualitative and case-driven (Xiaomi and Deloitte); generalizability is limited by case selection and the absence of standardized quantitative metrics.
Methods section explicitly states case analysis and literature review as primary methods and notes lack of large-scale quantitative measurement.
The methodology is normative-philosophical argumentation supplemented by interdisciplinary synthesis (phenomenology, deconstruction, object-oriented ontology (OOO), and the STS/material turn); this is not an empirical causal study and contains no quantitative datasets.
Author-declared methods and limits: statement that the intervention is theory-driven and qualitative; absence of quantitative analysis reported.
The paper’s empirical grounding consists of illustrative case studies and vignettes from healthcare robotics, autonomous vehicles, and algorithmic governance used to demonstrate distributed agency and responsibility.
Author-stated methodology: qualitative vignettes/case illustrations across three domains; no reported sample sizes or systematic data collection.
The analysis in the paper is primarily qualitative and descriptive; it does not empirically quantify AI’s effects on trade flows or welfare.
Explicit statement in the methods/data description noting a mixed qualitative approach (theoretical analysis, comparative legal analysis, case studies, scenario reasoning) and absence of empirical quantification.
The study is qualitative and law-focused and uses Vietnam as a focused case study without collecting primary quantitative field data.
Explicit Data & Methods statement in the paper indicating doctrinal legal analysis, comparative institutional analysis, and normative framework development; no primary quantitative sample.
The study recommends empirical metrics for future evaluation of reforms, including processing time per case, reversal rates on appeal, administrative litigation frequency, compliance and procurement costs, investment flows into public-sector AI, and changes in labor composition and wages in administrative agencies.
Methodological recommendation arising from the paper's normative and comparative analysis.
The paper's argument is principally theoretical and prescriptive and requires empirical validation across domains and at scale.
Author-stated limitation in the Data & Methods section noting that the work is primarily conceptual and that empirical validation is needed.
Operationalizing DSS requires building domain ontologies/knowledge graphs, designing synthetic curricula, training compact domain models, benchmarking against monolithic LLMs, and measuring total cost-of-ownership (energy, latency, bandwidth, infrastructure).
Paper's recommended experimental and measurement agenda (procedural/methodological prescriptions); this is a proposed research plan rather than an empirical result.
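A toy total-cost-of-ownership comparison between a compact domain model and a monolithic LLM, in the spirit of the proposed measurement agenda; every figure below is an illustrative assumption, and latency would be benchmarked separately rather than priced directly:

```python
def tco(energy_kwh, price_per_kwh, infra_per_month, months,
        bandwidth_gb, price_per_gb):
    """Annualized TCO from energy, infrastructure, and bandwidth components."""
    return (energy_kwh * price_per_kwh
            + infra_per_month * months
            + bandwidth_gb * price_per_gb)

# Hypothetical deployments: all inputs are made-up illustrative values.
compact = tco(1_200, 0.15, 300, 12, 50, 0.09)
monolithic = tco(40_000, 0.15, 4_000, 12, 800, 0.09)
print(compact, monolithic)
```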
Analysis compared responses across 16 predefined dimension pairs (ethical dimensions or response axes) and used repeated measures and qualitative coding to characterize system behavior.
Methods and Analysis sections reporting use of 16 dimension-pair comparisons, repeated-measures tests for delta between blind and declared administrations, and qualitative coding to derive D3 failure taxonomy.
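A minimal sketch of one such dimension-pair comparison; the scores are hypothetical, and a nonparametric paired test (Wilcoxon) stands in for whatever repeated-measures test the paper actually used:

```python
import numpy as np
from scipy.stats import wilcoxon

# Hypothetical per-item scores for one dimension pair, blind vs. declared.
# The study reports zero delta on all 16 pairs, mirrored here.
blind    = np.array([4.0, 3.5, 4.2, 3.8, 4.1, 3.9])
declared = np.array([4.0, 3.5, 4.2, 3.8, 4.1, 3.9])

delta = declared - blind
if np.allclose(delta, 0):
    print("zero delta: no measurable change with declaration status")
else:
    print(wilcoxon(blind, declared))  # paired nonparametric test otherwise
```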
Probe administration included operational controls: runs were administered by two human raters across three machines to ensure operational consistency.
Methods statement describing administration by two human raters on three machines.
The ceiling discrimination probe used Gemini Pro (Google) and Copilot Pro (Microsoft) as independent judges.
Methods: reported use of Gemini Pro and Copilot Pro as independent judges for the ceiling probe.
Primary blind scoring was performed by Claude (Anthropic) used as an LLM judge.
Methods: primary blind scoring explicitly performed by Claude.
Re-administration under declared conditions produced zero delta across all 16 dimension-pair comparisons (no measurable change when declaration status changed).
Reported repeated-measures comparisons across 16 predefined dimension pairs between blind and declared administrations, with reported zero delta.
Series 2 consisted of local and API open-source systems (n = 6) administered blind and declared, with four systems re-administered under declared conditions.
Methods description detailing Series 2 composition, modes (blind and declared), and that four systems were re-tested under declared conditions.
Series 1 consisted of frontier commercial systems administered blind (n = 7).
Methods description specifying Series 1 composition and blind administration.
The study employed 24 experimental conditions spanning 13 distinct LLM systems across two series.
Study design reported in Methods: Series 1 (frontier commercial, blind, n=7), Series 2 (local/API open-source, blind and declared, n=6), plus re-administered declared runs and ceiling-probe runs summing to 24 conditions.
The paper does not report proprietary deployment metrics beyond qualitative field observations; instead, experimental formalizations are provided for reproducible evaluation.
Authors explicitly note they document how to reproduce experiments but do not claim proprietary deployment metrics beyond qualitative field observations.
The paper recommends tracking specific operational and economic metrics: MTTR for tool failures, per-invocation latency variance, per-interaction operational cost, frequency of identity-related incidents, human remediation hours per 1,000 incidents, and SLA breach rates.
Explicit list of recommended metrics in the implications and metrics-to-track sections of the paper.
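A small sketch computing several of these metrics from hypothetical logs; all field names and values are assumptions, not figures from the paper:

```python
import statistics

# Hypothetical incident and invocation logs.
failure_downtimes_h = [0.5, 2.0, 0.25, 1.5]        # downtime per tool failure
invocation_latencies_ms = [120, 95, 310, 101, 98]  # per-invocation latency
remediation_hours, incidents = 42.0, 1_750
sla_breaches, sla_windows = 3, 400

mttr_h = sum(failure_downtimes_h) / len(failure_downtimes_h)   # MTTR
latency_var = statistics.variance(invocation_latencies_ms)     # latency variance
remediation_per_1k = remediation_hours / incidents * 1_000     # hours / 1,000 incidents
sla_breach_rate = sla_breaches / sla_windows                   # SLA breach rate

print(mttr_h, latency_var, remediation_per_1k, sla_breach_rate)
```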
The paper provides a production-readiness checklist and instructions for reproducible evaluation alongside the proposed mechanisms.
Deliverables enumerated in the paper include a production-readiness checklist and reproducible experimental methodology.
All three proposed mechanisms (CABP, ATBA, SERF) are formalized as testable hypotheses with reproducible experimental methodology (benchmarks, latency/error models, broker pipeline semantics).
Paper includes formal descriptions and reproducible evaluation instructions and benchmarks; authors state methods to reproduce experiments are provided.