Evidence (4049 claims)
Adoption: 5126 claims
Productivity: 4409 claims
Governance: 4049 claims
Human-AI Collaboration: 2954 claims
Labor Markets: 2432 claims
Org Design: 2273 claims
Innovation: 2215 claims
Skills & Training: 1902 claims
Inequality: 1286 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 369 | 105 | 58 | 432 | 972 |
| Governance & Regulation | 365 | 171 | 113 | 54 | 713 |
| Research Productivity | 229 | 95 | 33 | 294 | 655 |
| Organizational Efficiency | 354 | 82 | 58 | 34 | 531 |
| Technology Adoption Rate | 277 | 115 | 63 | 27 | 486 |
| Firm Productivity | 273 | 33 | 68 | 10 | 389 |
| AI Safety & Ethics | 112 | 177 | 43 | 24 | 358 |
| Output Quality | 228 | 61 | 23 | 25 | 337 |
| Market Structure | 105 | 118 | 81 | 14 | 323 |
| Decision Quality | 154 | 68 | 33 | 17 | 275 |
| Employment Level | 68 | 32 | 74 | 8 | 184 |
| Fiscal & Macroeconomic | 74 | 52 | 32 | 21 | 183 |
| Skill Acquisition | 85 | 31 | 38 | 9 | 163 |
| Firm Revenue | 96 | 30 | 22 | — | 148 |
| Innovation Output | 100 | 11 | 20 | 11 | 143 |
| Consumer Welfare | 66 | 29 | 35 | 7 | 137 |
| Regulatory Compliance | 51 | 61 | 13 | 3 | 128 |
| Inequality Measures | 24 | 66 | 31 | 4 | 125 |
| Task Allocation | 64 | 6 | 28 | 6 | 104 |
| Error Rate | 42 | 47 | 6 | — | 95 |
| Training Effectiveness | 55 | 12 | 10 | 16 | 93 |
| Worker Satisfaction | 42 | 32 | 11 | 6 | 91 |
| Task Completion Time | 71 | 5 | 3 | 1 | 80 |
| Wages & Compensation | 38 | 13 | 19 | 4 | 74 |
| Team Performance | 41 | 8 | 15 | 7 | 72 |
| Hiring & Recruitment | 39 | 4 | 6 | 3 | 52 |
| Automation Exposure | 17 | 15 | 9 | 5 | 46 |
| Job Displacement | 5 | 28 | 12 | — | 45 |
| Social Protection | 18 | 8 | 6 | 1 | 33 |
| Developer Productivity | 25 | 1 | 2 | 1 | 29 |
| Worker Turnover | 10 | 12 | — | 3 | 25 |
| Creative Output | 15 | 5 | 3 | 1 | 24 |
| Skill Obsolescence | 3 | 18 | 2 | — | 23 |
| Labor Share of Income | 7 | 4 | 9 | — | 20 |
Governance
Classical scaling laws model AI performance as monotonically improving with model size.
Statement about prior literature / modeling assumptions (classical scaling laws). No empirical sample size reported in the excerpt.
The paper derives formal conditions under which the inversion (smaller, orchestrated models outperforming frontier models) holds.
Mathematical derivations and stated sufficient/necessary conditions presented in the paper.
We develop the Institutional Fitness Manifold, a mathematical framework that evaluates AI systems along four dimensions: capability, institutional trust, affordability, and sovereign compliance.
Theoretical/model development presented in the paper (formal definition of the manifold and its four dimensions).
There have been five eras of AI development since 1943, and within the current Generative AI Era there are four distinct epochs, each initiated by a discontinuous event.
Descriptive/historical classification within the paper (counts of eras and epochs; named initiating events such as the transformer and the 'DeepSeek Moment').
The study uses panel data for 30 Chinese provinces from 2013–2022 to measure urban circular economy efficiency (UCEE) with a Super-SBM model including undesirable outputs, track dynamics via the Global Malmquist–Luenberger index, and estimate spatial effects with a spatial Durbin model.
Methodological description in the abstract: explicit statement of data (30 provinces, 2013–2022) and the three methods used (Super-SBM with undesirable outputs, GML index, spatial Durbin model).
Despite fears of mass unemployment, aggregate data through 2025 show limited labor-market disruption from generative AI.
Review of aggregate employment and labor-market studies and macro-level data through 2025 cited in the brief; methods include analyses of employment statistics and macro labor indicators (no single sample size reported).
We scored rule-breaking and abuse outcomes with an independent rubric-based judge across 28,112 transcript segments from multi-agent governance simulations.
Reported methodology: multi-agent governance simulations with agents in formal governmental roles, outcomes evaluated by an independent rubric-based judge; explicit sample count of 28,112 transcript segments.
A graph neural network (GNN) is constructed from agents' reasoning embeddings, and trading decisions are made using a PPO-DSR policy.
Method description: the paper reports embedding agents' reasoning, building a graph neural network (GNN) from those embeddings, and using a PPO-DSR reinforcement learning policy to trade. Specific GNN/PPO-DSR hyperparameters and architecture are not provided in the excerpt.
Four LLM agents output scores along with reasoning.
Method description: the paper states that four LLM agents produce numeric scores and associated textual reasoning. The number of agents is explicitly given as four; no further architecture or model-family details included in the excerpt.
BlindTrade blindfolds agents by anonymizing all identifiers, including tickers and company names.
Methodological description in the paper: the system design explicitly replaces tickers and company names with anonymized identifiers. Implementation details and examples not provided in the excerpt.
Data ethics, as a central pillar of digital ethics, emphasizes the responsible use and protection of personal information.
Conceptual/definitional statement in the paper situating data ethics within digital ethics and highlighting protection of personal information as a core concern.
Big data usage is proxied by keyword frequency in firms' annual reports.
Operationalization described in the paper: frequency/count of big-data-related keywords in annual reports used as the proxy for firms' big data application.
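Operationalizing such a proxy is straightforward; the sketch below counts occurrences of a hypothetical big-data keyword dictionary in report text (the paper's actual keyword list is not given in the excerpt).

```python
import re

# Hypothetical keyword dictionary; the paper's actual list is not reported here.
BIG_DATA_KEYWORDS = ["big data", "data mining", "cloud computing", "machine learning"]

def keyword_frequency(report_text: str, keywords=BIG_DATA_KEYWORDS) -> int:
    """Count total occurrences of big-data-related keywords in an annual report."""
    text = report_text.lower()
    return sum(len(re.findall(re.escape(kw), text)) for kw in keywords)

sample = "We invest in big data and cloud computing; big data drives growth."
print(keyword_frequency(sample))  # 3
```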
The empirical analysis uses a fixed-effects regression approach to measure the impact of big data application on firm value.
Methodological statement in the paper specifying fixed-effects regression as the primary econometric approach.
The study analyzes panel data covering Chinese A-share listed companies from 2007 to 2021.
Description of dataset in the paper: panel of Chinese A-share listed companies spanning the years 2007–2021 (sample period stated).
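The fixed-effects approach described above can be illustrated with a minimal within-transformation estimator; this stdlib sketch (not the paper's code) demeans y and x by firm and fits a one-regressor OLS on the residuals.

```python
from collections import defaultdict

def within_demean(values, groups):
    """Subtract each group's mean (the 'within' / fixed-effects transformation)."""
    sums, counts = defaultdict(float), defaultdict(int)
    for v, g in zip(values, groups):
        sums[g] += v
        counts[g] += 1
    return [v - sums[g] / counts[g] for v, g in zip(values, groups)]

def fe_slope(y, x, firm_ids):
    """One-regressor fixed-effects estimate: OLS slope on firm-demeaned data."""
    yd, xd = within_demean(y, firm_ids), within_demean(x, firm_ids)
    return sum(a * b for a, b in zip(xd, yd)) / sum(v * v for v in xd)

# Two firms with different levels but the same within-firm slope of 2:
slope = fe_slope(y=[1, 3, 5, 7], x=[0, 1, 0, 1], firm_ids=["A", "A", "B", "B"])
print(slope)  # 2.0
```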
The analysis extends the dynamic taxation setup of Slavik and Yazici (2014).
Methodological claim: the model and solution approach build on and modify the framework from Slavik and Yazici (2014) (reference to prior theoretical framework rather than empirical data).
We characterize the optimal tax policy in an economy with human manual and cognitive labor, physical capital, and artificial intelligence (AI).
Theoretical/analytical work: the paper develops and analyzes a dynamic general-equilibrium model that includes manual and cognitive human labor, physical capital, and AI. (No empirical sample; model-based characterization.)
The field study used a 44-item questionnaire with 45 participants to measure comprehension, reported behavior change/adoption, and perceptions of volunteer legitimacy.
Methodological description provided in the paper: instrument and sample sizes explicitly reported.
No original quantitative dataset or controlled evaluation is reported in this paper.
Methodological description in the paper stating reliance on prior literature, conceptual analysis, and prescriptive recommendations; paper does not present new experiments.
The paper is a position/normative paper (not an empirical study) that uses conceptual analysis, literature synthesis, and prescriptive roadmapping rather than new quantitative experiments or datasets.
Explicit methodological statement in the paper summarizing genre and methods used; absence of reported original data or controlled evaluations.
There is a need for longitudinal and cross‑country empirical research to measure how hybrid work and AI tools affect promotion rates, network centrality, productivity, privacy harms, trust, and long‑term career trajectories.
Statement of research gaps derived from the paper's methodological approach (conceptual synthesis and secondary case studies) and absence of longitudinal/cross‑cultural primary data.
Robustness checks include mediator tests (costs, tariffs, logistics) and firm‑level subgroup analyses to establish heterogeneous responses and support mechanism claims.
Paper reports robustness strategy involving mediation analysis and subgroup DID estimations across multiple mediator variables and firm size groups using the stated databases.
Empirical identification relies on treating CAFTA as an exogenous shock and applying a difference‑in‑differences (DID) design on firm and customs data from 2000–2014.
Methodological description in the paper: DID strategy with treated vs control comparisons; data sources explicitly listed as the China Industrial Enterprise Database and China Customs Database covering 2000–2014.
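The canonical 2x2 comparison underlying such a DID design can be sketched as follows (illustrative numbers, not the paper's data):

```python
def did_estimate(treat_pre, treat_post, ctrl_pre, ctrl_post):
    """Difference-in-differences from group-period means:
    (treated post-pre change) minus (control post-pre change)."""
    mean = lambda xs: sum(xs) / len(xs)
    return (mean(treat_post) - mean(treat_pre)) - (mean(ctrl_post) - mean(ctrl_pre))

# Treated firms improve by 8, controls by 3 -> estimated effect of 5.
effect = did_estimate([10, 12], [18, 20], [8, 10], [11, 13])
print(effect)  # 5.0
```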
Highly Autonomous Cyber-Capable Agents (HACCAs) are AI systems able to plan and execute multi-stage cyber campaigns across the full attack lifecycle with minimal or no human direction.
Conceptual definition provided in the report; constructed via literature review and threat-framework formulation (no empirical sample; definitional/analytic).
Practical recommendations for firms and policymakers include investing in training for AI curation/evaluation/coordination, experimenting with decentralised decision rights and governance safeguards, and monitoring competitive dynamics related to model/platform providers.
Policy and practitioner takeaways explicitly presented in the discussion/implications sections, deriving from the conceptual framework and mapped literature.
The paper recommends a research agenda for AI economists: causal microeconometric studies (DiD, IVs, RCTs), structural models with hybrid human–AI agents, measurement work on GenAI use, distributional analysis and policy evaluation.
Explicit recommendations listed in the implications and research agenda sections; logical follow‑on from bibliometric findings about gaps in causal and measurement evidence.
Bibliometric mapping profiles the intellectual structure and evolution of the field but does not establish causal effects of GenAI on organisational outcomes.
Methodological limitation explicitly stated in the paper; bibliometric approach (co‑word, citation, thematic mapping) is descriptive and historical in scope.
Co‑word and thematic analyses reveal six coherent conceptual clusters that bridge technical AI topics (e.g., LLMs, GANs) with managerial themes (e.g., autonomy, coordination, decision‑making).
Thematic mapping and co‑word network analysis performed on the 212‑paper corpus; identification of six clusters reported in results.
Bibliometric and conceptual tools (VOSviewer, Bibliometrix) were used to identify performance trends, co‑word structures, thematic maps, and conceptual evolution in the GenAI–organisation literature.
Methods section: use of VOSviewer for network visualization and Bibliometrix for bibliometric statistics, co‑word analysis, thematic mapping and Sankey thematic evolution.
The study analysed a corpus of 212 Scopus‑indexed publications covering 2018–2025 to map emergent literature on Generative AI and organisational change.
Bibliometric dataset constructed from Scopus; sample size = 212 peer‑reviewed articles; time window 2018–2025; analyses performed with Bibliometrix and VOSviewer.
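The core of a co-word analysis like the one reported above is a keyword co-occurrence count over the corpus; a minimal sketch with illustrative keywords (not the study's data):

```python
from collections import Counter
from itertools import combinations

def coword_counts(keyword_lists):
    """Count how often each keyword pair co-occurs across documents."""
    counts = Counter()
    for kws in keyword_lists:
        for pair in combinations(sorted(set(kws)), 2):
            counts[pair] += 1
    return counts

papers = [["llm", "autonomy"],
          ["llm", "autonomy", "coordination"],
          ["gan", "creativity"]]
links = coword_counts(papers)
print(links[("autonomy", "llm")])  # 2
```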
Research agenda: causal studies (panel data, quasi-experiments) are needed to estimate effects of AI exposure on employment outcomes and to evaluate retraining/income-support interventions for pre-retirement populations.
Authors’ stated recommendation based on limits of cross-sectional regression results from the n=889 survey and the identified need to move from association to causation.
Study limitations: cross-sectional design, self-reported intentions, potential unobserved confounders, and limited generalizability to only three cities (Beijing, Guangzhou, Lanzhou).
Explicit methodological statements in the paper describing data and design: cross-sectional survey of 889 respondents from three cities and reliance on self-reported employment intentions.
The paper identifies future research directions, including empirical causal studies on how DPP+AI interventions change recycling rates, second‑hand market prices, and firm investment in circular processes; and modeling firm strategy around proprietary vs shared DPP data.
Stated research agenda and gaps in the paper informed by the study's findings and limitations; these are recommendations rather than empirical claims.
The study used a mixed-methods design focused on the Italian fashion and cosmetics industries, employing two online surveys, k‑means clustering (consumer segmentation), principal component analysis (to identify underlying dimensions of DPP functionalities and sustainability practices), and logistic regression (to identify adoption drivers).
Methods section summary provided in the paper; explicit statement of methods and industry context. Note: sample sizes and survey instrument details are not provided in the summary.
Two consumer segments were identified: 'aware' consumers (environmentally attuned and receptive to digital innovation and sustainability information) and 'unaware' consumers (prioritize immediate, tangible benefits like price and convenience over sustainability information).
K‑means cluster analysis applied to consumer responses from one of the online surveys in the Italian fashion and cosmetics context; summary identifies two clusters; sample sizes not reported.
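The k-means step behind such a segmentation can be sketched with plain Lloyd's iterations; the toy 2-D points stand in for survey responses, since the study's features and sample are not given.

```python
import math
import random

def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's algorithm: assign points to the nearest centroid, recompute."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        labels = [min(range(k), key=lambda j: math.dist(p, centroids[j]))
                  for p in points]
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return centroids, labels

# Two well-separated toy groups are recovered as two clusters:
pts = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
_, labels = kmeans(pts, k=2)
```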
This work is a conceptual/policy analysis rather than an original empirical study.
Explicit statement in the paper's Data & Methods section.
Study limitations include single-country (China) listed‑firm sample and reliance on secondary/administrative proxies for digitalization and innovation, which may miss internal qualitative aspects and introduce measurement error.
Authors’ stated limitations: sample restricted to Chinese A-share listed firms (2012–2022) and measures of digitalization/innovation derived from administrative/secondary data rather than direct observation/survey of internal practices.
No new primary empirical tests were performed in this paper; conclusions are based on secondary analysis and are broad and diagnostic rather than demonstrating causal mechanisms.
Explicit methodological statement in the Data & Methods and Limitations sections of the paper describing it as a qualitative literature review and synthesis.
Research should prioritize causal identification (IV, difference‑in‑differences, regression discontinuity) to disentangle whether ESG causes better financial outcomes or instead proxies for unobserved firm quality.
Methodological recommendation based on limitations in the reviewed literature (many observational/correlational studies); the paper argues for stronger causal designs going forward.
The authors propose research priorities for economists: quantify productivity gains from closing the actionability gap; estimate firm-level heterogeneity in evaluation capability and its effect on adoption; and model investment trade-offs between building evaluation-to-action pipelines versus accepting reduced LLM performance.
Paper's concluding recommendations for future research directions (explicitly listed by the authors).
The paper produces as primary outcomes a taxonomy of ten evaluation practices, the articulation of the results-actionability gap, and recommended strategies observed among successful teams.
Authors report these as the main outcomes of their thematic analysis and syntheses from the 19 interviews.
The study method consisted of semi-structured qualitative interviews with 19 practitioners across multiple industries and roles, analyzed via thematic coding.
Explicit methods section of the paper stating sample size (n=19), participant diversity, interview approach, and coding/analysis procedure.
AI-economics research should treat quantum capability as a distinct, gradually diffusing factor of production with sectoral specificity, and should model complementarities and policy counterfactuals endogenously.
Modeling recommendations grounded in sensitivity of macro outcomes to diffusion patterns, complementarities, and policy choices observed in the scenario and counterfactual analyses.
Model parameters are calibrated using historical diffusion of enabling technologies (cloud computing, GPUs, AI toolchains), industry case studies, and expert elicitation where hard data are lacking.
Empirical grounding section describing calibration sources: historical diffusion, case studies (materials discovery, optimization), and expert elicitation.
Uncertainty quantification is performed by running Monte Carlo or scenario ensembles and conducting sensitivity and robustness checks.
Methodological claim in the uncertainty quantification section describing Monte Carlo/scenario ensemble approach.
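A Monte Carlo ensemble of this kind reduces to sampling parameter draws, running the model on each, and summarizing the spread; a stdlib sketch with a toy outcome model (the report's actual model and parameter ranges are not specified here):

```python
import random
import statistics

def mc_summary(model, draws):
    """Run the model on each parameter draw and summarize the outcome spread."""
    outcomes = sorted(model(**d) for d in draws)
    n = len(outcomes)
    return {"mean": statistics.mean(outcomes),
            "p05": outcomes[int(0.05 * n)],
            "p95": outcomes[int(0.95 * n)]}

# Toy model: sectoral gain = adoption ceiling x per-adopter productivity gain.
rng = random.Random(42)
draws = [{"ceiling": rng.uniform(0.2, 0.8), "gain": rng.uniform(0.01, 0.05)}
         for _ in range(1000)]
summary = mc_summary(lambda ceiling, gain: ceiling * gain, draws)
```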
Sectoral TFP shocks are integrated into computational general equilibrium (CGE) or multi-sector growth models (and optionally DSGE variants) to simulate GDP, sector output, trade impacts, and labor reallocation.
Method section stating integration of sectoral TFP shocks into CGE/multi-sector growth models with optional DSGE short-run dynamics.
Sectoral adoption is translated into total factor productivity (TFP) shocks or sector-specific Hicks-neutral productivity improvements based on micro evidence of quantum advantages.
Methodological description of productivity mapping linking adoption to TFP shocks using micro evidence and case studies.
The paper uses empirical diffusion functions (logistic/S-curve, Bass model) calibrated to analogous technologies to project uptake over time.
Methodological description: diffusion modeling section explicitly states use of logistic/S-curve and Bass models and calibration to past technologies (cloud, GPUs).
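A discrete-time Bass model of the kind described takes only a few lines; the coefficients p (innovation), q (imitation), and m (market potential) below are illustrative, not the paper's calibrated values.

```python
def bass_path(p, q, m, periods):
    """Discrete Bass diffusion: new adopters per period = (p + q * N/m) * (m - N)."""
    N, path = 0.0, []
    for _ in range(periods):
        N += (p + q * N / m) * (m - N)
        path.append(N)
    return path

# Textbook-style coefficients trace the familiar S-curve toward the ceiling m:
path = bass_path(p=0.03, q=0.38, m=1.0, periods=30)
```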
The analysis used sentence‑transformer models to produce dense vector representations of article text and UMAP to project those embeddings into a low‑dimensional thematic map for cluster identification and gap detection.
Methods section specifying use of sentence‑transformer embeddings and UMAP for dimensionality reduction/visualization of article text.
The study followed a PRISMA protocol for literature selection and included peer‑reviewed journal articles published between 2014 and 2024, with a final sample size of n = 109.
Explicit methodological statement in the paper describing the literature search, inclusion/exclusion criteria, and final sample.
Twenty‑seven papers study marketing in banking without using NLP methods.
PRISMA systematic review; categorization of the 109 selected articles into the three coverage groups (8, 74, 27).