Evidence (11633 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	609	159	77	736	1615
Governance & Regulation	664	329	160	99	1273
Organizational Efficiency	624	143	105	70	949
Technology Adoption Rate	502	176	98	78	861
Research Productivity	348	109	48	322	836
Output Quality	391	120	44	40	595
Firm Productivity	385	46	85	17	539
Decision Quality	275	143	62	34	521
AI Safety & Ethics	183	241	59	30	517
Market Structure	152	154	109	20	440
Task Allocation	158	50	56	26	295
Innovation Output	178	23	38	17	257
Skill Acquisition	137	52	50	13	252
Fiscal & Macroeconomic	120	64	38	23	252
Employment Level	93	46	96	12	249
Firm Revenue	130	43	26	3	202
Consumer Welfare	99	51	40	11	201
Inequality Measures	36	105	40	6	187
Task Completion Time	134	18	6	5	163
Worker Satisfaction	79	54	16	11	160
Error Rate	64	78	8	1	151
Regulatory Compliance	69	64	14	3	150
Training Effectiveness	81	15	13	18	129
Wages & Compensation	70	25	22	6	123
Team Performance	74	16	21	9	121
Automation Exposure	41	48	19	9	120
Job Displacement	11	71	16	1	99
Developer Productivity	71	14	9	3	98
Hiring & Recruitment	49	7	8	3	67
Social Protection	26	14	8	2	50
Creative Output	26	14	6	2	49
Skill Obsolescence	5	37	5	1	48
Labor Share of Income	12	13	12	—	37
Worker Turnover	11	12	—	3	26
Industry	—	—	—	1	1

We develop a heterogeneous-agent framework in which AI-driven labour displacement affects the equity risk premium (ERP) through three co-equal channels.

Stated model contribution in the paper: a theoretical heterogeneous-agent framework that posits three channels linking AI-driven labour displacement to the ERP (productivity, participation compression, alignment risk).

high mixed When Does AI Raise the Equity Risk Premium? Displacement, Pa... equity risk premium (ERP)

Country-specific (fixed) effects show substantial heterogeneity: some countries (e.g., Denmark, Estonia, Korea) exhibit strongly positive deviations, while others (e.g., India, South Africa) show persistently negative deviations from average trajectories.

Reported country-specific fixed effects/deviations in abstract with illustrative examples of countries with positive and negative deviations. No numeric country-level effect sizes provided in abstract.

high mixed E-government development: Artificial intelligence vibrancy a... E-Government Development Index (EGDI)

At equilibrium prices in symmetric markets, consumer surplus is improved by cheaper search but may be decreased by more informative search, due to weakened inter-business competition.

Equilibrium price analysis within the theoretical model for symmetric firms; comparative statics showing how search cost and signal informativeness affect pricing, competition intensity, and consumer surplus. No empirical validation reported.

high mixed Agentic Markets: Equilibrium Effects of Improving Consumer S... consumer surplus (under equilibrium pricing)

The market (in the model) tracks indications of fit for searched products and indications of quality for chosen products, thereby guiding subsequent searches.

Model structure and assumptions specified in the paper: an endogenous information-tracking mechanism that records signals from searches and purchases and which then influences future search behavior; presented as part of the theoretical framework rather than empirical evidence.

high mixed Agentic Markets: Equilibrium Effects of Improving Consumer S... information available to guide search (market-tracked signals)

The top four models are statistically indistinguishable (mean score 0.147–0.153) while a clear tier gap separates them from the remaining four models (mean score <= 0.113).

Reported mean performance scores across 8 models and statement of statistical indistinguishability for the top four vs lower-tier four; numerical means provided.

high mixed SWE-PRBench: Benchmarking AI Code Review Quality Against Pul... mean model performance score

Behavioral factors — specifically trust calibration, cognitive load, and affective reactions — shape the transition of corporate AI initiatives from pilot deployments to scalable, sustained use.

Synthesis of human-AI interaction literature integrated with adoption frameworks (TAM and TOE); conceptual linkage rather than new empirical testing in this paper.

high mixed Behavioral Factors as Determinants of Successful Scaling of ... success of pilot-to-production transition (scalability and sustained use)

The proportion of consumers who adopt AI-induced services influences the pricing of those services and through price adjustments will further impact wages across traditional and non-traditional services.

Theoretical development and analysis in the paper via a demand-switching model and a Finite Change General Equilibrium framework introducing AI as a technological shock modeled through price adjustments.

high mixed Artificial Intelligence, Demand Switching and Sectoral Wage ... wages (across traditional and non-traditional services) and service prices

AI accelerates value-chain maturation while creating distinct risks — including professional responsibility tensions and potential system-level externalities.

Conceptual argument and risk analysis in the Article (theoretical reasoning and synthesis of management/ethics literature). No empirical causal estimate reported in the excerpt.

high mixed Rewired: Reconceptualizing Legal Services for the AI Age acceleration of value-chain maturation and emergence of professional responsibil...

The legal profession is at a crossroads, caught between intensifying fears of AI-driven displacement and a generational opportunity for transformation.

Author's synthesis and framing in the Article (conceptual assessment; literature/contextual synthesis). No empirical sample or experiment reported in the excerpt.

high mixed Rewired: Reconceptualizing Legal Services for the AI Age risk of AI-driven displacement and opportunity for transformation in the legal p...

This advantage is contingent upon robust AI governance, ethical frameworks, and the transition from 'pilot-lite' projects to integrated, data-driven 'AI-first' business models.

Conditional claim in the paper linking success to governance, ethics, and organizational integration; appears to be normative/analytical rather than empirical in the abstract.

high mixed The AI Advantage: Strategic Innovation and Global Expansion ... dependency of AI-driven advantage on governance, ethics, and organizational inte...

The paper reframes AI governance as a form of social policy shaped by political and economic institutions.

Conceptual/interpretive claim supported by the authors' comparative analysis and theoretical framing of AI governance alongside social policy dimensions.

high mixed Artificial intelligence governance and social policy diverge... conceptual framing of AI governance as social policy influenced by political-eco...

Although many regions use similar ethical language, substantial differences persist in risk allocation, regulatory enforcement, welfare integration and social protection.

Content analysis of policy documents showing overlap in ethical rhetoric but divergence across coded institutional dimensions related to risk allocation, enforcement, welfare integration and social protection (n=24).

high mixed Artificial intelligence governance and social policy diverge... similarity of ethical language vs. divergence in (a) risk allocation, (b) regula...

Five distinct governance models emerge: rights-based (EU), market-driven (US), state-centric (China), hybrid (Australia–Japan–Singapore) and developmental (India).

Typology derived from coding and index comparison of the 24 policy documents; authors classify regions/countries into five labeled governance models.

high mixed Artificial intelligence governance and social policy diverge... categorical classification of regional AI governance model

The findings show clear and systematic differences in how regions govern AI.

Comparative analysis of coded policy documents (n=24) producing indices that the authors interpret as showing systematic cross-regional differences in governance approaches.

high mixed Artificial intelligence governance and social policy diverge... degree and nature of differences in regional AI governance approaches

The documents are systematically coded across four institutional dimensions and converted into simple indices to compare governance approaches across the regions.

Author-reported method: systematic coding of documents on four institutional dimensions and construction of indices for cross-regional comparison (based on the 24 documents).

high mixed Artificial intelligence governance and social policy diverge... coding across four institutional dimensions and index construction

This study uses a comparative qualitative policy analysis based on 24 key AI policy documents published between 2018 and 2025 across the European Union, United States, China, and Indo-Pacific economies.

Author-stated research design and sample: systematic review/comparative qualitative policy analysis of 24 AI policy documents spanning 2018–2025 covering EU, US, China and Indo-Pacific economies.

high mixed Artificial intelligence governance and social policy diverge... research design and document sample

Actual sharing often contradicted willingness to share (the privacy paradox), with consistently high sharing rates across all conditions.

Observed discrepancy reported in the experimental results (N=240): despite variation in willingness-to-share, behavioral sharing rates remained high and similar across human, white-box AI, and black-box AI conditions.

high mixed Understanding Data-Sharing with AI Systems: The Roles of Tra... discrepancy between stated willingness to share vs actual sharing behavior

Energy policy uncertainty has a nonlinear effect on AI investment: moderate uncertainty fosters innovation, whereas high volatility hinders long-term investment.

Empirical analysis using nonlinear methods (WQR and WQC) on US quarterly data 2013Q1–2024Q4 (48 quarters), assessing distributional asymmetries across quantiles and time–frequency bands.

high mixed Do energy policy uncertainty, trade openness, and renewable ... AI investment

Machine-readable metrics and open scholarly infrastructure are reshaping scholarly profiles and incentives.

Conceptual and historical discussion referring to platforms and metrics (e.g., arXiv, Google Scholar, ORCID) as mechanisms changing incentives; no new empirical estimates provided.

high mixed A Brief History of AI for Scientific Discovery: Open Researc... changes in scholarly incentives and profile construction due to machine-readable...

That interconnected ecosystem is fundamentally restructuring who can do science (access), how fast discoveries propagate, and what counts as a valid scientific contribution.

Argumentative claim linking infrastructural and tool changes to changes in access, dissemination speed, and norms of contribution. The paper presents examples and narrative but no systematic empirical evaluation or sample.

high mixed A Brief History of AI for Scientific Discovery: Open Researc... access to scientific practice, speed of discovery dissemination, and norms of sc...

The most consequential development is not any single tool but the emergence of an interconnected ecosystem—AI agents, preprint platforms, open source codebases, and citation infrastructure—that forms a feedback loop.

Synthesis/argument based on multiple examples (LLM agents, preprint servers like arXiv, open-source code repositories, citation indices). No quantitative measurement or causal identification reported.

high mixed A Brief History of AI for Scientific Discovery: Open Researc... emergence of an interconnected scientific infrastructure ecosystem

The central tension in AI for science is between automation (building systems that replace human researchers) and augmentation (tools that amplify human creativity and judgement).

Analytical claim based on the paper's review of historical examples and conceptual discussion; no primary data or experimental design reported.

high mixed A Brief History of AI for Scientific Discovery: Open Researc... relationship between automation and augmentation in research practice

Science has repeatedly delegated its bottlenecks to machines—first inference, then search, then measurement, then the full workflow—and each delegation solves one problem while exposing a harder one underneath.

Interpretive historical argument drawing on examples across AI-for-science milestones (e.g., DENDRAL, search and inference systems, measurement automation, and contemporary end-to-end workflows). No quantitative sample or experimental method reported.

high mixed A Brief History of AI for Scientific Discovery: Open Researc... pattern of delegation and emergent bottlenecks in research workflows

Testing revealed AI excels at computational tasks but consistently misses nuanced factors like new construction rent premiums and infrastructure proximity impacts, validating the framework's hybrid structure as essential for professional-grade underwriting.

Findings from the controlled ChatGPT-4 test on the single 150-unit scenario: qualitative and comparative observations showing AI handled computations well but failed to capture specific local-market nuances, leading authors to endorse a hybrid human-AI framework.

high mixed AI-Augmented Real Estate Underwriting: A Practical Framework... output_quality

Phase Two requires human-led professional validation to correct AI limitations, apply local market knowledge, and integrate risk factors.

Framework description supported by observations from the controlled test where human review was used to correct AI outputs and apply local knowledge (e.g., adjusting for nuanced market factors).

high mixed AI-Augmented Real Estate Underwriting: A Practical Framework... task_allocation

The growth effects of AI are conditional on institutional quality and organizational adaptability.

Theoretical/analytical claim in the paper's framework and supported by the stylized-facts analysis indicating heterogeneity in productivity and growth outcomes by institutional and digital capacity indicators.

high mixed Artificial intelligence, institutional innovation and econom... growth effects of AI (heterogeneity/conditionality by institutions and adaptabil...

AI agents implicate many areas of law, ranging from agency law and contracts to tort liability and labor law.

Legal/policy analysis in the paper enumerating legal domains implicated by AI agents (qualitative analysis; no sample size).

high mixed Regulating AI Agents scope of legal domains implicated by AI agents

Firms of different ownership structures and industries exhibit different responses to the income distribution changes brought by AI (heterogeneous effects).

Paper reports performing grouped regressions by ownership type and industry to identify heterogeneous responses.

high mixed THE IMPACT OF ARTIFICIAL INTELLIGENCE ON ENTERPRISE INCOME D... heterogeneous change in income distribution (e.g., labor share or profit-labor r...

Financing constraints are a key factor that hinder firms' choice of technology level, which alters the corresponding income distribution effect of AI.

Paper posits financing constraint as a moderator and states it is considered in empirical analysis (interaction/moderation tests).

high mixed THE IMPACT OF ARTIFICIAL INTELLIGENCE ON ENTERPRISE INCOME D... change in income distribution effects (e.g., labor share) conditional on financi...

The development of AI may trigger new changes in the interest pattern between corporate profits and labor compensation.

Framed as the central research question/hypothesis; paper conducts empirical tests on firm panel data to evaluate this.

high mixed THE IMPACT OF ARTIFICIAL INTELLIGENCE ON ENTERPRISE INCOME D... relationship between corporate profits and labor compensation (interest pattern)

Artificial intelligence is profoundly reshaping the organizational form, operating model and operating mechanism of enterprises, and bringing unprecedented impact to the income distribution structure within enterprises.

Statement asserted in the paper's introduction/abstract; motivates empirical analysis using panel data of Shanghai and Shenzhen A-share non-financial listed firms (2010–2022).

high mixed THE IMPACT OF ARTIFICIAL INTELLIGENCE ON ENTERPRISE INCOME D... income distribution structure within enterprises (general claim)

These findings contribute to the literature by providing empirical insights from a developing economy, where unique socioeconomic and institutional factors shape the impact of AI.

Scope/claim of contribution based on the study's context (Cambodia) and its dataset (n = 351).

high mixed Navigating AI‐Induced Job Displacement and Skill Demands: In... other

This study employed PLS‐SEM analysis on data from 351 respondents, revealing significant workforce reshaping.

PLS-SEM analysis conducted on survey data (n = 351) as reported in the paper.

high mixed Navigating AI‐Induced Job Displacement and Skill Demands: In... job_displacement

The rapid adoption of artificial intelligence (AI) is fundamentally transforming labor markets worldwide, presenting both opportunities and challenges.

Statement made in the paper as background/justification; not based on the study's empirical data.

high mixed Navigating AI‐Induced Job Displacement and Skill Demands: In... other

Traffic performance is sensitive to the distribution of safe time gaps and the proportion of RL vehicles.

Simulation results comparing Fundamental Diagrams across scenarios with different distributions of safe time gaps and shares of RL-controlled vehicles. Number of simulation runs or replicates not stated in the claim text.

high mixed Macroscopic Characteristics of Mixed Traffic Flow with Deep ... traffic performance (e.g., flow, capacity) sensitivity to time-gap distribution ...

Chat intent varies systematically with both the timing of chat relative to search and the category of products later purchased within the same journey.

Cross-tabulation/regression-style descriptive analysis relating classified chat intents to timing (relative to search) and subsequent purchased product categories in journey-level logs.

high mixed Shopping with a Platform AI Assistant: Who Adopts, When in t... task_allocation

AI assistance in safety engineering is fundamentally a collaboration design problem rather than merely a software procurement decision: the same tool can either degrade or improve analysis quality depending entirely on how it is used.

Synthesis of the formal framework and analytic results in the paper (theoretical argument; no empirical sample reported).

high mixed The Competence Shadow: Theory and Bounds of AI Assistance in... output_quality

The paper concludes by discussing open challenges in evaluating harmful manipulation by AI models.

Paper includes a discussion/conclusion section enumerating open challenges; stated in abstract.

high mixed Evaluating Language Models for Harmful Manipulation identification of open research and evaluation challenges

We identify significant differences across our tested geographies, suggesting that AI manipulation results from one geographic region may not generalise to others.

Empirical comparison across three locales (US, UK, India) showing statistically significant differences in manipulation outcomes by geography.

high mixed Evaluating Language Models for Harmful Manipulation geographic variation in manipulative behaviour/effects

Context matters: AI manipulation differs between domains, suggesting that it needs to be evaluated in the high-stakes context(s) in which an AI system is likely to be used.

Comparative analysis across three domains (public policy, finance, health) showing differences in manipulative behaviour and/or impact by domain in the empirical study.

high mixed Evaluating Language Models for Harmful Manipulation variation in manipulative behaviour/effects across use domains

AUROC_2 and M-ratio produce fully inverted model rankings, demonstrating these metrics answer fundamentally different evaluation questions.

Metric comparison across models showing that AUROC_2-based ranking and M-ratio-based ranking are fully inverted in the reported results on the evaluated dataset.

high mixed Do LLMs Know What They Know? Measuring Metacognitive Efficie... model ranking by AUROC_2 versus model ranking by M-ratio

Temperature manipulation shifts Type-2 criterion while meta-d' remains stable for two of four models, dissociating confidence policy from metacognitive capacity.

Experimental manipulation (temperature changes) applied to models; reported result that Type-2 criterion shifted with temperature while meta-d' was stable for two models (out of four) in the 224,000-trial dataset.

high mixed Do LLMs Know What They Know? Measuring Metacognitive Efficie... Type-2 criterion (confidence policy) and meta-d' (metacognitive capacity)

Metacognitive efficiency is domain-specific, with different models showing different weakest domains, invisible to aggregate metrics.

Domain-level analyses reported in the paper showing per-domain M-ratio results and identification of different weakest domains per model, contrasted with aggregate metric behavior.

high mixed Do LLMs Know What They Know? Measuring Metacognitive Efficie... domain-specific metacognitive efficiency (M-ratio) across task domains

Metacognitive efficiency varies substantially across models even when Type-1 sensitivity is similar — Mistral achieves the highest d' but the lowest M-ratio.

Empirical comparison of Type-1 sensitivity (d') and metacognitive efficiency (M-ratio) across the four evaluated LLMs on the 224,000 QA trials; explicit statement that Mistral had highest d' but lowest M-ratio.

high mixed Do LLMs Know What They Know? Measuring Metacognitive Efficie... Type-1 sensitivity (d') and metacognitive efficiency (M-ratio)

The paper's findings deepen the understanding of algorithmic aversion in the context of generative AI and offer practical guidance for creators and platforms navigating transparency versus engagement trade-offs.

Authors' interpretation and conclusions summarized in the abstract, based on the two experiments (study 1: n = 325; study 2: n = 371).

high mixed AI content labeling and user engagement on social media: The... interpretation of experimental results (algorithmic aversion / guidance implicat...

The paper's primary contribution is to combine established ingredients—attention scarcity, free-entry dilution, superstar effects, and preferential attachment—into a unified framework directed at claims about AI-enabled entrepreneurship.

Stated contribution and methodological description in the paper (synthesis and applied formalisation); this is a descriptive/methodological claim rather than an empirical result.

high mixed The Economics of Builder Saturation in Digital Markets n/a (methodological contribution)

Modern pretrained time-series foundation models can forecast without task-specific training, but they do not fully incorporate economic behavior.

Statement in paper's introduction/abstract summarizing prior capabilities and limitations of pretrained time-series foundation models (no experimental sample or numeric evidence provided in the excerpt).

high mixed GARP-EFM: Improving Foundation Models with Revealed Preferen... ability of pretrained time-series models to forecast and degree to which they in...

The governance risk-mitigation effects of AI operate through increasing financial risk exposure.

Authors' mechanism tests indicate a relationship between AI adoption and changes in financial risk exposure measures, which they interpret as a channel affecting executive behavior.

high mixed The risk-mitigation effects of artificial intelligence adopt... financial risk exposure (financial risk/proxy metrics)

Organizational culture and technological readiness moderate the effectiveness of generative AI integration in decision-making processes.

The paper reports moderation effects tested in the SEM framework using survey data from senior managers, decision-makers, and AI adoption specialists (SmartPLS). No numeric moderator effect sizes or sample size provided in the excerpt.

high mixed The Strategic Impact of Generative Artificial Intelligence o... effectiveness of generative AI integration in decision-making (moderation effect...

The paper draws comparisons between inference tokens and established commodities such as electricity, carbon emission allowances, and bandwidth to motivate financialization.

Theoretical comparison and historical analysis (drawing on the historical experience of electricity futures markets and commodity financialization theory) as presented in the paper.

high mixed AI Token Futures Market: Commoditization of Compute and Deri... similarity / comparability to established commodity markets

« Prev 1 2 3 … 8 9 10 … 232 233 Next »