Evidence (5126 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	369	105	58	432	972
Governance & Regulation	365	171	113	54	713
Research Productivity	229	95	33	294	655
Organizational Efficiency	354	82	58	34	531
Technology Adoption Rate	277	115	63	27	486
Firm Productivity	273	33	68	10	389
AI Safety & Ethics	112	177	43	24	358
Output Quality	228	61	23	25	337
Market Structure	105	118	81	14	323
Decision Quality	154	68	33	17	275
Employment Level	68	32	74	8	184
Fiscal & Macroeconomic	74	52	32	21	183
Skill Acquisition	85	31	38	9	163
Firm Revenue	96	30	22	—	148
Innovation Output	100	11	20	11	143
Consumer Welfare	66	29	35	7	137
Regulatory Compliance	51	61	13	3	128
Inequality Measures	24	66	31	4	125
Task Allocation	64	6	28	6	104
Error Rate	42	47	6	—	95
Training Effectiveness	55	12	10	16	93
Worker Satisfaction	42	32	11	6	91
Task Completion Time	71	5	3	1	80
Wages & Compensation	38	13	19	4	74
Team Performance	41	8	15	7	72
Hiring & Recruitment	39	4	6	3	52
Automation Exposure	17	15	9	5	46
Job Displacement	5	28	12	—	45
Social Protection	18	8	6	1	33
Developer Productivity	25	1	2	1	29
Worker Turnover	10	12	—	3	25
Creative Output	15	5	3	1	24
Skill Obsolescence	3	18	2	—	23
Labor Share of Income	7	4	9	—	20

Adoption Remove filter

Richer personalization depends on granular data and cross-device identity, creating privacy externalities and compliance risks (Personalization vs privacy trade-off).

Data source inventory and privacy literature review; supported by observational industry trends (move to first-party identity) rather than a quantified sample in the paper.

high negative Artificial Intelligence for Personalized Digital Advertising... degree of personalization versus exposure to privacy risks/compliance failures

The cost of formalizing informal labor (CFIL) implies formalizing a worker costs on average 88% more than the informal wage in 2023.

New CFIL metric calculated for 19 countries (2023 baseline) by estimating the additional employer cost of hiring and formalizing an informal worker and reporting it relative to the informal wage, using compiled statutory obligations and informal wage benchmarks.

high negative Salaried Labor Costs in Latin America and the Caribbean: A T... CFIL (additional cost of formalizing) as % above informal wage

There is sizable attrition in the pipeline from applicant admission through to direct employment of AI graduates, indicating leakages at multiple stages (application → admission → graduation → employment).

Quantification of human-resource losses across pipeline stages using the monitoring dataset for the 191 institutions; descriptive counts/percentages of entrants, admitted students, graduates, and those directly employed in AI roles (pipeline loss metrics reported in paper).

high negative Employment og Graduates of Educational Programs in the Field... Attrition rates / absolute losses at sequential pipeline stages (applicants → ad...

Graduates from Russian universities running AI-related educational programs together with alternative training routes (self-education and professional retraining) satisfy 43.9% of estimated national AI personnel demand.

Monitoring dataset of 191 Russian universities implementing AI-related programs; aggregated counts of university graduates plus estimated contributions from self-education and professional retraining compared to an estimated national AI personnel demand (coverage reported as 43.9%).

high negative Employment og Graduates of Educational Programs in the Field... Share (%) of estimated national AI personnel demand satisfied by combined univer...

AI automates routine and some mid-skill tasks, reducing employment in those occupations.

Empirical task-based exposure measures mapping AI capabilities to occupational task content, microdata analyses of employment by occupation using household/employer/administrative datasets, and panel regressions/decompositions that document within-occupation declines and between-occupation shifts.

high negative Intelligence and Labor Market Transformation: A Critical Ana... employment levels in routine and mid-skill occupations

Relying on secondary literature limits the paper's ability to make causal inferences and constrains empirical generalizability to all sectors or countries.

Stated limitations in the paper's Data & Methods section acknowledging scope and inferential constraints.

high negative Who Loses to Automation? AI-Driven Labour Displacement and t... causal inference strength and generalizability of conclusions

Increases in K_T reduce employment levels in affected firms and industries even when aggregate productivity rises.

Panel econometric estimates at firm and industry levels relating K_T intensity to employment outcomes, controlling for demand, input prices, and firm characteristics; difference-in-differences specifications and instrumental-variable robustness checks; corroborated by sectoral case studies.

high negative The Macroeconomic Transition of Technological Capital in the... employment (firm- and industry-level employment counts or employment growth)

Rising technological capital (K_T) — proxied by robot/automation density, software and intangible capital accumulation, AI adoption surveys, and AI-related patenting — leads to a decline in labor’s share of output.

Firm- and industry-level panel regressions linking constructed K_T intensity measures to labor shares, supported by macro growth-accounting decompositions; robustness checks include difference-in-differences and instrumenting adoption with plausibly exogenous shocks (e.g., cross-border technology diffusion, trade shocks); validated with cross-country comparisons and case studies.

high negative The Macroeconomic Transition of Technological Capital in the... labor share of income (share of output paid to labor)

Traffic performance is evaluated using the Fundamental Diagram (FD) under varying driver heterogeneity, heterogeneous time-gap penetration levels, and different shares of RL-controlled vehicles.

Description of experimental/evaluation setup in the paper: macroscopic evaluation via Fundamental Diagram across varied scenario parameters. No numeric sample size provided in the claim text.

high neutral Macroscopic Characteristics of Mixed Traffic Flow with Deep ... traffic performance (via Fundamental Diagram) under varied heterogeneity and RL ...

CriQ is a sister app to Dream11, India's largest fantasy sports platform with over 250 million users.

Descriptive statement in the paper providing context about the application domain and user base.

high neutral Schema on the Inside: A Two-Phase Fine-Tuning Method for Hig... user base size

In the near term, the most plausible equilibrium is bounded autonomy, in which AI agents operate as supervised co-pilots, monitoring systems, and constrained execution modules embedded within human decision processes.

Theoretical argument and forward-looking assessment by the authors based on the proposed framework and plausibility considerations; not presented as the result of a causal empirical study in the excerpt.

high neutral AI Agents in Financial Markets: Architecture, Applications, ... expected equilibrium mode of AI agent autonomy in finance (bounded autonomy / su...

Economic evaluations of GLAI should account for end-to-end risk externalities (error propagation, institutional trust, rights impacts), not only short-term productivity gains.

Methodological recommendation grounded in conceptual synthesis of technical, behavioral, and legal risks; normative argument rather than empirical result.

high neutral Why Avoid Generative Legal AI Systems? Hallucination, Overre... comprehensiveness of economic evaluations (inclusion of externalities vs. narrow...

Generative Legal AI (GLAI) systems are built on token-prediction (LLM) architectures rather than formal legal-reasoning architectures.

Conceptual and technical analysis in the paper distinguishing GLAI from other legal-tech; literature synthesis on common LLM architectures. No original empirical dataset or sample size—qualitative/technical review.

high neutral Why Avoid Generative Legal AI Systems? Hallucination, Overre... underlying model architecture type (token-prediction vs. formal-reasoning)

The paper's formalism shows that prompt/system messages shape distributions over possible execution paths (indirect control) but do not evaluate actual partial paths at runtime.

Formal mapping in the paper that treats prompts as shaping prior over paths; conceptual argument and illustrative examples.

high neutral Runtime Governance for AI Agents: Policies on Paths degree of control over execution path (distributional shaping vs. path-specific ...

Through a thematic review of existing research, the authors identified recurring themes about incentive schemes: their components, how researchers manipulate them, and their impact on research outcomes.

Authors' stated method and findings: thematic review (the scope/number of reviewed papers not specified in excerpt).

high neutral Incentive-Tuning: Understanding and Designing Incentives for... themes in incentive design practices and reported impacts on empirical study out...

A critical aspect of conducting human–AI decision-making studies is the role of participants, often recruited through crowdsourcing platforms.

Claim based on the authors' thematic literature review noting participant sourcing practices (specific studies and counts not given in excerpt).

high neutral Incentive-Tuning: Understanding and Designing Incentives for... participant recruitment source (e.g., crowdsourcing) and its influence on study ...

Researchers conduct empirical studies investigating how humans use AI assistance for decision-making and how this collaboration impacts results.

Statement summarizing the research landscape; supported implicitly by the authors' thematic review of existing empirical studies (number of studies not specified in excerpt).

high neutral Incentive-Tuning: Understanding and Designing Incentives for... human behaviour and decision outcomes when assisted by AI (empirical study outco...

The study provides empirical evidence specific to a small open EU economy (Slovakia) on the relationship between AI adoption and labour productivity.

Use of harmonised Eurostat enterprise and productivity data for Slovakia and EU27 over 2021–2024, analysed with descriptive statistics, gap analysis, dynamics of change, correlation, and an illustrative regression model.

high neutral Artificial Intelligence Adoption and Labour Productivity in ... Empirical characterization of AI adoption and labour productivity relationship f...

Returns to AI are heterogeneous across firms; estimating treatment effects requires attention to selection, complementarities, and dynamic adoption pipelines.

Methodological argument referencing treatment-effect literature and observed firm heterogeneity; supported by conceptual examples rather than a single empirical treatment-effect estimate.

high neutral Modern Management in the Age of Artificial Intelligence: Str... heterogeneity in returns to AI adoption (firm-level productivity or performance ...

Methods combine targeted literature synthesis, comparative conceptual analysis, and framework building (with recent scholarly and institutional sources reviewed).

Explicit methodological statement in the paper describing the review and analytic approach; no primary-data methods used.

high null result Behavioral Factors as Determinants of Successful Scaling of ... methodological approach (literature synthesis and conceptual framework developme...

AI coding assistants are a high-visibility class of corporate AI and are given special attention as an illustrative case in the paper.

Paper specifically calls out AI coding assistants as a focal example in the conceptual analysis and discussion; based on literature review rather than original measurement.

high null result Behavioral Factors as Determinants of Successful Scaling of ... role of coding assistants as illustrative case for scaling and behavioral dynami...

AI’s societal integration in India is gradual, and therefore its impact on economic variables (like wages and inequality) is also gradual.

Synthesis in the paper based on empirical adoption figures (e.g., <0.7% adoption for AI ride services) and the observed weak changes in inequality measures in the transportation sector.

high null result Artificial Intelligence, Demand Switching and Sectoral Wage ... pace of AI integration and consequent economic impact

Despite AI’s introduction, wage inequality in the transportation sector (measured by the Gini coefficient) has not significantly worsened.

Empirical investigation reported in the paper analyzing transportation-sector wage disparities over time using the Gini coefficient; the paper reports no significant worsening post-introduction.

high null result Artificial Intelligence, Demand Switching and Sectoral Wage ... Gini coefficient of wages in the transportation sector

The Article translates these insights into risk-sensitive guideposts for modernizing governance of AI-enabled tools and emerging modalities, from agentic systems to blockchain-deployed smart contracts.

Prescriptive/conceptual policy guidance presented in the Article (normative recommendations; governance framework).

high null result Rewired: Reconceptualizing Legal Services for the AI Age provision of governance guideposts for AI-enabled legal technologies

The Innovation Frontier traces LegalTech’s evolution from 2000s-vintage e-discovery to generative AI.

Historical/chronological analysis in the Article (literature review/history of LegalTech provided by authors).

high null result Rewired: Reconceptualizing Legal Services for the AI Age narrative/historical scope of LegalTech evolution covered in the Article

The Legal Services Value Chain disaggregates the lifecycle of a legal matter into five distinct nodes of activity.

Model description in the Article (conceptual architecture; decomposition of legal work).

high null result Rewired: Reconceptualizing Legal Services for the AI Age number and structure of nodes in the proposed value-chain model

The Article develops two core organizing models: the Legal Services Value Chain and the Innovation Frontier.

Explicit claim in the Article describing conceptual/model contributions (theoretical/model-building).

high null result Rewired: Reconceptualizing Legal Services for the AI Age presence of two organizing conceptual models in the Article

This Article provides a practical framework for navigating the shifting terrain of legal innovation and AI.

Statement of purpose in the Article (conceptual contribution; framework development). No empirical validation reported in the excerpt.

high null result Rewired: Reconceptualizing Legal Services for the AI Age existence of a practical framework for legal-AI governance and strategy

There are action tools for higher-stakes tasks like financial transactions.

Observed examples of action tools in the monitored MCP repositories that perform higher-stakes functions, with financial transactions given as an explicit example in the paper.

high null result How are AI agents used? Evidence from 177,000 MCP tools presence of action tools enabling high-stakes tasks (e.g., financial transaction...

We use O*NET mapping to identify each tool's task domain and consequentiality.

Method described in paper: mapping each tool to O*NET task domains and consequentiality using the monitored tool metadata and descriptions.

high null result How are AI agents used? Evidence from 177,000 MCP tools method for assigning task domain and consequentiality

We categorise tools according to their direct impact: perception tools to access and read data, reasoning tools to analyse data or concepts, and action tools to directly modify external environments.

Methodological classification described in paper (taxonomy of tools into perception, reasoning, action); applied to monitored MCP server dataset.

high null result How are AI agents used? Evidence from 177,000 MCP tools tool category / taxonomy

AI transparency alone did not significantly increase data-sharing.

Result reported from the randomized experiment (N=240) comparing actual data-sharing rates across human, white-box AI, and black-box AI conditions; authors state that transparency alone did not produce a significant increase in sharing.

high null result Understanding Data-Sharing with AI Systems: The Roles of Tra... actual data-sharing (behavioral sharing decisions)

These energy reductions are achieved without statistically significant performance loss.

Paper states that performance loss is not statistically significant across the evaluated benchmarks (as reported in the abstract).

high null result EcoThink: A Green Adaptive Inference Framework for Sustainab... model performance / benchmark accuracy (no statistically significant degradation...

The research surveys current methodologies and empirical evidence related to regulatory early-warning systems and desegregates (synthesizes) findings from empirical information.

Paper states it examines existing methodologies and empirical findings (literature review / synthesis); no scope (e.g., number of studies reviewed) given in the excerpt.

high null result Research on the Construction of an AI-Driven Financial Regul... state of evidence on methodologies for regulatory early-warning of fiscal risk

The study uses a mixed-methods approach combining qualitative insights from 1,500 semi-structured customer interviews with quantitative analysis of transaction records, loan repayment histories, and account activity.

Paper states methods explicitly in abstract: 1,500 semi-structured interviews plus quantitative analysis of transaction records, loan repayment histories, and account activity (case-study approach across three platforms).

high null result Artificial Intelligence, Climate Resilience, and Financial I... research_methodology

Three interlocking threads characterize AI for science: (1) AI as research instrument, (2) AI for research infrastructure, and (3) the reshaping of scholarly profiles and incentives by machine-readable metrics.

Conceptual framework presented in the paper; organization of topics rather than empirical measurement. The paper indicates these threads are followed through historical and contemporary examples.

high null result A Brief History of AI for Scientific Discovery: Open Researc... conceptual decomposition of AI-for-science developments

The history of artificial intelligence for scientific discovery is not a two year story about chatbots learning to write papers; it is a sixty year story beginning with DENDRAL (1965).

Historical narrative / literature review citing early systems such as DENDRAL (1965) and subsequent developments in scholarly infrastructure (arXiv, Google Scholar, ORCID). No empirical sample or statistical test reported.

high null result A Brief History of AI for Scientific Discovery: Open Researc... historical scope and timeline of AI for scientific discovery

At the macroeconomic level, Kazakhstan's state programs (e.g., 'Digital Kazakhstan' and the Industrial and Innovation Development Program) and international indices (WIPO Global Innovation Index, OECD digital assessments, IMF data) are used to evaluate and position Kazakhstan within the global digital economy.

Macro-level analysis using national programs and international indices described in the article to assess Kazakhstan's digital economy standing.

high null result Digitalization and labor costs: efficiency of industrial ent... Kazakhstan's position in global digital economy (evaluative metric)

This paper uses panel data of China's Shanghai and Shenzhen A-share non-financial listed companies from 2010 to 2022 to study AI's effects.

Explicit data description in the paper (sample frame and period stated).

high null result THE IMPACT OF ARTIFICIAL INTELLIGENCE ON ENTERPRISE INCOME D... n/a (methodological/data claim)

Deep Reinforcement Learning (DRL) has shown strong microscopic performance in car-following conditions, but its macroscopic traffic flow characteristics remain underexplored.

Literature synthesis / motivation in the paper (review of existing DRL work focused on microscopic performance). No empirical sample size.

high null result Macroscopic Characteristics of Mixed Traffic Flow with Deep ... extent of prior research on macroscopic traffic flow characteristics for DRL mod...

The paper is intentionally public-safe: it omits proprietary implementation details, training recipes, thresholds, hidden-state instrumentation, deployment procedures, and confidential system design choices, and therefore the contribution is theoretical rather than operational.

Statement about the paper's scope and publication choices; directly asserted by the authors regarding omitted content and the theoretical nature of the contribution.

high null result A Public Theory of Distillation Resistance via Constraint-Co... scope_and_nature_of_contribution (theoretical vs operational)

The paper introduces a constraint-coupled reasoning framework with four elements: bounded transition burden, path-load accumulation, dynamically evolving feasible regions, and a capability-stability coupling condition.

Descriptive/theoretical: the paper explicitly defines and enumerates these four framework elements. This is a claim about the paper's content rather than an empirical finding.

high null result A Public Theory of Distillation Resistance via Constraint-Co... presence_and_definition_of_framework_components

The analysis uses data on 31 million users of Ctrip, China's largest online travel platform, to study "Wendao," an LLM-based AI assistant integrated into the platform.

Descriptive statement in the paper about data source: platform logs/usage data for Ctrip covering 31 million users and the Wendao assistant.

high null result Shopping with a Platform AI Assistant: Who Adopts, When in t... other

The top three platforms (Claude, ChatGPT, and DeepSeek) receive statistically indistinguishable satisfaction ratings despite vast differences in funding, team size, and benchmark performance.

Statistical comparison of self-reported satisfaction ratings collected via the paper's survey (overall N=388); statistical tests reported in paper (specific test and per-platform n not provided in abstract).

high null result Beyond Benchmarks: How Users Evaluate AI Chat Assistants user satisfaction ratings

We ran a behavioral experiment (N = 200) in which participants predicted the AI's correctness across four AI calibration conditions: standard, overconfidence, underconfidence, and a counterintuitive "reverse confidence" mapping.

Reported experimental design and sample size in the paper (behavioral experiment with N = 200; four experimental conditions).

high null result Learning to Trust: How Humans Mentally Recalibrate AI Confid... experimental conditions / task setup (participants predicting AI correctness)

Study methodology: Two online experiments were conducted via the crowdsourcing platform Prolific with sample sizes study 1: n = 325 and study 2: n = 371; participant mean age = 35 years; 55% female.

Methodological and sample description provided in the abstract.

high null result AI content labeling and user engagement on social media: The... study design and sample characteristics

Late disclosure of AI involvement did not improve affective engagement for AI-generated content.

Reported experimental result in the abstract from the two online studies manipulating disclosure timing (early vs. late).

high null result AI content labeling and user engagement on social media: The... affective engagement for AI-generated content under late disclosure

The study was conducted by the Mohammed bin Rashid School of Government’s Future of Government Center, in collaboration with global AI pioneers.

Authorship and collaboration statement in the report.

high null result Charting AI Governance Future in the Arab Region: A Policy R... institutional authorship and collaboration on the study

The report highlights the key findings of a field study covering ten Arab countries to explore the realities and challenges of AI governance.

Report statement describing the geographic scope of the field study (explicitly: ten Arab countries).

high null result Charting AI Governance Future in the Arab Region: A Policy R... geographic coverage of the field study (number of countries)

The recommendations are based on regional research that included hundreds of leaders active in the AI domains, from the public and private sectors.

Report statement claiming participant base of the underlying research (described as 'hundreds of leaders').

high null result Charting AI Governance Future in the Arab Region: A Policy R... scope and participant coverage of the underlying research

« Prev 1 2 3 … 10 11 12 … 102 103 Next »