Evidence (5126 claims)
Adoption: 5126 claims
Productivity: 4409 claims
Governance: 4049 claims
Human-AI Collaboration: 2954 claims
Labor Markets: 2432 claims
Org Design: 2273 claims
Innovation: 2215 claims
Skills & Training: 1902 claims
Inequality: 1286 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 369 | 105 | 58 | 432 | 972 |
| Governance & Regulation | 365 | 171 | 113 | 54 | 713 |
| Research Productivity | 229 | 95 | 33 | 294 | 655 |
| Organizational Efficiency | 354 | 82 | 58 | 34 | 531 |
| Technology Adoption Rate | 277 | 115 | 63 | 27 | 486 |
| Firm Productivity | 273 | 33 | 68 | 10 | 389 |
| AI Safety & Ethics | 112 | 177 | 43 | 24 | 358 |
| Output Quality | 228 | 61 | 23 | 25 | 337 |
| Market Structure | 105 | 118 | 81 | 14 | 323 |
| Decision Quality | 154 | 68 | 33 | 17 | 275 |
| Employment Level | 68 | 32 | 74 | 8 | 184 |
| Fiscal & Macroeconomic | 74 | 52 | 32 | 21 | 183 |
| Skill Acquisition | 85 | 31 | 38 | 9 | 163 |
| Firm Revenue | 96 | 30 | 22 | — | 148 |
| Innovation Output | 100 | 11 | 20 | 11 | 143 |
| Consumer Welfare | 66 | 29 | 35 | 7 | 137 |
| Regulatory Compliance | 51 | 61 | 13 | 3 | 128 |
| Inequality Measures | 24 | 66 | 31 | 4 | 125 |
| Task Allocation | 64 | 6 | 28 | 6 | 104 |
| Error Rate | 42 | 47 | 6 | — | 95 |
| Training Effectiveness | 55 | 12 | 10 | 16 | 93 |
| Worker Satisfaction | 42 | 32 | 11 | 6 | 91 |
| Task Completion Time | 71 | 5 | 3 | 1 | 80 |
| Wages & Compensation | 38 | 13 | 19 | 4 | 74 |
| Team Performance | 41 | 8 | 15 | 7 | 72 |
| Hiring & Recruitment | 39 | 4 | 6 | 3 | 52 |
| Automation Exposure | 17 | 15 | 9 | 5 | 46 |
| Job Displacement | 5 | 28 | 12 | — | 45 |
| Social Protection | 18 | 8 | 6 | 1 | 33 |
| Developer Productivity | 25 | 1 | 2 | 1 | 29 |
| Worker Turnover | 10 | 12 | — | 3 | 25 |
| Creative Output | 15 | 5 | 3 | 1 | 24 |
| Skill Obsolescence | 3 | 18 | 2 | — | 23 |
| Labor Share of Income | 7 | 4 | 9 | — | 20 |
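For readers who want to slice the matrix programmatically, here is a minimal sketch of tallying direction shares. The row values are hand-copied from a few rows of the table above (0 stands in for "—"), and the helper name is illustrative, not part of any dataset API.

```python
# Illustrative: tally direction shares from a few rows of the evidence matrix.
# Values are copied from the table above; 0 stands in for "—" (no claims).
rows = {
    "Firm Productivity":  {"positive": 273, "negative": 33, "mixed": 68, "null": 10},
    "AI Safety & Ethics": {"positive": 112, "negative": 177, "mixed": 43, "null": 24},
    "Job Displacement":   {"positive": 5, "negative": 28, "mixed": 12, "null": 0},
}

def positive_share(counts):
    """Share of claims with a positive direction, over all classified claims."""
    total = sum(counts.values())
    return counts["positive"] / total if total else 0.0

for outcome, counts in rows.items():
    print(f"{outcome}: {positive_share(counts):.0%} positive "
          f"of {sum(counts.values())} classified claims")
```

Note that the classified counts may fall slightly short of the table's Total column, which appears to include claims without a coded direction.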
Adoption
Generative AI serves as an effective 'wingman' for employment lawyers, capable of replacing substantial junior associate work while requiring continued human expertise for client counseling, supervision, and final legal advice preparation.
Authors' synthesis of experimental results showing AI-produced substantive analysis plus discussion about remaining limitations (e.g., citation errors) and required human oversight; qualitative assertion about substitutability for junior associate tasks.
We evaluate 14 LLMs under zero-shot prompting and retrieval-augmented settings and observe a clear performance gap.
Experimental evaluation reported in the paper: authors state they ran experiments on 14 different large language models, under zero-shot and retrieval-augmented configurations, and observed differing performance across models.
Policy implication: smarter, better-coordinated green governance is needed to address the negative local impacts and the crowding-out interaction between AI and environmental regulation.
Policy recommendation drawn in the abstract based on the empirical spatial findings (negative local effects and negative interaction).
Substantial regional gaps persist: leading eastern provinces approach a UCEE value of 1.0 while some northeastern provinces remain below 0.1.
Regional UCEE index estimates from the Super-SBM model across the 30 provinces reported in the abstract.
The systemic implications of AI in finance depend less on model intelligence alone than on how agent architectures are distributed, coupled, and governed across institutions.
Central argumentative claim supported by the AFMM conceptual model and an illustrative empirical application described in the paper (modeling + event-study approach); no full-sample details provided in the excerpt.
The Agentic Financial Market Model (AFMM), a stylised agent-based representation, links agent design parameters (autonomy depth, heterogeneity, execution coupling, infrastructure concentration, supervisory observability) to market-level outcomes including efficiency, liquidity resilience, volatility, and systemic risk.
Presentation of a stylised agent-based model (AFMM) in the paper; conceptual modelling linking specified agent parameters to macro/market outcomes. No empirical sample size reported in the excerpt.
Financial AI agents can be described by a four-layer architecture covering data perception, reasoning engines, strategy generation, and execution with control.
Conceptual framework proposed by the authors (theoretical/architectural proposal); no empirical testing or sample size provided.
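One way to make this framing concrete is to encode the four layers and the AFMM design parameters as plain data. Everything below, including the field names and the toy risk heuristic, is my own illustrative encoding under the paper's stated vocabulary, not the authors' model or code.

```python
from dataclasses import dataclass

# Hypothetical encoding of the four-layer agent architecture and the AFMM
# design parameters described above; names and the heuristic are illustrative.

LAYERS = ["data perception", "reasoning engine",
          "strategy generation", "execution with control"]

@dataclass
class AgentDesign:
    autonomy_depth: float        # how far the agent acts without human sign-off
    heterogeneity: float         # strategy diversity across the agent population
    execution_coupling: float    # how tightly agents share execution pathways
    infra_concentration: float   # dependence on shared model/data infrastructure
    observability: float         # what supervisors can monitor

def systemic_risk_flag(d: AgentDesign) -> bool:
    """Toy heuristic: tightly coupled, infrastructure-concentrated agent
    populations with weak supervisory observability are the configurations
    the paper's argument flags as systemically risky."""
    return (d.execution_coupling * d.infra_concentration) > d.observability
```

The point of the sketch is only that systemic outcomes here depend on the design parameters, not on any "intelligence" attribute of the individual agent.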
Productivity gains from AI are most pronounced for lower-skilled workers, producing a pattern the authors call “skill compression.”
Cross-study pattern reported in the literature review: comparative evidence across worker-skill strata in multiple empirical papers showing larger relative gains for lower-skilled/junior workers; specific underlying studies and sample sizes are not enumerated in the brief.
Financial well-being is not an automatic byproduct of automated credit efficiency but an emergent outcome of architectural alignment among technology, borrower capability, and governance structures.
Theoretical conclusion drawn from empirical results showing mixed effects (positive on repayment and resilience, negative on stress) and significant moderation by human capability and institutional design.
The authors identify ten evaluation practices that teams use, ranging from lightweight interpretive checks to formal organizational processes (examples: qualitative user reviews, red-team testing, A/B experiments, telemetry/log analysis, structured annotation, governance/meta-evaluation).
Thematic coding of 19 interview transcripts produced a taxonomy enumerating ten practices (paper reports the taxonomy as an outcome).
Quantum-driven growth depends critically on adoption rates, infrastructure readiness, complementary investments (digital infrastructure, human capital), and enabling policy/regulatory environments.
Scenario framework that varies (a) technical timelines, (b) sectoral adoption rates (diffusion models), (c) infrastructure readiness, and (d) policy environments; policy counterfactual modeling shows sensitivity of adoption and macro outcomes to these parameters.
The magnitude and timing of macroeconomic impact from quantum computing are highly uncertain.
Monte Carlo / scenario ensemble results showing wide (fat-tailed) outcome distributions driven by uncertainty in technical milestones, adoption rates, and complementarity strengths; use of expert elicitation to parameterize tail risks.
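A Monte Carlo scenario ensemble of this kind can be sketched in a few lines. All distributions and the impact formula below are invented for illustration, not the paper's calibration; the exercise only shows how uncertainty in milestones, adoption, and complementarities compounds into a wide outcome distribution.

```python
import random
import statistics

random.seed(0)

# Minimal Monte Carlo scenario ensemble; every parameter is an assumption.
def draw_impact():
    milestone_year = random.triangular(2028, 2045, 2035)   # capability arrives
    years_active = max(0.0, 2040 - milestone_year)         # diffusion time by 2040
    adoption = random.betavariate(2, 5)                    # sectoral adoption rate
    complementarity = random.lognormvariate(0, 0.5)        # strength of complements
    return years_active * adoption * complementarity * 0.1  # % GDP uplift by 2040

draws = [draw_impact() for _ in range(10_000)]
q = statistics.quantiles(draws, n=100)
p5, p50, p95 = q[4], q[49], q[94]
print(f"GDP uplift by 2040: median {p50:.2f}%, 90% interval [{p5:.2f}%, {p95:.2f}%]")
```

Because late milestones yield zero impact by 2040 while the lognormal complementarity term stretches the upper tail, the resulting distribution is wide and right-skewed, mirroring the qualitative claim above.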
Safeguards such as audit trails, explainability, and human oversight impose additional implementation costs that must be weighed against efficiency benefits.
Normative and economic reasoning based on requirements for compliance and system design; no empirical cost estimates provided.
There is a fundamental tension between AI-driven efficiency and core administrative-law principles—discretion, due process, and accountability.
Doctrinal legal analysis of administrative-law principles in Vietnam and comparative institutional analysis of AI adoption in other systems.
The net educational value of AI-generated feedback depends on alignment with pedagogical goals, quality evaluation, integration with human teaching, and governance to manage equity, privacy, and incentives.
Synthesis statement from the meeting report produced by 50 interdisciplinary scholars; conceptual judgment rather than empirical proof.
LLMs excel at extracting and generating arguments from unstructured text but are opaque and hard to evaluate or trust.
Synthesis of recent LLM literature and observed properties (generation capability vs. opacity); no empirical evaluation within this paper.
The paper is primarily theoretical and historical; empirical validation is needed to quantify the irreducible component of LLM value, and practical degrees of rule‑extractability may exist even if some capabilities remain tacit.
Stated limitations section acknowledging the theoretical nature of the work and the need for empirical follow‑up.
If an LLM's full capability were reducible to an explicit rule set, that rule set would be an expert system; because expert systems are empirically and historically weaker than LLMs, this leads to a contradiction (supporting non‑rule‑encodability).
Logical proof‑by‑contradiction presented in the paper, supported by conceptual mapping between rule sets and expert systems and qualitative historical comparisons.
HindSight has limitations: it depends on citation and venue proxies for impact, uses a finite forward window (30 months), and may undercount delayed-impact research and be domain-specific to AI/ML.
Authors' stated limitations in the paper noting reliance on observable downstream signals (citations/venues), the finite forward window, field heterogeneity, and measurement noise.
Practical caveats: benefits depend on accelerators supporting MXFP formats; despite up to 96% recovery, residual quality gaps may remain for some task-specific or safety-critical cases; integration and tuning cost is required to apply BATQuant.
Discussion/limitation section in the paper outlining hardware dependency, remaining quality gaps despite high recovery percentages, and engineering effort for integration and tuning; these are argumentative caveats rather than results of controlled experiments.
The sign of the Largest Lyapunov Exponent (LLE) gives a precise criterion: negative LLE (contracting dynamics) permits fast convergence and real speedups for parallel Newton methods, whereas positive LLE (expanding/chaotic dynamics) prevents generally achieving fast convergence.
Theoretical derivation relating Lyapunov exponents to the stability of parallel-in-time linearizations and convergence of the parallel Newton iterations; supported by empirical observations reported on representative tasks.
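The sign criterion is easy to check numerically on a scalar map. The sketch below estimates the largest Lyapunov exponent of the logistic map as the long-run average of log|f'(x_t)|; the logistic map and the parameter choices are my illustration of the criterion, not the thesis's test problems.

```python
import math

# Sign-of-LLE check on the scalar map x_{t+1} = f(x_t) with f(x) = r*x*(1-x).
# The LLE is the long-run average of log|f'(x_t)| along the orbit:
# r = 2.5 gives contracting dynamics (LLE < 0), r = 4.0 is chaotic (LLE > 0).
def lyapunov(r, x0=0.3, n=20_000, burn=1_000):
    x, acc = x0, 0.0
    for t in range(n):
        x = r * x * (1 - x)
        if t >= burn:
            acc += math.log(abs(r * (1 - 2 * x)) + 1e-300)  # guard against log(0)
    return acc / (n - burn)

print(f"LLE at r=2.5: {lyapunov(2.5):.3f}  (contracting)")
print(f"LLE at r=4.0: {lyapunov(4.0):.3f}  (chaotic)")
```

Under the claim above, only the first regime admits fast parallel-Newton convergence.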
Many fixed-point and iterative schemes (e.g., Picard, Jacobi) are unified as special cases within the parallel Newton framework.
Theoretical analysis and derivations in the thesis that show these classical iterative methods arise from particular choices/approximations in the parallel Newton formulation.
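The Picard case of this unification can be stated in one line. Writing the fixed-point problem \(x = g(x)\) as a root-finding residual and applying Newton's update (notation here is generic, not copied from the thesis):

```latex
F(x) = x - g(x), \qquad
x_{k+1} = x_k - J_F(x_k)^{-1} F(x_k), \qquad
J_F(x) = I - g'(x).
```

Approximating \(J_F(x_k) \approx I\), i.e. dropping \(g'\), gives \(x_{k+1} = x_k - (x_k - g(x_k)) = g(x_k)\), which is exactly Picard iteration; block-diagonal approximations of the Jacobian similarly yield Jacobi-type schemes.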
The core problem is a trade-off between computational latency/resource cost and decision correctness: invoking more LLM reasoning improves correctness but increases latency; invoking less reduces latency but can increase failures.
Paper frames the research problem explicitly as this trade-off in the Introduction/Problem framing sections and motivates the need for adaptive orchestration.
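A confidence-gated escalation policy is one common way to navigate this trade-off. The sketch below is a toy, with invented stand-ins for a cheap heuristic path and a full LLM call; it is not the paper's orchestration method, only an illustration of the latency/correctness dial.

```python
import random

random.seed(1)

# `fast_path` and `llm_path` are stand-ins: a cheap heuristic with uncertain
# confidence vs. a slow but reliable LLM call. Accuracies are invented.
def fast_path():
    return ("answer", random.random())   # (decision, confidence); cheap, ~ms

def llm_path():
    return ("answer", 0.95)              # reliable but slow, ~seconds

def decide(threshold):
    """Escalate to the LLM only when the fast path is unsure.
    Higher thresholds buy correctness with latency; lower ones the reverse."""
    decision, confidence = fast_path()
    if confidence < threshold:
        return llm_path(), "llm"
    return (decision, confidence), "fast"

calls = [decide(0.7)[1] for _ in range(1000)]
print(f"escalation rate at threshold 0.7: {calls.count('llm') / len(calls):.0%}")
```

The threshold is the single knob: sweeping it from 0 to 1 traces out the latency/correctness frontier that adaptive orchestration tries to sit on.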
Demand for labor will shift toward data scientists, ML engineers, and interdisciplinary scientists, while wet-lab expertise and translational teams remain crucial.
Workforce trend analysis and employer hiring patterns summarized in the paper; interviews/case studies indicating changes in team composition.
AI excels at hypothesis generation but cannot replace scientific reasoning and experimental validation; human expertise remains essential.
Argument and case examples in the paper showing AI-generated hypotheses requiring human-led experimental design, interpretation, and validation.
Net gains from AI are not automatic nor evenly distributed; benefits depend on translation rates to clinical success and on addressing non-technical enablers.
Synthesis and conditional argument informed by sector observations; not backed by empirical distributional analysis in the paper.
Alignment with evolving regulatory expectations (evidence standards, auditing, liability) is necessary to translate AI capabilities into products and reduce adoption risk.
Policy-focused argument referencing regulatory uncertainty; no empirical measures of regulatory impact included.
Realized, sustained impact ('democratized discovery') from AI depends on non-technological enablers: high-quality interoperable data, rigorous validation, transparency/auditability, workforce upskilling, ethical oversight, and regulatory alignment.
Synthesis and prescriptive argument in editorial grounded in observed constraints; no empirical testing of causal dependence provided.
The review synthesizes cross-domain evidence on the use of AI across the continuum from target identification to regulatory integration and critically evaluates existing limitations, including data bias, limited interpretability, and regulatory ambiguity.
Statement about the scope and content of the review (literature synthesis and critical evaluation). This is a description of the paper's methods/content rather than an empirical finding; the excerpt indicates these topics are discussed.
The study examines the benefits and drawbacks of incorporating emerging artificial intelligence technologies into industrial policy.
Author-stated research objective reported in the text; evidence claimed to come from literature review (novel studies and existing literature), but no specific studies, sample sizes, or empirical measures are provided in the excerpt.
Model output can be treated as evidence for studying human behavior, but there are important epistemic limits to interpreting model-generated text as direct evidence of human beliefs or social facts.
Epistemic analysis and methodological critique in the paper (discussion of limits of treating model outputs as evidence); no single empirical test cited in the provided text.
The paper constructs three policy-contingent labor market scenarios for 2025–2035: (1) an Augmented Services Economy with inclusive productivity gains, (2) a Dual-Speed Labor Market characterized by polarization and uneven adjustment, and (3) a Disruptive Automation Shock involving significant displacement and social strain.
Prognostic, scenario-based approach integrating the three evidence bases (task-level capability mapping, occupational exposure/complementarity analysis, and firm- and worker-level adoption evidence). The scenarios are developed and described in the paper for the 2025–2035 horizon.
The validity of human–AI decision-making studies hinges on participants' behaviours, and incentive design can affect those behaviours.
Conclusion from the authors' thematic review and theoretical rationale linking incentive design to participant behaviour and study validity (no quantitative effect sizes provided in excerpt).
The study's counterfactual analytical model links HR indicators (training intensity, absenteeism, labor productivity, turnover rates, workforce allocation) to organizational performance outcomes using regression-based simulations and predictive estimation.
Methodological claim explicitly stated: model construction from an industrial firm dataset using regression-based simulations and predictive techniques. (Specific sample size, variable operationalizations, and time frame not reported in the description.)
Only one study reported a modest improvement in predicting endoscopic intervention needs (AUC: 0.68).
Single-study result cited in the review reporting AUC = 0.68 for prediction of need for endoscopic intervention.
The review synthesizes findings across five thematic areas: AI‑driven task automation and decision support; digital literacy and capacity building; gender‑sensitive employment patterns; infrastructural and policy challenges; and sustainable development outcomes.
Thematic synthesis of the 55 included articles as described in the paper; themes explicitly listed by the authors.
Prevalence and risk factors for poverty differ by gender, as does the nature of vulnerability.
Stated as a general empirical claim in the introduction, drawing on broader literature (no specific study, method, or sample size provided in the excerpt).
Major actors such as the United States, China, and the European Union pursue distinct models of AI development and regulation.
Comparative policy analysis and qualitative document review of national/regional AI strategies and regulatory proposals for the United States, China, and the EU (specific documents and sample size not specified).
The study identifies the emergence of three competing governance paradigms: the innovation-driven liberal model, the ethics-oriented regulatory model, and the state-controlled authoritarian model.
Finding from the paper's comparative policy analysis and qualitative review of policy documents across major actors (United States, European Union, China); underlying document sources referenced qualitatively but not enumerated as a quantitative sample.
Distinct AI features (recommendation engines, chatbots, and comparison tools) influence consumer outcomes when modeled as latent constructs.
Methodological claim: the study modeled three AI features as latent constructs and analyzed their relationships with dependent variables using SEM (quantitative questionnaire data).
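As a sketch of what such a specification looks like, a LISREL-style SEM separates a measurement model (questionnaire items loading on the three latent AI-feature constructs) from a structural model (latent features predicting consumer outcomes). The matrices below are generic notation, since the paper's exact specification is not given in the excerpt:

```latex
x = \Lambda_x \xi + \delta
    % measurement model: items x load on latent features \xi
    % (recommendation engine, chatbot, comparison tool)
\eta = \Gamma \xi + \zeta
    % structural model: latent consumer outcomes \eta regressed on \xi
```

Here \(\Lambda_x\) holds the factor loadings, \(\Gamma\) the structural paths whose significance the SEM analysis tests, and \(\delta, \zeta\) the measurement and structural disturbances.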
We develop a theoretical framework, the productivity funnel, that traces how technological potential narrows through successive stages, from access and digital infrastructure, through organizational absorption and human capital adaptation, to ultimate value capture.
Conceptual/theoretical development presented in the paper; no empirical sample needed (framework-building).
Effects of curated Skills are highly heterogeneous across domains (e.g., +4.5 pp in Software Engineering vs. +51.9 pp in Healthcare).
Per-domain pass-rate deltas reported in the paper (SkillsBench per-domain analysis). The example domain deltas (+4.5 pp and +51.9 pp) are taken from the reported per-domain results.
Institutional factors (education systems, active labor market policies, mobility, industrial policy, social protection) shape net employment outcomes from AI.
Theoretical and policy-focused synthesis; cross-country comparisons in literature highlight institutional mediation though no single new cross-country empirical estimate is provided.
Net employment effects depend on the balance of substitution and complementarity, sectoral exposure, and institutional responses.
Conceptual labor-economics framework (task-based, skill-biased change) and comparative review of cross-country/sectoral evidence emphasizing institutional mediation.
AI will substantially restructure labor markets.
Task-based theoretical approach and cross-sectoral synthesis of empirical studies showing task substitution and complementarity effects across occupations and sectors.
The pandemic produced a 1.5% increase in people identifying as potential entrepreneurs but a 2.3% contraction in emerging entrepreneurs, indicating a breakdown in converting aspiration into formal entrepreneurial activity (pipeline disruption).
Reported percentage changes in pipeline stages (potential entrepreneurs and emerging entrepreneurs) measured in the survey before/after (or during) the pandemic within the >27,000 respondent sample; comparison of identification and transition rates along the entrepreneurial pipeline.
Long-run integration (degree of long-run association) between core AI and AI-enhanced robotics differs systematically across national innovation systems.
Country-level decomposition of patent filing series and time-series econometric tests for long-run relationships / cointegration between core AI and AI-enhanced robotics patent series for each country/region (China, U.S., Europe, Japan, South Korea).
Core AI, traditional robotics, and AI-enhanced robotics follow distinct historical trajectories over 1980–2019 and do not move together uniformly.
Time-series analysis using annual patent filing counts (1980–2019) for each domain; tests for common long-run relationships / co-movement across the three patent series (as reported in the paper). Country-aggregated and domain-specific patent time series were analyzed; exact sample size (total patents) not specified in the summary.
Kondratieff, Schumpeter, and Mandel each highlight different drivers of capitalist long waves: Kondratieff emphasizes regular technological-driven renewal, Schumpeter emphasizes entrepreneurship and innovation-led creative destruction, and Mandel emphasizes class relations and production structures.
Comparative theoretical analysis and literature synthesis across the three schools; conceptual summary of canonical positions (no original dataset; qualitative interpretation).
XChronos reframes transhumanist technology evaluation in experiential terms, creating both market opportunities and measurement/regulatory challenges for AI economics.
Synthesis and concluding argument in the paper summarizing proposed implications; conceptual reasoning without empirical tests.