Evidence (6869 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	758	199	100	900	2007
Governance & Regulation	826	400	191	122	1563
Organizational Efficiency	777	193	124	84	1189
Technology Adoption Rate	635	233	124	97	1098
Research Productivity	422	128	57	336	954
Output Quality	476	179	59	47	761
Decision Quality	328	177	81	47	640
Firm Productivity	435	57	88	20	606
AI Safety & Ethics	218	277	65	33	599
Market Structure	180	170	123	24	502
Task Allocation	213	64	72	33	387
Skill Acquisition	170	61	61	17	309
Innovation Output	203	27	43	18	292
Employment Level	105	54	107	13	281
Fiscal & Macroeconomic	131	69	43	26	276
Consumer Welfare	117	63	42	11	233
Firm Revenue	153	48	26	3	230
Task Completion Time	173	31	8	12	225
Inequality Measures	44	122	49	6	221
Worker Satisfaction	89	65	22	12	188
Error Rate	69	92	10	2	173
Regulatory Compliance	77	69	14	5	165
Automation Exposure	56	56	26	13	154
Training Effectiveness	94	21	13	19	149
Wages & Compensation	77	36	25	6	144
Team Performance	86	17	27	10	141
Developer Productivity	95	17	14	6	133
Job Displacement	12	80	20	1	113
Hiring & Recruitment	52	7	8	3	70
Creative Output	31	18	8	3	61
Skill Obsolescence	5	46	6	1	58
Social Protection	27	16	8	2	53
Labor Share of Income	17	19	17	—	53
Worker Turnover	11	12	—	3	26
Industry	—	—	—	1	1

Governance Remove filter

These harms increasingly translate into financial loss through litigation, enforcement penalties, brand erosion, and failed deployments.

Paper argues this linkage using conceptual reasoning and illustrative examples/case vignettes; cites regulatory and market incidents but does not provide systematic empirical estimates or a sample size.

high negative Artificial Intelligence Governance In Corporate Strategy: Et... firm_revenue

AI systems can create material harms: discriminatory outcomes, privacy and security failures, opacity in decision logic, and regulatory noncompliance.

Paper lists these harms as core risks based on prior literature, regulatory developments, and conceptual risk analysis. Presented as well-documented categories rather than as new empirical findings; no sample size reported.

high negative Artificial Intelligence Governance In Corporate Strategy: Et... ai_safety_and_ethics

As artificial intelligence assumes cognitive labor, no existing quantitative framework predicts when human capability loss becomes catastrophic.

Introductory/background claim asserted by authors motivating the study (literature gap claim).

high negative The enrichment paradox: critical capability thresholds and i... absence of prior quantitative frameworks for catastrophic human capability loss

Broader AI scope lowers the critical threshold K* (i.e., more general AI reduces the K* value at which capability collapse occurs).

Model sensitivity analysis / simulations showing K* varies with assumed scope of AI (reported in model calibration discussion).

high negative The enrichment paradox: critical capability thresholds and i... change in critical threshold K* with AI scope

The model identifies a critical threshold K* approximately 0.85 (scope-dependent; broader AI scope lowers K*) beyond which capability collapses abruptly — the 'enrichment paradox.'

Model analysis and simulations calibrated across domains (paper reports computed threshold K* ≈ 0.85 and notes dependence on AI scope).

high negative The enrichment paradox: critical capability thresholds and i... critical delegation/capability threshold (K*) at which human capability collapse...

Fabrication risk is not an anomalous glitch but a foreseeable consequence of the technology's design, with direct implications for the evolving duty of technological competence.

Conclusion drawn from the paper's theoretical/physics-based analysis and the simulated scenario; stated in the abstract as the authors' interpretation and policy/legal implication.

high negative When AI output tips to bad but nobody notices: Legal implica... foreseeability of fabrication risk and implications for professional duty/compet...

The paper presents the physics-based analysis in a legal-industry setting by walking through a simulated brief-drafting scenario.

Methodological claim explicitly stated in the abstract: use of a simulated brief-drafting scenario to demonstrate the analysis.

high negative When AI output tips to bad but nobody notices: Legal implica... demonstration of fabrication risk in a simulated legal drafting task (output qua...

Although commonly dismissed as random 'hallucination', recent physics-based analysis of the Transformer's core mechanism reveals a deterministic component: the AI's internal state can cross a calculable threshold, causing its output to flip from reliable legal reasoning to authoritative-sounding fabrication.

Paper cites/relies on 'recent physics-based analysis' of Transformer mechanisms and states that it demonstrates a calculable threshold; the paper also purports to present this science in a legal setting (via simulation). No numeric experimental sample provided in the excerpt.

high negative When AI output tips to bad but nobody notices: Legal implica... transition from reliable reasoning to fabricated outputs (failure mode / interna...

Courts confront a novel threat to the integrity of the adversarial process due to fabricated authorities produced by generative AI.

Asserted in the abstract as a consequence of fabricated outputs; supported by the paper's conceptual argument and simulation reference rather than empirical court-case analysis.

high negative When AI output tips to bad but nobody notices: Legal implica... integrity of the adversarial process / decision quality in courts

Attorneys who unknowingly file such fabrications face professional sanctions, malpractice exposure, and reputational harm.

Stated as a legal/consequential claim in the abstract; no empirical evidence, case counts, or legal-statistics provided in the excerpt.

high negative When AI output tips to bad but nobody notices: Legal implica... professional sanctions, malpractice exposure, reputational harm

For law in particular, generative AI introduces a perilous failure mode in which the AI fabricates fictitious case law, statutes, and judicial holdings that appear entirely authentic.

Claimed in the paper; supported by the paper's analytic argument and a simulated brief-drafting scenario referenced in the abstract (no numeric sample provided).

high negative When AI output tips to bad but nobody notices: Legal implica... fabrication of legal authorities (authentic-appearing fake citations/holdings)

Improvements in AI ('better' AI) amplify the excess automation as well.

Model comparative statics: increased AI capabilities raise private incentives to automate, leading to more displacement than is socially optimal; theoretical analysis only.

high negative The AI Layoff Trap level of automation / worker displacement as a function of AI capability

More competition amplifies the excess automation (the automation arms race).

Comparative-statics result in the competitive task-based theoretical model showing increased competition raises firms' incentives to automate; no empirical sample.

high negative The AI Layoff Trap level of automation / worker displacement as a function of competition intensity

The resulting loss from excess automation harms both workers and firm owners.

Welfare comparisons from the model showing negative payoff changes for workers (lower wages/less employment) and reduced owner returns when automation is excessive; theoretical analysis, no empirical data.

high negative The AI Layoff Trap welfare/profits of workers and firm owners (losses caused by excess automation)

In a competitive task-based model, demand externalities trap rational firms in an automation arms race, displacing workers well beyond what is collectively optimal.

Formal equilibrium analysis in the paper's theoretical competitive task-based model; comparative statics and welfare analysis (no empirical sample).

high negative The AI Layoff Trap extent of worker displacement relative to social optimum

Knowing that AI-driven displacement can erode demand is not enough for firms to stop automating.

Analytical result from the paper's competitive task-based model showing firms' incentives do not internalize demand externalities; no empirical sample.

high negative The AI Layoff Trap firm automation decisions (propensity to automate) despite awareness of aggregat...

If AI displaces human workers faster than the economy can reabsorb them, it risks eroding the very consumer demand firms depend on.

Theoretical statement in the paper's motivating premise; no empirical sample reported (conceptual argument about aggregate demand effects when displacement outpaces reabsorption).

high negative The AI Layoff Trap consumer demand (aggregate demand) as affected by worker displacement

Fukui is Japan's least-visited prefecture.

Descriptive claim in the paper specifying the study site (Fukui) as the country's least-visited prefecture; no supporting national rankings provided in the excerpt.

high negative Engineering Distributed Governance for Regional Prosperity: ... relative visitation rank

We quantify an annual opportunity gap of 865,917 unrealized visits, equivalent to approximately 11.96 billion yen (USD 76.2 million) in lost revenue.

Model-based estimate produced by the DSS using the analyzed datasets and the DHDE-informed optimization; figure reported directly in the paper.

high negative Engineering Distributed Governance for Regional Prosperity: ... unrealized visits and lost revenue

For regions experiencing demographic decline and structural stagnation, the primary risk is 'under-vibrancy', a condition where low visitor density suppresses economic activity and diminishes satisfaction.

Conceptual claim and problem framing provided by the authors (theoretical/qualitative argument in the paper).

high negative Engineering Distributed Governance for Regional Prosperity: ... economic activity and satisfaction (conceptual)

Most research in urban informatics and tourism focuses on mitigating overtourism in dense global cities.

Author statement in introduction positioning the paper relative to existing literature; no quantitative literature review or citation counts reported in the excerpt.

high negative Engineering Distributed Governance for Regional Prosperity: ... other

Strict data sovereignty laws fragment regional collaboration between African Union member states and hinder AI development.

Stated in the paper as a policy barrier; supported by the authors' policy review of data sovereignty rules and their implications for cross-border data sharing.

high negative Take the Train: Africa at the Crossroad of Modern AI regional collaboration for AI development

Restricted cloud access due to payment system mismatches and volatile exchange rates is a barrier to AI adoption in Africa.

Claim made in the paper as part of the list of barriers; based on the authors' qualitative and quantitative review and reference to policy/financial constraints across African countries.

high negative Take the Train: Africa at the Crossroad of Modern AI cloud access for AI developers

Important barriers include limited access to high-performance computing (HPC).

Paper identifies limited HPC access as a key barrier; supported by the authors' collection and consolidation of HPC availability data via the Africa AI Compute Tracker (ACT).

high negative Take the Train: Africa at the Crossroad of Modern AI access to high-performance computing (HPC)

Africa's participation in modern AI development is constrained by severe infrastructural and policy gaps.

Stated as a central argument in the paper; supported by the paper's synthesis of qualitative and quantitative evidence and reference to official declarations on AI adoption across the continent.

high negative Take the Train: Africa at the Crossroad of Modern AI Africa's participation in modern AI development

AI can initially exacerbate distributional injustice.

Dimension-level analysis indicating negative (or initially negative) effects of AI on the distributional component of the energy justice index.

high negative Artificial intelligence adoption for advancing energy justic... distributional justice component of energy justice index

There are few integrated frameworks (bridging ethics and technical controls) in the current AI governance landscape.

Result of the literature review and cluster analysis showing limited coverage of frameworks that integrate ethical principles with auditable technical controls.

high negative AI Governance Risk Tiering for Sustainable Digital Infrastru... prevalence of integrated governance frameworks

Findings reveal a fragmented landscape dominated by ethics/privacy-centric and compliance/risk-focused approaches.

Synthesis of the reviewed literature and results of PCA/k-means clustering indicate thematic dominance of ethics/privacy and compliance/risk orientations across frameworks.

high negative AI Governance Risk Tiering for Sustainable Digital Infrastru... dominant thematic focus of governance frameworks

These findings uncover critical threats to judicial integrity and public trust and underscore the urgent need for robust safeguards against non-legal influences in AI legal systems.

Interpretation/conclusion drawn from the empirical results (observed deviations, sentiment amplification, and subgroup vulnerabilities).

high negative LLM Safety in Judicial AI: A Stress Test of Social Media Inf... potential impact on judicial integrity and public trust (qualitative/inferential...

These safety risks are compounded for emotionally charged topics.

Subgroup analyses where emotionally charged case topics showed larger deviations and stronger effects from injected sentiment.

high negative LLM Safety in Judicial AI: A Stress Test of Social Media Inf... change in deviation/amplification of model outputs for emotionally charged topic...

These safety risks are compounded (stronger) for low-skilled occupational categories.

Subgroup analyses reported in the paper showing larger model deviations and/or greater sentiment amplification effects for cases involving low-skilled occupations.

high negative LLM Safety in Judicial AI: A Stress Test of Social Media Inf... interaction effect: deviation/amplification magnitude by occupational skill leve...

The sentiment-induced divergences lead to unstable and often inflated compensation predictions by the models.

Analysis of model-predicted compensation amounts under sentiment perturbations showing increased variability and upward bias compared to CJOL amounts.

high negative LLM Safety in Judicial AI: A Stress Test of Social Media Inf... predicted compensation amounts (inflation and instability) from LLMs versus CJOL...

Public opinion (social media sentiment) substantially amplifies deviations between LLM outputs and real rulings.

Stress-test experiments in which injected social media sentiment increased the divergence of model outputs from CJOL judgments across the sample.

high negative LLM Safety in Judicial AI: A Stress Test of Social Media Inf... change in deviation between LLM outputs and CJOL rulings when social media senti...

Models exhibit inherent deviations from real rulings.

Empirical comparison of LLM outputs to CJOL judgments showing systematic differences (based on the paper's reported comparisons across the dataset).

high negative LLM Safety in Judicial AI: A Stress Test of Social Media Inf... magnitude and frequency of deviations between LLM outputs and actual court judgm...

The article argues that the idea of a “Pax Silica” is fragile.

Conclusion drawn from the paper's theoretical framework and comparative analysis; presented as an assessment rather than empirical measurement.

high negative The Logistics of Hegemony: Semiconductor Chokepoints, Global... stability/fragility of a proposed techno-hegemonic order ('Pax Silica')

Contemporary struggles over semiconductor supply chains represent not a new hegemonic order but a logistical adaptation of Pax Americana.

Stated thesis supported by comparative/historical analysis and theoretical argumentation (comparative analysis of historical Pax orders and U.S. techno-security architecture); no quantitative sample size reported in abstract.

high negative The Logistics of Hegemony: Semiconductor Chokepoints, Global... characterization of geopolitical order governing semiconductor supply chains

In the short term, big data may inhibit welfare growth.

Theoretical comparative-static/dynamic analysis reported in the model showing that initial or short-run effects of increased data sharing can reduce welfare growth (no empirical/sample data).

high negative Study on the impact of big data sharing on individuals’ welf... short-term growth of individuals' welfare

Traditional paradigms, specifically the resource-based view and the dynamic capabilities framework, operate under closed-system, first-order cybernetic assumptions that fail to capture the dissipative nature of algorithmic agents.

Conceptual critique presented in the paper's theoretical argumentation (literature critique and re-framing); no empirical sample reported.

high negative Governing Human–AI Co-Evolution: Intelligentization Capabili... explanatory_power_of_management_theory (ability to account for AI-driven organiz...

This result directly contradicts classical scaling laws which assume monotonic capability gains with model scale.

Comparative theoretical claim in the paper contrasting the Institutional Scaling Law with classical empirical/theoretical scaling laws in ML literature.

high negative Punctuated Equilibria in Artificial Intelligence: The Instit... relationship between model scale and deployment-relevant fitness/capability

The Institutional Scaling Law proves that institutional fitness is non-monotonic in model scale.

Formal mathematical derivation/proof presented in the paper (the 'Institutional Scaling Law').

high negative Punctuated Equilibria in Artificial Intelligence: The Instit... institutional fitness as a function of model scale

AI development proceeds not through smooth advancement but through extended periods of stasis interrupted by rapid phase transitions that reorganize the competitive landscape (punctuated equilibrium pattern).

Argument based on punctuated equilibrium theory from evolutionary biology and historical analysis presented in the paper identifying discrete transitions in AI history; the paper cites and classifies eras/events as evidence.

high negative Punctuated Equilibria in Artificial Intelligence: The Instit... pattern of AI development (stasis vs. phase transitions)

The interaction of artificial intelligence and environmental regulation produces a '1 + 1 < 2' crowding-out effect (their combined effect is less than the sum of individual effects).

Spatial Durbin model with interaction term between AI and environmental regulation as summarized in the abstract; reported as a crowding-out interaction.

high negative How artificial intelligence and environmental regulation inf... UCEE index (interaction effect of AI and environmental regulation)

Environmental regulation significantly inhibits local UCEE.

Spatial Durbin model results reported in the abstract indicating a significant negative local coefficient for environmental regulation.

high negative How artificial intelligence and environmental regulation inf... UCEE index (local/provincial effect of environmental regulation)

Artificial intelligence significantly inhibits local UCEE.

Spatial Durbin model results reported in the abstract indicating a significant negative local coefficient for artificial intelligence.

high negative How artificial intelligence and environmental regulation inf... UCEE index (local/provincial effect of AI)

Rather than broad job losses, evidence points to a reallocation at the entry level: AI automates tasks typically assigned to junior staff, shifting the nature of entry-level roles.

Synthesis of firm- and task-level empirical studies reported in the brief documenting automation of routine/junior tasks and changes in job-task composition; specific sample sizes vary by cited study and are not provided in the brief.

high negative AI, Productivity, and Labor Markets: A Review of the Empiric... automation of entry-level/junior tasks and changes to entry-level job content

Algorithmic credit systems are linked to higher levels of financial stress.

Study reports a positive association between algorithmic credit system use and reported financial stress from regression analysis on the 400-user cross-sectional dataset.

high negative Architecting financial well-being in algorithmic credit syst... financial stress

Confirmation bias poses a weakness in LLM-based code review, with implications on how AI-assisted development tools are deployed.

Synthesis of findings from Study 1 (framing-induced detection failures) and Study 2 (practical exploitability and partial mitigation via debiasing).

high negative Measuring and Exploiting Confirmation Bias in LLM-Assisted S... reliability/security of LLM-based code review

Adversarial framing succeeds in 88% of cases against Claude Code (autonomous agent) in real project configurations where adversaries can iteratively refine their framing to increase attack success.

Study 2 experiments in real project configurations with iterative adversary refinement evaluated against Claude Code (autonomous agent); reported 88% success rate.

high negative Measuring and Exploiting Confirmation Bias in LLM-Assisted S... attack success rate (vulnerability reintroduction accepted/not detected)

Adversarial pull request framing (e.g., labeled as security improvements or urgent functionality fixes) succeeds in reintroducing known vulnerabilities in 35% of cases against GitHub Copilot under one-shot attacks.

Study 2 experiments simulating adversarial pull requests evaluated against GitHub Copilot (interactive assistant); reported success rate 35% for one-shot attacks.

high negative Measuring and Exploiting Confirmation Bias in LLM-Assisted S... attack success rate (vulnerability reintroduction accepted/not detected)

The framing effect is strongly asymmetric: false negatives increase sharply while false positive rates change little.

Comparison of false negative and false positive rates across framing conditions in Study 1 experiments (250 CVE pairs across models).

high negative Measuring and Exploiting Confirmation Bias in LLM-Assisted S... false negative rate and false positive rate

« Prev 1 2 3 … 24 25 26 … 137 138 Next »