Evidence (6917 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	761	200	101	904	2020
Governance & Regulation	829	400	191	122	1566
Organizational Efficiency	784	193	125	84	1197
Technology Adoption Rate	637	236	124	97	1103
Research Productivity	431	131	58	340	972
Output Quality	481	183	59	47	770
Decision Quality	332	177	82	49	647
Firm Productivity	439	57	88	20	610
AI Safety & Ethics	218	279	66	33	602
Market Structure	181	170	123	24	503
Task Allocation	214	64	72	33	388
Skill Acquisition	174	62	62	17	315
Innovation Output	204	27	45	18	295
Employment Level	105	54	108	13	282
Fiscal & Macroeconomic	132	69	43	26	277
Consumer Welfare	117	63	42	11	233
Firm Revenue	154	48	26	3	231
Task Completion Time	173	31	8	12	225
Inequality Measures	44	123	50	6	223
Worker Satisfaction	89	65	22	12	188
Error Rate	71	92	10	2	175
Regulatory Compliance	77	69	14	5	165
Automation Exposure	58	56	26	13	156
Training Effectiveness	96	21	14	19	152
Wages & Compensation	77	37	25	6	145
Team Performance	86	17	27	10	141
Developer Productivity	95	17	14	6	133
Job Displacement	12	81	21	1	115
Hiring & Recruitment	52	7	8	3	70
Creative Output	32	20	8	3	64
Skill Obsolescence	5	47	6	1	59
Social Protection	28	16	8	2	54
Labor Share of Income	17	19	17	—	53
Worker Turnover	11	12	—	3	26
Industry	—	—	—	1	1

Governance Remove filter

The major disadvantage of an MIS is dependency on reliable electricity and internet, creating systemic vulnerability due to the digital divide.

Paper notes infrastructure dependency as a constraint; assertion grounded in common infrastructural realities but no measured connectivity or outage statistics from DRC/SA are provided.

high negative Establishes a technical and academic bridge between the educ... geographic/regional access to equivalency services and system uptime availabilit...

Key audit/control weaknesses with respect to prompt fraud include lack of provenance for inputs/prompts and model outputs, inadequate access controls, and missing or ineffective monitoring and anomaly detection for AI outputs.

Qualitative control analysis and adaptation of established auditing principles to GenAI workflows; recommendations based on threat modeling rather than field data.

high negative Prompt Engineering or Prompt Fraud? Governance Challenges fo... presence or absence of specific control capabilities (provenance, access control...

GenAI outputs can be tailored to mimic corporate styles, templates, and evidence artifacts (e.g., summaries, memos, audit trails), which increases their credibility to auditors, managers, or customers.

Illustrative examples and scenario mapping demonstrating templated output mimicry; no controlled experiments or corpus analysis provided.

high negative Prompt Engineering or Prompt Fraud? Governance Challenges fo... perceived credibility of machine-generated artifacts when formatted to corporate...

Large language models produce fluent, human-like outputs that can mask falsehoods (hallucinations) as facts, making prompt fraud effective.

Well-established LLM behavior cited conceptually and supported in the paper by illustrative examples; no new empirical measurement in this article.

high negative Prompt Engineering or Prompt Fraud? Governance Challenges fo... propensity of LLM outputs to present fabricated information as authoritative

Prompt fraud does not require system intrusion, credential theft, or software exploits; it operates at the reasoning/language layer of large language models and therefore can be executed without technical breaches.

Logical/technical argumentation built from properties of LLMs and illustrative hypothetical attack chains; threat modeling rather than empirical attack logs.

high negative Prompt Engineering or Prompt Fraud? Governance Challenges fo... necessity of technical breach for successful fraud (binary: required/not require...

Prompt fraud is a new, distinct fraud modality in which adversaries intentionally craft natural-language prompts (or manipulate prompt inputs) to steer generative AI outputs into producing misleading, fabricated, or compliance-evading artifacts that bypass traditional internal controls.

Conceptual definition presented by the paper based on threat taxonomy and scenario mapping; illustrated with case-style examples. No empirical incident dataset or prevalence statistics provided.

high negative Prompt Engineering or Prompt Fraud? Governance Challenges fo... existence/recognition of a distinct fraud modality ('prompt fraud')

Potential limitations include limited methodological detail on case selection and measurement, possible selection and reporting bias from practitioner-sourced examples, and variable generalizability to small firms or highly regulated industries.

Authors' self-reported limitations in the Methods/Limitations section (qualitative assessment).

high negative Governed Hyperautomation for CRM and ERP: A Reference Patter... methodological completeness and generalizability (qualitative limitation)

Prompt fraud exploits the natural-language interface of large language models (LLMs) to produce outputs that appear authoritative (reports, audit trails, explanations) without system intrusion, credential theft, or software exploitation.

Definition and threat-model description using conceptual examples and case vignettes; literature/regulatory review to position the threat relative to traditional fraud vectors.

high negative Prompt Engineering or Prompt Fraud? Governance Challenges fo... production of authoritative-appearing artifacts by LLMs without technical system...

Data privacy and cross-border compliance issues arise from using cloud and SECaaS, complicating legal compliance for firms.

Regulatory analyses and compliance reports; documented examples in case studies and industry guidance on cross-border data flows.

high negative Security- as- a- service: enhancing cloud security through m... compliance incident rates / regulatory risk exposure

The cloud shared responsibility model creates potential ambiguities in liability between providers and customers.

Regulatory guidance, legal analyses, and documented post-incident case studies showing confusion over responsibilities.

high negative Security- as- a- service: enhancing cloud security through m... clarity/ambiguity of security and liability responsibilities

China manages the openness–security trade-off through a centralized, developmentalist, techno‑sovereignty approach that privileges coordinated state direction and control.

Qualitative content analysis of national‑level policy texts: 18 Chinese policy documents coded across four analytical dimensions (coordination objectives, institutional actors, governance mechanisms, stakeholder legitimacy).

high negative Balancing openness and security in scientific data governanc... governance logic / institutional coordination type (centralized, state‑led)

Antibiotic use in humans and animals, along with environmental antibiotic residues, generates converging selection pressures that drive AMR relevant to children.

Well-established ecological and microbiological literature summarized in the review showing cross-sector selection pressures; narrative integration rather than new empirical analysis.

high negative Safeguarding future generations: a One Health perspective on... selection and dissemination of antimicrobial resistance genes/pathogens across h...

Child behaviors (hand-to-mouth activity, play, outdoor exposure) increase contact with environmental and animal reservoirs and therefore exposure risk.

Behavioral and exposure studies synthesized narratively; observational evidence from exposure assessments and pediatric environmental health studies cited in review (no meta-analysis).

high negative Safeguarding future generations: a One Health perspective on... frequency/intensity of contact with environmental/animal reservoirs and resultan...

Developmental windows imply early-life exposures can have long-term consequences for health and human capital.

Developmental and epidemiologic literature integrated in the review; narrative citations of studies linking early exposures to later health and cognitive outcomes (no single longitudinal dataset presented).

high negative Safeguarding future generations: a One Health perspective on... long-term health, cognitive development, and human-capital outcomes following ea...

Physiological and immunological immaturity (including neonatal risks) increases children's susceptibility to infectious disease and related harms.

Established biological and clinical literature synthesized in the review; references to neonatal clinical risks and immunological immaturity across pediatric literature (no pooled effect sizes reported).

high negative Safeguarding future generations: a One Health perspective on... susceptibility to infection and severity of disease in neonates and young childr...

Automation and LLM-driven orchestration add opacity; errors in instrument control or analysis could propagate quickly, raising liability, insurance, and reproducibility concerns.

Analytical discussion of risks and analogies to automated systems in other domains; no incident-level empirical data from microscopy given.

high negative ChatMicroscopy: A Perspective Review of Large Language Model... frequency and impact of errors, liability exposure, reproducibility failures

Ethical and governance issues related to LLM-driven microscopy include accountability, reproducibility, access inequities, data privacy, and concentration of capabilities in large providers.

Policy-oriented synthesis and analogies to governance challenges observed in other AI deployments; no new empirical measurement in microscopy contexts.

high negative ChatMicroscopy: A Perspective Review of Large Language Model... presence of governance risks: accountability gaps, reproducibility problems, une...

Integration of LLMs with microscopes faces challenges including safety and reliability of instrument control, verification of scientific outputs, data provenance, and alignment with experimental constraints.

Analytical discussion based on known reliability and safety issues in automated systems and AI tool use; no empirical incident data from microscopy provided.

high negative ChatMicroscopy: A Perspective Review of Large Language Model... risks to safety, reliability, and scientific validity when deploying LLM-driven ...

There is substantial uncertainty in economic forecasts due to possible scale-up failures, regulatory constraints, feedstock price volatility, and path‑dependent lock‑in effects.

Synthesis of technical failure modes, regulatory uncertainty, and sensitivity analyses reported in TEA/LCA literature and economic modeling sections of the review.

high negative Harnessing Microbial Factories: Biotechnology at the Edge of... forecast variance in cost trajectories, probability of commercial success, and s...

Regulatory and biosafety concerns (including environmental release risks and dual‑use issues) increase fixed costs and create entry barriers that shape industry structure and diffusion.

Policy and governance literature reviewed alongside technical case studies; citations of regulatory requirements, biosafety frameworks, and examples of compliance costs affecting project viability.

high negative Harnessing Microbial Factories: Biotechnology at the Edge of... regulatory compliance costs, time-to-market, number of approved facilities/proce...

Engineering and economic challenges—scale‑up hurdles, process robustness, feedstock cost, and downstream purification—limit industrial deployment of many bio-based processes.

Case study TEA/LCA summaries and process reports in the review highlighting scale-up failures or increased costs at larger scales, purification complexity for low‑concentration products, and sensitivity to feedstock prices.

high negative Harnessing Microbial Factories: Biotechnology at the Edge of... capital and operating costs, purification yield and cost, process robustness met...

Technical biological limitations—metabolic burden, pathway crosstalk, byproduct formation, and genetic instability—remain major constraints on strain performance and scalability.

Multiple experimental reports and method papers cited in the review documenting decreased growth/productivity due to engineered pathway burden, unintended interactions between pathways, accumulation of byproducts, and genetic mutations during production runs.

high negative Harnessing Microbial Factories: Biotechnology at the Edge of... strain growth rate, productivity (g/L/h), byproduct concentrations, genetic muta...

Empirical validation is concentrated on the Agora-12 corpus; generalizability to other architectures, scales, or deployment contexts is unproven and identified as a limitation.

Authors' own limitations section and scope of empirical tests (analyses limited to Agora-12 and four clinical cases).

high negative Model Medicine: A Clinical Framework for Understanding, Diag... Scope of empirical validation (limited to Agora-12 dataset and 4 case studies)

Platforms benefit from data-driven scalability and network effects, creating barriers to entry and affecting consumer surplus, innovation incentives, and pricing.

Economic theory of platforms and empirical cases from platform markets synthesized in the literature review; argument supported by secondary empirical studies cited.

high negative Financial Inclusion in the Age of FinTech Platforms: Opportu... barriers to entry; consumer surplus; prices; innovation indicators

Market concentration and network effects create platform power that may squeeze smaller providers, raise costs, or lock users into ecosystems.

Platform economics literature and case examples reviewed in the paper; conceptual and theoretical support with illustrative empirical instances from secondary sources.

high negative Financial Inclusion in the Age of FinTech Platforms: Opportu... market concentration measures; prices/costs to users; switching costs/lock-in

Infrastructure gaps (connectivity, electricity, identity systems) limit who benefits from digital finance.

Cross-country and development literature synthesized in the paper highlighting correlations between infrastructure availability and digital finance uptake; no primary empirical analysis in the paper.

high negative Financial Inclusion in the Age of FinTech Platforms: Opportu... uptake/usage of digital financial services conditional on infrastructure availab...

Implementing the governed hyperautomation pattern raises upfront costs (governance tooling, monitoring, validation, compliance processes).

Economic and cost-structure discussion in the paper, based on qualitative reasoning and industry experience; no quantified cost estimates or sample-based cost analysis provided.

high negative Governed Hyperautomation for CRM and ERP: A Reference Patter... upfront implementation costs (governance tooling, validation, compliance overhea...

Use of standardized (non-adaptive) dialogues limits ecological validity relative to live adaptive chatbots.

Limitations section acknowledges that standardized (non-adaptive) experimental dialogues reduce ecological validity compared with live/adaptive chatbot interactions.

high negative AI Chatbots as Informatics-Enabled Marketing Service Systems... ecological validity

Platform KPIs (e.g., eCPM) can diverge from social welfare metrics (consumer surplus, privacy harms), creating metric misalignment.

Conceptual critique with examples of common platform metrics versus welfare economics; not accompanied by a quantitative comparison dataset.

high negative Artificial Intelligence for Personalized Digital Advertising... alignment between platform KPIs and social welfare measures

Privacy constraints reduce observability and necessitate privacy-preserving study designs that complicate estimation.

Methodological analysis referencing differential privacy, federated learning and their effects on statistical power/observability; no experimental power analyses with sample sizes presented here.

high negative Artificial Intelligence for Personalized Digital Advertising... observability and estimation precision under privacy constraints

Data access asymmetries (platforms holding proprietary logs) limit external auditability and replication of advertising research.

Empirical and institutional observation about industry data practices; supported by calls for privacy-preserving shared datasets in the paper; no quantified survey sample included.

high negative Artificial Intelligence for Personalized Digital Advertising... external auditability and ability to replicate studies

Attribution complexity — multi-touch, cross-device, and delayed conversions — confounds causal inference in advertising measurement.

Methodological discussion referencing causal inference challenges and standard problems in attribution; widely-documented in the literature though not re-measured in this paper.

high negative Artificial Intelligence for Personalized Digital Advertising... accuracy of causal attribution for ad effects

Complex automated systems make attribution and responsibility harder when harms occur (Automation vs accountability trade-off).

Qualitative institutional analysis and case-study reasoning about multi-agent automated pipelines and opaque model decisions; no single empirical incident dataset provided.

high negative Artificial Intelligence for Personalized Digital Advertising... clarity of attribution and accountability in case of harms

Richer personalization depends on granular data and cross-device identity, creating privacy externalities and compliance risks (Personalization vs privacy trade-off).

Data source inventory and privacy literature review; supported by observational industry trends (move to first-party identity) rather than a quantified sample in the paper.

high negative Artificial Intelligence for Personalized Digital Advertising... degree of personalization versus exposure to privacy risks/compliance failures

Federated infrastructures introduce adversarial risks (model/data poisoning, inference attacks on updates) that require robust aggregation, anomaly detection, and other defenses.

Threat modeling and taxonomy of adversarial/privacy threats with mapped mitigations (robust aggregation, anomaly detection, DP). Evidence is conceptual and based on standard threat frameworks; no empirical attack/defense experiments reported at scale.

high negative Privacy-Aware AI Advertising Systems: A Federated Learning F... vulnerability to poisoning/inference (attack success rate), effectiveness of def...

Delayed and sparse feedback (clicks/conversions) in advertising complicates credit assignment and timely model updates, degrading learning unless specific methods for delayed/sparse signals are used.

Analytical discussion of learning dynamics with delayed/sparse labels; conceptual solutions suggested (credit assignment methods). No large-scale empirical evaluation presented.

high negative Privacy-Aware AI Advertising Systems: A Federated Learning F... learning efficacy under delayed/sparse feedback (convergence, time-to-adapt), at...

Non-IID and heterogeneous data distributions across devices and publishers impair convergence and degrade personalization unless addressed with algorithmic adaptations.

Analytical modeling of convergence under non-IID conditions; threat/robustness discussion; prototype/simulation illustrations. This claim is supported by established literature and the paper's analytic treatment.

high negative Privacy-Aware AI Advertising Systems: A Federated Learning F... convergence behavior (rate, stability), personalization performance (accuracy on...

AI automates routine and some mid-skill tasks, reducing employment in those occupations.

Empirical task-based exposure measures mapping AI capabilities to occupational task content, microdata analyses of employment by occupation using household/employer/administrative datasets, and panel regressions/decompositions that document within-occupation declines and between-occupation shifts.

high negative Intelligence and Labor Market Transformation: A Critical Ana... employment levels in routine and mid-skill occupations

Relying on secondary literature limits the paper's ability to make causal inferences and constrains empirical generalizability to all sectors or countries.

Stated limitations in the paper's Data & Methods section acknowledging scope and inferential constraints.

high negative Who Loses to Automation? AI-Driven Labour Displacement and t... causal inference strength and generalizability of conclusions

Increases in K_T reduce employment levels in affected firms and industries even when aggregate productivity rises.

Panel econometric estimates at firm and industry levels relating K_T intensity to employment outcomes, controlling for demand, input prices, and firm characteristics; difference-in-differences specifications and instrumental-variable robustness checks; corroborated by sectoral case studies.

high negative The Macroeconomic Transition of Technological Capital in the... employment (firm- and industry-level employment counts or employment growth)

Rising technological capital (K_T) — proxied by robot/automation density, software and intangible capital accumulation, AI adoption surveys, and AI-related patenting — leads to a decline in labor’s share of output.

Firm- and industry-level panel regressions linking constructed K_T intensity measures to labor shares, supported by macro growth-accounting decompositions; robustness checks include difference-in-differences and instrumenting adoption with plausibly exogenous shocks (e.g., cross-border technology diffusion, trade shocks); validated with cross-country comparisons and case studies.

high negative The Macroeconomic Transition of Technological Capital in the... labor share of income (share of output paid to labor)

Fuel subsidy reform imposed an enormous fiscal burden that peaked at 2.8% of GDP in 2022, limiting the macroeconomic leverage of AI-driven efficiency gains.

Reported fiscal statistic in the paper (2.8% of GDP in 2022) and its role in analysis of why AI savings do not translate into large macro gains.

high negative (constraint) AI-Based Technological Transformation as a Driver for Develo... fiscal burden of fuel subsidies (% of GDP) and its moderating effect on GDP/trad...

The oil and gas trade balance remained in deficit at -1.55 billion USD in May 2025 and -1.58 billion USD in July 2025 despite an overall national trade surplus.

Reported trade-balance figures in the paper (monthly trade statistics for May and July 2025).

high negative (deficit persists) AI-Based Technological Transformation as a Driver for Develo... oil & gas trade balance (USD, monthly values)

The ABC-Bench tasks require a combination of biology and software expertise.

Authors' description of task design highlighting that tasks (robot control code, DNA design, screening evasion) combine biological domain knowledge with software/coding skills.

high neutral ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecu... skill mix required for benchmark tasks

ABC-Bench evaluates LLM agents on both benign and dual-use biology tasks, including: writing code to operate liquid handling robots, designing DNA fragments for in vitro assembly, and evading DNA synthesis screening.

Description of benchmark tasks provided in the paper; task list enumerated by the authors as components of ABC-Bench.

high neutral ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecu... types of tasks included in ABC-Bench

We introduce the Agentic Bio-Capabilities Benchmark (ABC-Bench), a suite of tasks to measure agentic biosecurity-relevant capabilities.

The paper describes the design and composition of ABC-Bench and presents evaluation results using it; this is a methodological contribution asserted by the authors.

high neutral ABC-Bench: An Agentic Bio-Capabilities Benchmark for Biosecu... existence and composition of the ABC-Bench benchmark

The core problem is not the absence of explanation but the absence of structured reasoning in the first place.

Conceptual argument/proposed reframing presented in the paper; no empirical test reported.

high neutral Beyond Post-hoc Explanation: Toward Glassbox AI via Probabil... presence of structured reasoning vs. post-hoc explanation

The evidence base was concentrated in system-facing applications that detect or shape inequities within recruitment, evaluation and exposure systems.

Synthesis result from the scoping review indicating thematic concentration across included studies (as reported in abstract).

high neutral Artificial intelligence applications supporting women’s care... focus of existing empirical studies (system-facing vs individual-facing applicat...

Agentic AI is best characterized as a continuum of autonomy and delegated authority, distinct from purely informational outputs and including systems capable of independently generating insured events through external actions.

Conceptual taxonomy and definitional argument presented in the paper distinguishing informational models from agentic systems with delegated authority; theoretical reasoning and classification.

high neutral Insurance of Agentic AI characterization of agentic AI along autonomy/delegation continuum

The benchmark probes 18 mainstream LLMs across four prompting strategies.

Benchmark experiments described in the paper evaluate 18 mainstream LLMs using four different prompting strategies applied to the collected dataset.

high neutral Benchmarking LLMs for Community Governance Simulation with L... coverage of models and prompting strategies in benchmark (number of LLMs and pro...

« Prev 1 2 3 … 30 31 32 … 138 139 Next »