Evidence (4333 claims)

Evidence Matrix

Claim counts by outcome category and direction of finding.

Outcome	Positive	Negative	Mixed	Null	Total
Other	402	112	67	480	1076
Governance & Regulation	402	192	122	62	790
Research Productivity	249	98	34	311	697
Organizational Efficiency	395	95	70	40	603
Technology Adoption Rate	321	126	73	39	564
Firm Productivity	306	39	70	12	432
Output Quality	256	66	25	28	375
AI Safety & Ethics	116	177	44	24	363
Market Structure	107	128	85	14	339
Decision Quality	177	76	38	20	315
Fiscal & Macroeconomic	89	58	33	22	209
Employment Level	77	34	80	9	202
Skill Acquisition	92	33	40	9	174
Innovation Output	120	12	23	12	168
Firm Revenue	98	34	22	—	154
Consumer Welfare	73	31	37	7	148
Task Allocation	84	16	33	7	140
Inequality Measures	25	77	32	5	139
Regulatory Compliance	54	63	13	3	133
Error Rate	44	51	6	—	101
Task Completion Time	88	5	4	3	100
Training Effectiveness	58	12	12	16	99
Worker Satisfaction	47	32	11	7	97
Wages & Compensation	53	15	20	5	93
Team Performance	47	12	15	7	82
Automation Exposure	24	22	9	6	62
Job Displacement	6	38	13	—	57
Hiring & Recruitment	41	4	6	3	54
Developer Productivity	34	4	3	1	42
Social Protection	22	10	6	2	40
Creative Output	16	7	5	1	29
Labor Share of Income	12	5	9	—	26
Skill Obsolescence	3	20	2	—	25
Worker Turnover	10	12	—	3	25

Governance Remove filter

Temporal mapping and citation networks reveal distinct technology maturity patterns, which are visualised using S-curve and hype cycle models.

Paper describes use of temporal mapping and citation network analysis and visualization via S-curve and hype cycle models; methodological description without quantitative sample-size details.

high positive Emerging Technologies Based on Large AI Models and the Desig... technology maturity patterns as revealed by temporal mapping and citation networ...

Technologies such as AI-driven healthcare, quantum communication, hydrogen energy, and smart educational AI are identified as key domains of convergence.

Paper reports these domains were identified via the applied analytic framework and multi-source data triangulation; no numeric counts/sample sizes provided.

high positive Emerging Technologies Based on Large AI Models and the Desig... identification of key converging technology domains

The study applies advanced techniques such as LDA topic modelling, BERT-based clustering, and co-citation analysis to detect innovation trajectories.

Paper states these specific analytic techniques were applied (method description).

high positive Emerging Technologies Based on Large AI Models and the Desig... detection of innovation trajectories using LDA, BERT clustering, co-citation ana...

The research leverages large AI models and multi-source data—including global patent databases (WIPO, USPTO, Lens.org), scientific literature corpora, and industry intelligence platforms (CB Insights, Qichacha).

Paper statement of data sources and use of large AI models; methodological description (no sample sizes reported).

high positive Emerging Technologies Based on Large AI Models and the Desig... use of multi-source data and large AI models for technology detection

A stylized-facts analysis using OECD and World Bank indicators shows that economies with higher digital capacity, greater R&D intensity, and stronger institutions exhibit superior productivity and growth performance.

Stylized-facts (cross-country) analysis based on OECD and World Bank indicators; descriptive correlations reported in the paper (sample of countries not enumerated in the provided summary).

high positive Artificial intelligence, institutional innovation and econom... productivity and economic growth (superior performance)

AI adoption stimulates institutional innovation, which in turn increases total factor productivity (TFP) and supports sustainable economic growth.

Theoretical mediation claim developed in the paper (integration of Schumpeterian growth theory with institutional economics); supported conceptually and argued with stylized-facts analysis but not presented as causally identified empirical estimates.

high positive Artificial intelligence, institutional innovation and econom... total factor productivity and economic growth (increase)

AI improves governance quality.

Argument within the conceptual framework linking AI capabilities (information processing, monitoring) to improved governance; stated qualitatively in the paper rather than supported by causal empirical tests.

high positive Artificial intelligence, institutional innovation and econom... governance quality (improvement)

AI lowers transaction costs.

Paper's conceptual/theoretical framework that characterizes AI as lowering transaction costs through improved information and coordination; no quantitative causal estimate reported.

high positive Artificial intelligence, institutional innovation and econom... transaction costs (reduction)

AI reduces information asymmetries.

Theoretical/conceptual argument in the paper framing AI as a general-purpose technology that improves information flows; supported by the paper's conceptual framework (no experimental or causal identification reported).

high positive Artificial intelligence, institutional innovation and econom... information asymmetries (reduction)

These systems are now being widely used to produce software, conduct business activities, and automate everyday personal tasks.

Authors' statement describing observed applications and uses (policy/legal analysis; specific empirical data or sample size not provided in excerpt).

high positive Regulating AI Agents use of AI agents across software production, business processes, and personal ta...

AI agents have entered the mainstream.

Authors' declarative statement based on their review of recent developments and observed uptake (policy/legal analysis in the paper). No empirical sample size reported in excerpt.

high positive Regulating AI Agents AI agent adoption / prevalence

Economies and organizations that prioritize adaptability, workforce transformation, and real-time decision-making capabilities are better positioned to sustain growth under volatile conditions.

Claim based on the paper's cross-cutting analysis of global indicators and the conceptual AEPM framework; the excerpt does not provide a quantified causal estimate, experimental evidence, or sample size supporting this assertion.

high positive Beyond Forecasting: Adaptive Economic Preparedness in a Geop... ability to sustain growth under volatile conditions

AEPM is structured around five core pillars—energy resilience, supply chain flexibility, human capital adaptability, financial sustainability, and AI-enabled decision systems—which together provide a comprehensive approach to managing uncertainty and enabling dynamic responses to structural disruptions.

Conceptual design of the AEPM presented in the paper; described as a multidimensional framework combining these five pillars. No empirical validation or quantified impact measures reported in the excerpt.

high positive Beyond Forecasting: Adaptive Economic Preparedness in a Geop... capacity to manage uncertainty and mount dynamic responses to structural disrupt...

The paper proposes shifting from forecasting-centric economic management to an adaptive preparedness paradigm and introduces the Adaptive Economic Preparedness Model (AEPM), a multi-dimensional framework designed to enhance resilience at both organizational and national levels.

Presentation of a conceptual model (AEPM) in the paper structured around five pillars; this is a proposed framework rather than an empirically validated intervention (no evaluation sample or randomized test reported in the excerpt).

high positive Beyond Forecasting: Adaptive Economic Preparedness in a Geop... resilience of organizations and nations to structural disruptions

The contribution is a falsifiable architectural thesis, a clear threat model, and a set of experimentally testable hypotheses for future work on distillation resistance, alignment, and model governance.

Theoretical contribution claim: the paper proposes hypotheses and a threat model intended to be testable in future empirical work; no experiments in the paper itself are reported.

high positive A Public Theory of Distillation Resistance via Constraint-Co... provision_of_falsifiable_thesis_and_testable_hypotheses

The authors call for shifting evaluation and assurance from tool qualification toward workflow qualification to achieve trustworthy Physical AI.

Normative recommendation based on the paper's theoretical analysis (policy/recommendation; no empirical sample reported).

high positive The Competence Shadow: Theory and Bounds of AI Assistance in... governance_and_regulation

The paper derives non-degradation conditions that characterize shadow-resistant workflows for AI-assisted safety analysis.

Analytic derivations and formal criteria presented in the paper (theoretical result; no empirical validation/sample size reported).

high positive The Competence Shadow: Theory and Bounds of AI Assistance in... output_quality

The paper formalizes four canonical human–AI collaboration structures and derives closed-form performance bounds for them.

Theoretical/mathematical derivations and models in the paper (no empirical verification/sample size reported).

high positive The Competence Shadow: Theory and Bounds of AI Assistance in... task_allocation

A five-dimensional competence framework captures safety competence via domain knowledge, standards expertise, operational experience, contextual understanding, and judgment.

Theoretical contribution: paper defines and formalizes a five-dimension framework (no empirical validation/sample size reported).

high positive The Competence Shadow: Theory and Bounds of AI Assistance in... skill_acquisition

To facilitate adoption of our evaluation framework, we detail our testing protocols and make relevant materials publicly available.

Statement in paper that testing protocols and materials are documented and released publicly (paper claims to provide materials).

high positive Evaluating Language Models for Harmful Manipulation availability of testing protocols and materials

We assess an AI model with 10,101 participants spanning interactions in three AI use domains (public policy, finance, and health) and three locales (US, UK, and India).

Reported sample size and study design details stated in abstract: N = 10,101; three domains and three locales specified.

high positive Evaluating Language Models for Harmful Manipulation sample composition and scale of the empirical study

This paper introduces a framework for evaluating harmful AI manipulation via context-specific human-AI interaction studies.

Paper describes a proposed evaluation framework (methodological contribution); claimed in abstract/introduction as new contribution. No numeric sample required for the claim itself.

high positive Evaluating Language Models for Harmful Manipulation existence of an evaluation framework for harmful AI manipulation

The result is evidence-based triggers that replace calendar schedules and make governance auditable.

Claimed outcome of applying the decision-theoretic framework in the paper (argumentative; no empirical deployment or case-study evidence reported in the summary).

high positive Retraining as Approximate Bayesian Inference retraining trigger design and governance auditability

The paper provides a decision-theoretic framework for retraining policies.

Explicit claim about the paper's contribution; the article presents a framework (conceptual/methodological exposition).

high positive Retraining as Approximate Bayesian Inference existence of a prescriptive framework for retraining policies

The retraining decision is a cost minimization problem with a threshold that falls out of your loss function.

Decision-theoretic derivation presented in the paper (analytical/theoretical reasoning; no empirical validation reported).

high positive Retraining as Approximate Bayesian Inference formalization of retraining decision rule (cost-minimization/threshold)

Retraining can be better understood as approximate Bayesian inference under computational constraints.

Theoretical argument and decision-theoretic framing presented in the paper (conceptual/mathematical derivation rather than empirical testing).

high positive Retraining as Approximate Bayesian Inference conceptual framing of retraining

The framework is designed for direct application to engineering processes for which operational event logs are available.

Statement of intended applicability in the paper and demonstration on a large enterprise procurement workflow (BPI 2019 log).

high positive The Stochastic Gap: A Markovian Framework for Pre-Deployment... adoptability / applicability to engineering processes

The same quantities that delimit statistically credible autonomy (blind masses, escalation gate, m(s), etc.) also determine expected oversight burden (the framework includes an expected oversight-cost identity over the workflow visitation measure).

Theoretical identity and discussion in the paper plus demonstration on the empirical workflow showing how the introduced quantities relate to expected oversight costs.

high positive The Stochastic Gap: A Markovian Framework for Pre-Deployment... expected oversight burden / oversight cost

On the held-out split, m(s) = max_a \hat{\pi}(a|s) tracks realized autonomous step accuracy within 3.4 percentage points on average.

Empirical evaluation on the paper's held-out test split (chronological 20%); reported average discrepancy between the maximum predicted action probability and realized autonomous-step accuracy.

high positive The Stochastic Gap: A Markovian Framework for Pre-Deployment... accuracy of autonomous step selection (realized autonomous step accuracy)

Refining the operational state to include case context, economic magnitude, and actor class expands the state space from 42 to 668.

Empirical report in the paper showing state-space expansion when additional contextual variables are included in state definition (numbers 42 and 668 stated).

high positive The Stochastic Gap: A Markovian Framework for Pre-Deployment... other

We instantiate the framework on the Business Process Intelligence Challenge 2019 purchase-to-pay log (251,734 cases, 1,595,923 events, 42 distinct workflow actions) and construct a log-driven simulated agent from a chronological 80/20 split of the same process.

Empirical instantiation described in the paper using the BPI 2019 purchase-to-pay event log; dataset statistics (cases, events, distinct actions) and an 80/20 chronological train/test split are reported.

high positive The Stochastic Gap: A Markovian Framework for Pre-Deployment... other

We develop a measure-theoretic Markov framework for agentic AI in organizations, whose core quantities are state blind-spot mass B_n(\tau), state-action blind mass B^{SA}_{\pi,n}(\tau), an entropy-based human-in-the-loop escalation gate, and an expected oversight-cost identity over the workflow visitation measure.

Theoretical development presented in the paper (definition and derivation of the measure-theoretic Markov framework and associated quantities).

high positive The Stochastic Gap: A Markovian Framework for Pre-Deployment... other

The framework aims to support more comparable benchmarks and cumulative research on human-AI readiness, advancing safer and more accountable human-AI collaboration.

Stated aims and intended impact in paper; aspirational/conceptual rather than empirically demonstrated in excerpt.

high positive From Accuracy to Readiness: Metrics and Benchmarks for Human... benchmarks, cumulative research, safety and accountability in human-AI collabora...

Operationalizing evaluation through interaction traces rather than model properties or self-reported trust enables deployment-relevant assessment of calibration, error recovery, and governance.

Methodological claim/proposed approach in paper; presented as enabling assessment but no empirical evaluation reported in excerpt.

high positive From Accuracy to Readiness: Metrics and Benchmarks for Human... assessment of calibration, error recovery, governance via interaction traces

The taxonomy and metrics are connected to the Understand-Control-Improve (U-C-I) lifecycle of human-AI onboarding and collaboration.

Conceptual mapping described in paper; no empirical tests or sample reported in excerpt.

high positive From Accuracy to Readiness: Metrics and Benchmarks for Human... linking metrics to U-C-I onboarding lifecycle

We introduce a four part taxonomy of evaluation metrics spanning outcomes, reliance behavior, safety signals, and learning over time.

Explicit methodological claim in paper announcing a taxonomy; described as a contribution rather than empirically tested in excerpt.

high positive From Accuracy to Readiness: Metrics and Benchmarks for Human... evaluation metrics taxonomy (outcomes, reliance behavior, safety signals, learni...

This paper proposes a measurement framework for evaluating human-AI decision-making centered on team readiness.

Methodological contribution presented in paper; conceptual framework proposed (no empirical validation reported in excerpt).

high positive From Accuracy to Readiness: Metrics and Benchmarks for Human... team readiness evaluation

Artificial intelligence (AI) systems are deployed as collaborators in human decision-making.

Statement in paper (conceptual/observational claim); no empirical sample or method provided in excerpt.

high positive From Accuracy to Readiness: Metrics and Benchmarks for Human... deployment of AI as collaborators

Late disclosure of AI involvement improved affective engagement for AI-enhanced content.

Reported experimental result in the abstract from the two online studies (study 1: n = 325; study 2: n = 371) manipulating disclosure timing (early vs. late).

high positive AI content labeling and user engagement on social media: The... affective engagement for AI-enhanced content under late disclosure

The results of this regional research outline a multi-dimensional policy roadmap that dives deep into the region’s current capabilities and the hurdles it faces in catching up with the AI revolution from a governance and policy perspective, presenting them in a practical framework for public sector leaders.

Report summary claiming that the study's results produce a comprehensive roadmap and practical framework (content description).

high positive Charting AI Governance Future in the Arab Region: A Policy R... comprehensiveness and practicality of the policy roadmap produced by the study

This executive report provides a roadmap for establishing an AI governance infrastructure through a set of strategic policy recommendations across seven key pillars.

Document assertion describing the content and structure of the report (authors' deliverable).

high positive Charting AI Governance Future in the Arab Region: A Policy R... existence of a multi-pillar policy roadmap in the report

The reality of limited AI governance capacity calls for a series of policy interventions at both local and regional levels to empower the AI ecosystem in the Arab region.

Authors' policy recommendation derived from the regional study and synthesis of findings.

high positive Charting AI Governance Future in the Arab Region: A Policy R... adoption of policy interventions to strengthen AI governance and ecosystem

A governance model linking 'trustworthy AI' practices to competitive advantage yields reduced uncertainty, faster deployment cycles, and higher stakeholder trust.

Central claim of the paper tying the proposed AIGSF to business benefits; supported by conceptual linkage and illustrative examples rather than quantified empirical evidence or controlled evaluation.

high positive Artificial Intelligence Governance In Corporate Strategy: Et... firm_revenue

Case illustrations across hiring, credit, consumer services, and generative AI draw lessons on controls such as model documentation, algorithmic audits, impact assessments, and human-in-the-loop oversight.

Paper includes qualitative case illustrations in the listed domains to demonstrate governance controls; these are presented as examples and lessons rather than as systematic empirical studies (no sample sizes reported).

high positive Artificial Intelligence Governance In Corporate Strategy: Et... regulatory_compliance

The paper develops an AI Governance Strategic Framework (AIGSF) and an implementation roadmap that connect ethical accountability, regulatory readiness, cybersecurity resilience, and performance outcomes.

Paper contribution described as an integrative conceptual framework and roadmap; supported by theoretical grounding and illustrative cases rather than empirical validation; no sample size provided.

high positive Artificial Intelligence Governance In Corporate Strategy: Et... organizational_efficiency

AI governance should be treated as a strategic governance function—anchored in board oversight and enterprise risk management—rather than a narrow technical or compliance task.

Central normative recommendation and thesis of the paper; derived from an integrative conceptual framework grounded in corporate governance theory, ERM, and emerging regulation. No empirical testing or sample reported.

high positive Artificial Intelligence Governance In Corporate Strategy: Et... governance_and_regulation

AI has moved from a peripheral digital capability to a central driver of corporate strategy, reshaping decision-making, customer engagement, operations, and risk exposure.

Statement presented in the paper's introduction and motivation; supported by integrative conceptual design and literature grounding (theory and descriptive citations). No empirical sample or quantitative analysis reported.

high positive Artificial Intelligence Governance In Corporate Strategy: Et... organizational_efficiency

A policy of 20% mandatory practice preserves 92% more capability than the simulation baseline (baseline includes a 5% background AI-failure rate).

Simulation comparing baseline (5% background AI-failure rate) to a counterfactual with 20% mandatory practice; reported 92% relative preservation of capability.

high positive The enrichment paradox: critical capability thresholds and i... preserved human capability under mandatory practice policy vs baseline

The model predicts that periodic AI failures improve human capability 2.7-fold (relative improvement reported in simulations).

Simulation experiments comparing scenarios with/without periodic AI failures; reported fold-change in capability of 2.7×.

high positive The enrichment paradox: critical capability thresholds and i... human capability (H) under periodic AI-failure regime

Validated against 15 countries' PISA data (102 points), the model achieves R^2 = 0.946 with 3 parameters and attains the lowest BIC among compared specifications.

Empirical validation using PISA dataset covering 15 countries and 102 data points; reported fit statistics (R^2, number of parameters, BIC).

high positive The enrichment paradox: critical capability thresholds and i... fit of model to PISA data (explained variance, model selection via BIC)

« Prev 1 2 3 … 26 27 28 … 86 87 Next »