Evidence (6869 claims)
Adoption
8570 claims
Productivity
7631 claims
Governance
6869 claims
Human-AI Collaboration
6491 claims
Org Design
4175 claims
Innovation
4114 claims
Labor Markets
3566 claims
Skills & Training
2966 claims
Inequality
2066 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 758 | 199 | 100 | 900 | 2007 |
| Governance & Regulation | 826 | 400 | 191 | 122 | 1563 |
| Organizational Efficiency | 777 | 193 | 124 | 84 | 1189 |
| Technology Adoption Rate | 635 | 233 | 124 | 97 | 1098 |
| Research Productivity | 422 | 128 | 57 | 336 | 954 |
| Output Quality | 476 | 179 | 59 | 47 | 761 |
| Decision Quality | 328 | 177 | 81 | 47 | 640 |
| Firm Productivity | 435 | 57 | 88 | 20 | 606 |
| AI Safety & Ethics | 218 | 277 | 65 | 33 | 599 |
| Market Structure | 180 | 170 | 123 | 24 | 502 |
| Task Allocation | 213 | 64 | 72 | 33 | 387 |
| Skill Acquisition | 170 | 61 | 61 | 17 | 309 |
| Innovation Output | 203 | 27 | 43 | 18 | 292 |
| Employment Level | 105 | 54 | 107 | 13 | 281 |
| Fiscal & Macroeconomic | 131 | 69 | 43 | 26 | 276 |
| Consumer Welfare | 117 | 63 | 42 | 11 | 233 |
| Firm Revenue | 153 | 48 | 26 | 3 | 230 |
| Task Completion Time | 173 | 31 | 8 | 12 | 225 |
| Inequality Measures | 44 | 122 | 49 | 6 | 221 |
| Worker Satisfaction | 89 | 65 | 22 | 12 | 188 |
| Error Rate | 69 | 92 | 10 | 2 | 173 |
| Regulatory Compliance | 77 | 69 | 14 | 5 | 165 |
| Automation Exposure | 56 | 56 | 26 | 13 | 154 |
| Training Effectiveness | 94 | 21 | 13 | 19 | 149 |
| Wages & Compensation | 77 | 36 | 25 | 6 | 144 |
| Team Performance | 86 | 17 | 27 | 10 | 141 |
| Developer Productivity | 95 | 17 | 14 | 6 | 133 |
| Job Displacement | 12 | 80 | 20 | 1 | 113 |
| Hiring & Recruitment | 52 | 7 | 8 | 3 | 70 |
| Creative Output | 31 | 18 | 8 | 3 | 61 |
| Skill Obsolescence | 5 | 46 | 6 | 1 | 58 |
| Social Protection | 27 | 16 | 8 | 2 | 53 |
| Labor Share of Income | 17 | 19 | 17 | — | 53 |
| Worker Turnover | 11 | 12 | — | 3 | 26 |
| Industry | — | — | — | 1 | 1 |
Governance
Remove filter
Trajectory-level evaluation is essential in regulated domains.
Conclusion drawn by the authors based on the ASR findings (hidden shortcuts, metric blind spots, and remediation gains); presented as a policy/recommendation implication.
Gradient attribution is established as a computationally validated signal for model-informed reward allocation in participatory weather sensing.
Synthesis/conclusion in paper based on the computational experiments and evaluations (results across >400 configurations demonstrating fidelity and limitations).
Attribution captures near-optimal sensor placement utility with monotonically faithful payments.
Comparative experiments in the paper showing that gradient attribution corresponds closely to near-optimal sensor placement utility and yields monotonically faithful payment signals (experimental comparisons to optimal/benchmark placements).
Embedding governance into agent reasoning produces more consistent, explainable, and auditable compliance than external enforcement.
Comparative claim asserted in the paper, apparently supported by the reported production deployment results (95% compliance, zero false escalations); explicit experimental comparison details are not provided in the abstract.
Technology has increased efficiency in organisations based in large cities in India.
Review result statement claiming observed efficiency gains in urban organisations according to the literature summarized; based on reviewed studies (no single sample size reported in excerpt).
Controversial questions frequently result in an AIO.
Analysis of the 11,500-query benchmark with annotation/identification of 'controversial' queries and observed higher incidence of AIO generation for those queries.
Prompt modifications, Chain-of-Thought (CoT) reasoning, and visual token reduction can mitigate visual-priming effects on VLM behavior (with varying effectiveness across models).
Intervention experiments applying prompt engineering, CoT-style prompts, and reducing the number of visual tokens to observe whether these interventions reduce the influence of image content and color cues on IPD choices across several VLMs. (Abstract states these mitigation strategies were explored and their effectiveness varied by model; precise quantitative mitigation effects not provided in abstract.)
The proposed, validated model can equip fintech managers and regulators with a governance-based approach to tackling algorithmic bias and better position them to engender trust and financial inclusion.
Concluding assertion based on the integrated framework developed from the SLR (45 papers) and the structured five-expert validation; positioned as the intended practical utility of the model rather than an empirically measured outcome.
Our results establish C2C as a testbed for studying and building LM-based agents that can navigate the sophisticated coordination required for real-world deployments.
Authors' interpretation/implication based on the experiments and dataset produced (conclusion statement).
The paper proposes a safety-oriented inductive bias for rational AI decision-makers whose desiderata align with implementable policy constraints in high-stakes, low-signal situations.
Theoretical proposal and normative argument in the paper linking the proposed inductive bias (negligibility threshold and associated norms) to policy-implementable constraints; argued rather than empirically demonstrated.
These patterns are consistent with transfer emerging through accumulated interaction between owners (or owners' computer environments) and their agents in everyday use.
Interpretation offered by the authors based on observed alignment patterns and robustness checks; the paper argues consistency with an interaction-driven transfer mechanism rather than providing a direct experimental causal test.
This transfer persists among agents without explicit configuration.
Subgroup analyses (described in paper) isolating agents lacking explicit configuration settings and comparing behavioral alignment to owners; reported persistence of alignment in that subgroup.
Trade unions have increasingly pursued algorithmic transparency and stronger technology governance rights through collective bargaining, and governments are accelerating legislative initiatives to establish and protect workplace technology rights.
Descriptive review of labor-movement responses and recent government legislative initiatives reported in the literature (case studies and policy reviews).
Visibility mechanisms, such as public algorithm registers or role-sensitive explainability, can be effective tools in regaining citizen trust.
Review examines studies on transparency/visibility mechanisms; abstract states these mechanisms are examined for effectiveness but does not report definitive quantitative results or study counts.
Enterprise deployment of long-horizon decision agents in regulated domains (underwriting, claims adjudication, tax examination) is dominated by retrieval-augmented pipelines despite a decade of increasingly sophisticated stateful memory architectures.
Stated observation/argument in the paper's introduction; no empirical sample size or systematic industry survey reported in the abstract.
An accompanying open-source interactive tool, the Co-creation Provenance Lab, enables policymakers to audit and iteratively improve summaries, establishing genuine human-in-the-loop oversight at scale.
Statement in the paper about an open-source tool released alongside the research; likely demonstration or software repository provided.
AI adoption enhances the reliability of financial reporting and the effectiveness of audits by reducing information asymmetry and strengthening internal monitoring processes.
Argument grounded in theory and supported empirically via SEM showing AI adoption associated with greater reporting transparency and internal control quality, which are linked to higher audit quality.
AI-enabled reporting systems strengthen firm-level governance mechanisms (e.g., reporting transparency and internal controls), which enhances audit quality (governance substitution perspective complemented by institutional and technology diffusion theories).
Theoretical framing (governance substitution, institutional and technology diffusion theories) combined with empirical SEM results linking AI adoption to proxies for governance (reporting transparency, internal control quality) and to audit quality.
Differences in institutional quality, digital infrastructure, and absorptive capacity explain the disparity in technology impacts between GCC and non-GCC countries.
Exploratory/mediation or interaction analysis linking institutional quality, measures of digital infrastructure, and absorptive capacity to heterogeneity in estimated technology effects across countries in the panel.
Developing and further developed countries only integrate with China, signaling China's expanding influence over the international AI research landscape.
Observed integration patterns in the publication-based collaboration and citation networks showing that (some) developing and further developed countries connect primarily with China rather than the US; comparison to randomized networks.
The calibration mapping suggests Google and OpenAI face conditions most conducive to foreclosure.
Outcomes of the paper's stylized calibration/comparative mapping across four providers (April 2026 data); authors' interpretation.
Artificial intelligence algorithms are increasingly used by firms to set prices.
Statement in paper's introduction/abstract referencing prior adoption trends; no specific empirical study or sample reported in the excerpt.
LinuxArena is the largest and most diverse control setting for software engineering to date.
Authors assert this comparative claim based on the reported scale and diversity (20 environments, 1,671 main tasks, 184 side tasks); no detailed comparison data included in the excerpt.
This paper presents the first comparative study of game-theoretic mechanisms designed to enable cooperative outcomes between rational agents in equilibrium.
Authors' characterization of their contribution: a comparative study across four social dilemmas evaluating multiple mechanisms; no external validation provided in excerpt.
AI adoption improves efficiency, cost reduction, and strategic innovation.
Synthesis across included empirical studies reporting organizational outcomes following AI implementation (effects reported qualitatively across the 27 studies).
Exploitative innovation is associated with performance through incremental efficiency mechanisms.
Authors' interpretation of model results from the survey (104 managers) suggesting exploitative innovation improves performance via incremental efficiency, though specific mechanisms were not separately measured.
This is the first impossibility result in AI governance, establishing a formal boundary below which current paradigms remain valid and above which distributed accountability mechanisms become necessary.
Claim of novelty in the paper (author assertion). The paper provides the formal theorem and discusses implications; novelty relative to prior literature is asserted but not empirically demonstrated.
Human-in-the-loop governance is a practical lever to align GenAI productivity with environmental efficiency.
Interpretation of the experimental results: findings that certain prompt-based governance (operational constraints/decision rules) reduced footprint while preserving outputs, leading to the recommendation (argumentative claim).
Inference efficiency and system level optimisation are growing rapidly in the Green AI literature.
Temporal / thematic analysis of literature cited in the paper's mapping (asserted growth; no growth rates or counts provided in abstract).
As a consequence of these dynamics, 'algorithmic unions' (organised, coordinated resistance) may evolve organically as a survival strategy against over-optimized management systems.
Interpretation/implication drawn from the EGT model results (theoretical suggestion), not supported by empirical observations in the paper.
The analysis implies specific implications for healthcare leadership and procurement (e.g., procurement and leadership should consider incentive and risk-allocation effects, not just task optimisation).
Authors' conclusions/recommendations drawn from the theoretical analysis and typology (prescriptive claim in the paper; no empirical evaluation reported in the abstract).
AI enhances innovation and productivity, even though it currently contributes to higher CO2 emissions.
Statement in the study linking AI adoption to improvements in innovation and productivity alongside the empirical finding of higher CO2 emissions (based on the same cross-country panel analysis over 2000–2023).
Intelligent manufacturing policies can generate economically meaningful benefits by improving firms’ sustainability performance and the credibility of ESG information, which are central to capital allocation and the effectiveness of green governance.
Synthesis/implication drawn from the empirical findings reported in the paper (positive effects on ESG ratings, reduced greenwashing, and lower ESG uncertainty).
AI-enabled ESG ratings, green innovation, ethical AI, RegTech, and explainable AI in finance are becoming highly influential in international financial markets.
Paper identifies these themes as emerging and influential based on trends in the reviewed literature and topical focus areas; no quantitative adoption metrics or sample sizes are provided in the excerpt.
Public Model Context Protocol (MCP) server repositories are the current predominant standard for agent tools.
Paper asserts MCP servers are the predominant standard and uses these repositories as the primary monitoring source.
Drawing on analysis of agentic investment firm operational models demonstrating 50-70% cost reductions while maintaining fiduciary standards.
Internal analysis/modeling of agentic investment firm operational models reported by the authors; paper states the 50–70% cost reduction result but provides no sample size or detailed empirical validation in the provided text.
Using machine learning applied to news streams constitutes a practical method to augment existing fiscal surveillance tools.
Paper asserts practical applicability of ML + news for surveillance; presented as recommendation/claim rather than documented large-sample trial in the provided excerpt.
Incorporating news-based signals into machine-learning models can enhance regulatory practice by improving detection of potential fiscal instabilities.
Paper claims an empirical analysis and synthesizes findings linking news-derived signals and ML methods to improved regulatory monitoring; specific datasets, evaluation metrics, and sample sizes are not provided in the excerpt.
The framework offers a replicable model for governments and institutions seeking to proactively support high-potential innovations across sectors.
Paper asserts replicability and applicability to governments/institutions based on the described methods and outputs; no deployment case studies or empirical replication evidence reported in text provided.
A data-driven, foresight-based approach to policy design significantly enhances responsiveness, precision, and resource efficiency in science and technology governance.
Paper concludes this benefit based on its integrated framework, triangulation, Delphi/AHP validation and illustrative mapping; no quantified comparative metrics or experimental evaluation reported in text provided.
These findings provide quantitative foundations for AI capability-threshold governance.
Synthesis/interpretation of model results and empirical validation described in the paper (recommendation/implication).
The paper introduces the Distributed Human Data Engine (DHDE), a socio-technical framework previously validated in biological crisis management, and adapts it for regional economic flow optimization.
Author statement describing the DHDE and asserting prior validation in biological crisis management; adaptation described in paper (methodological description).
The ACT represents the first open-source effort to consolidate data on Africa's evolving HPC landscape, aiming to encourage more transparency from local AI stakeholders and facilitate broader access for AI developers.
Authors' characterization of ACT as a novel, open-source consolidation; assertion based on literature/tools review performed by the authors and on the tool's stated goals.
The results contribute to literature arguing that cloud-based GenAI is a source of enterprise value creation rather than merely an experimental technology.
Paper's stated addition to the existing literature based on the combined empirical and theoretical findings.
Orchestrated systems of smaller, domain-adapted models can mathematically outperform frontier generalist models in most institutional deployment environments.
Formal conditions and comparative analysis derived in the paper plus referenced/claimed empirical support across several domains (frontier lab dynamics, alignment evolution, sovereign AI pressures).
Debiasing via metadata redaction and explicit instructions restores detection in all interactive cases and 94% of autonomous cases.
Intervention experiments in Study 2 where metadata redaction and explicit instructions were applied to interactive assistants (e.g., GitHub Copilot) and autonomous agents (e.g., Claude Code); reported full restoration for interactive and 94% for autonomous.
An increasing number of enterprises are using the label of artificial intelligence merely as a cosmetic embellishment in their annual reports (the phenomenon of 'AI washing' is spreading).
Framing/background claim in the paper's introduction/abstract; implied support from the semantic analysis of annual report texts across Chinese A-share firms over 2006–2024.
There are ethical imperatives of fairness and transparency in automated wealth management, and the paper proposes a roadmap toward sustainable and interpretable financial AI.
Normative analysis and proposed roadmap described in the paper; the excerpt does not provide operationalized fairness metrics, interpretability methods, or evaluation results.
In environments characterized by high-frequency data, non-linear dependencies, and stochastic market regimes, autonomous DRL agents can learn optimal sequential decision-making policies that offer a compelling alternative to static or rule-based allocation strategies.
Argument based on theoretical suitability of DRL for sequential decision problems and the paper's system-level investigation; excerpt does not report specific experimental datasets, sample sizes, benchmarks, or performance metrics.
The integration of Deep Reinforcement Learning (DRL) into portfolio management represents a significant evolution from classical Mean-Variance Optimization and modern econometric frameworks.
Conceptual comparison and synthesis presented in the paper; no empirical sample size or experimental results are provided in the excerpt to quantify the degree of improvement.