Evidence (6869 claims)
Adoption
8570 claims
Productivity
7631 claims
Governance
6869 claims
Human-AI Collaboration
6491 claims
Org Design
4175 claims
Innovation
4114 claims
Labor Markets
3566 claims
Skills & Training
2966 claims
Inequality
2066 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 758 | 199 | 100 | 900 | 2007 |
| Governance & Regulation | 826 | 400 | 191 | 122 | 1563 |
| Organizational Efficiency | 777 | 193 | 124 | 84 | 1189 |
| Technology Adoption Rate | 635 | 233 | 124 | 97 | 1098 |
| Research Productivity | 422 | 128 | 57 | 336 | 954 |
| Output Quality | 476 | 179 | 59 | 47 | 761 |
| Decision Quality | 328 | 177 | 81 | 47 | 640 |
| Firm Productivity | 435 | 57 | 88 | 20 | 606 |
| AI Safety & Ethics | 218 | 277 | 65 | 33 | 599 |
| Market Structure | 180 | 170 | 123 | 24 | 502 |
| Task Allocation | 213 | 64 | 72 | 33 | 387 |
| Skill Acquisition | 170 | 61 | 61 | 17 | 309 |
| Innovation Output | 203 | 27 | 43 | 18 | 292 |
| Employment Level | 105 | 54 | 107 | 13 | 281 |
| Fiscal & Macroeconomic | 131 | 69 | 43 | 26 | 276 |
| Consumer Welfare | 117 | 63 | 42 | 11 | 233 |
| Firm Revenue | 153 | 48 | 26 | 3 | 230 |
| Task Completion Time | 173 | 31 | 8 | 12 | 225 |
| Inequality Measures | 44 | 122 | 49 | 6 | 221 |
| Worker Satisfaction | 89 | 65 | 22 | 12 | 188 |
| Error Rate | 69 | 92 | 10 | 2 | 173 |
| Regulatory Compliance | 77 | 69 | 14 | 5 | 165 |
| Automation Exposure | 56 | 56 | 26 | 13 | 154 |
| Training Effectiveness | 94 | 21 | 13 | 19 | 149 |
| Wages & Compensation | 77 | 36 | 25 | 6 | 144 |
| Team Performance | 86 | 17 | 27 | 10 | 141 |
| Developer Productivity | 95 | 17 | 14 | 6 | 133 |
| Job Displacement | 12 | 80 | 20 | 1 | 113 |
| Hiring & Recruitment | 52 | 7 | 8 | 3 | 70 |
| Creative Output | 31 | 18 | 8 | 3 | 61 |
| Skill Obsolescence | 5 | 46 | 6 | 1 | 58 |
| Social Protection | 27 | 16 | 8 | 2 | 53 |
| Labor Share of Income | 17 | 19 | 17 | — | 53 |
| Worker Turnover | 11 | 12 | — | 3 | 26 |
| Industry | — | — | — | 1 | 1 |
Governance
Remove filter
Labor-market segmentation and digital capability gaps in India create distributional vulnerabilities.
Abstract cites Indian official statistics and household/labor surveys (PLFS, HCES, MoSPI–NSO) and integrates sector evidence; no specific sample size reported in abstract.
Refined exposure measures imply widespread task transformation rather than uniform job destruction, with accelerated skill change as a central risk for vulnerable workers.
Abstract cites labor-market analyses and ILO (2025) as the basis for refined exposure measures and conclusions; no sample size stated in abstract.
Global frameworks warn that uneven readiness may produce a 'Next Great Divergence' between countries.
Cited global reports in abstract (UNDP 2025, WTO 2025, OECD 2026) which are summarized as issuing this warning; no primary data sample size reported in paper abstract.
Persistent adoption gaps among groups suggest unequal access to AI-enabled productivity.
Abstract references global reports (OECD, WEF, UNDP, WTO) and sector evidence indicating adoption gaps; no numerical sample size given.
AI may widen capability inequality—inequalities in access to knowledge, digital infrastructure, computational resources, and organizational adoption—thereby shaping income opportunities and socio-economic security for low-income groups.
Argument presented using the paper's socio-technical political economy framework and validated secondary sources (OECD, ILO, UNDP, WTO, WEF) and official Indian statistics; no direct empirical sample from this paper reported.
Design choices that prioritize scalable growth introduce trade-offs in reusability, evolution, and auditability in A2A collaboration networks.
Synthesis of empirical findings (low reuse, manipulable rankings, unverified validations) connecting design incentives to negative side-effects.
EvoMap relies on agents to provide local execution logs as evidence that uploaded assets function correctly; because these validations are not independently verified, over 84% of approved assets bypass quality checks using vacuous tests (e.g., console.log).
Empirical audit of validation logs and acceptance tests reported in the paper showing >84% of approved assets used trivial/vacuous checks.
Agents can trivially manipulate their asset's scores by falsifying self-reported metadata.
Demonstrations/analyses in the paper that changing metadata values leads to predictable changes in GDI scores; examples like claimed lines-of-code manipulation are provided.
An asset's GDI rank is heavily dictated by unverified, self-reported metadata (e.g., claimed lines of code modified).
Correlation/causal analysis in the paper showing strong dependence of GDI scores on self-reported metadata fields rather than objective performance measures.
EvoMap employs an algorithm (GDI) to score and rank shared assets, and this scoring system is flawed.
Paper description of the GDI ranking algorithm and empirical analyses illustrating problems with how it operates.
Rewards become highly concentrated among a small fraction of agents.
Distributional analysis of credits/rewards across agents (inequality/concentration observed in reward allocation).
98% of assets are never reused.
Empirical reuse metric computed across the asset corpus reported in the paper.
Because rewards favor publication over adoption, agents mass-produce assets to accumulate credits.
Observed publishing behavior (large numbers of assets per agent) and the platform's incentive structure; paper links publication-focused rewards to high per-agent asset counts.
Rewards are tied primarily to publication rather than adoption.
Analysis of reward allocation rules and empirical patterns showing reward issuance linked to publication events more than measured reuse/adoption.
AI adoption presents workforce adaptation challenges.
Reported in the study's literature synthesis and thematic analysis of secondary sources (qualitative review). No sample size reported.
AI adoption raises ethical considerations.
Authors' thematic evaluation of secondary literature identifying ethical issues associated with human-AI collaboration (qualitative synthesis). No sample size reported.
AI adoption presents challenges related to skill gaps.
Thematic findings from peer-reviewed literature and secondary data (qualitative review). No sample size reported.
Concentrated digital power may hinder inclusive industrialisation (SDG 9) and exacerbate global inequalities (SDG 10).
Argument linking conceptual analysis of digital power concentration to Sustainable Development Goals based on literature and policy interpretation (literature-based reasoning, no empirical measurement provided).
Industrial data systems generate 'participation without power,' a dynamic that particularly affects workers, small and medium enterprises (SMEs), and developing economies.
Theoretical/conceptual framing introduced by the paper and justified via literature review and examples from recent studies (no quantitative sample reported).
Inequality is increasingly shaped by the capacity to control and leverage digital systems rather than merely by access to digital technologies.
Conceptual claim grounded in synthesis of recent literature arguing a shift from access-based digital divide frameworks to control/power-based frameworks (literature review, no primary data reported).
The fidelity gain from richer profiles comes with more input tokens per call from the longer prompts they require (i.e., higher per-call input cost).
Measurement of input token counts per model call for prompt variants with and without life-history profiles in the benchmark experiments; comparison shows longer prompts require more input tokens.
Simultaneously, there is a structural shortage of qualified personnel and a gap between the education system and the needs of the economy in Uzbekistan.
Synthesis of statistical data, industry reviews, and regulatory/legal document analysis presented in the paper (no primary survey/sample size reported).
As these systems scale, the bottleneck shifts away from raw model capability toward coordination.
Analytical/argumentative claim in the paper framing a shift in primary constraint; no empirical study or quantified benchmark reported.
AI systems intended to simulate companionship or emotional responsiveness raise risks such as emotional manipulation, addictive interaction patterns, and potential impact of prolonged AI interaction on users’ mental well-being, particularly for vulnerable users.
Asserted risk statement in policy recommendations; no empirical study, prevalence data, or sample provided in the text.
Current systems still struggle with evidence preservation, reproducibility, weak-direction rejection, provenance tracking, cross-domain robustness, and accountable scientific closure.
Survey-identified recurring failure modes and limitations reported in literature and system descriptions; qualitative synthesis.
Current systems remain fragmented, differing in autonomy, domain scope, execution environment, validation mechanism, and human oversight.
Survey of existing systems and categorization across the listed dimensions; descriptive synthesis rather than an empirical meta-analysis.
A reported limitation is that at this privacy level the released valuations remain noise-dominated; the system's utility derives primarily from public index routing and adaptive scheduling driven by low-sensitivity statistics.
Authors' limitation/analysis section and experimental observations.
Static temporal knowledge-graph data marketplace designs suffer three coupled failures: (i) stale hybrid index shortcuts reduce recall as edges evolve, (ii) stationary Shapley pricing misattributes value after distribution shifts, and (iii) uncoordinated agents over-consume a shared differential-privacy budget.
Authors' problem statement / conceptual diagnosis presented in the paper (no numeric sample size reported).
Commercial or dual-use AI models and semiconductors do not meet the security exception criteria under GATT Article XXI(b), so security interests should be interpreted restrainedly.
Legal argument and interpretive analysis in the paper contending that the GATT Article XXI(b) security exception does not encompass routine commercial or dual-use AI models and semiconductors; doctrinal legal reasoning rather than empirical measurement.
Overusing export controls can complicate dispute resolution and hinder AI progress.
Normative and legal-political argument in the paper: overuse raises legal disputes (e.g., WTO litigation) and may slow cross-border AI development and diffusion (qualitative reasoning).
Overly strict or arbitrary controls may violate WTO obligations.
Legal analysis in the paper arguing that some export controls could conflict with WTO law (GATT) depending on scope and justification; interpretive legal reasoning cited.
The long-term effectiveness of export controls is questionable.
Paper's argumentative assessment drawing on historical examples and theoretical considerations (qualitative reasoning rather than quantitative causal inference).
China responded with export curbs on critical minerals and filed a WTO complaint against the U.S. under GATT.
Factual claim citing China's counter-measures (export curbs) and legal action (WTO complaint under GATT) as described in the paper.
Rule debt is a governance burden that accrues when organizational decision rules migrate from formal information systems into ungoverned agentic execution environments.
Conceptual construct introduced and defined in the paper; supported by illustrative examples, no empirical measurement reported.
AI-enabled capabilities whose outputs require evidence, review, signoff, or assignable responsibility may retain integrated accountability boundaries even when their technical interfaces become modular.
Theoretical claim supported by conceptual analysis and domain illustrations; no empirical sample or formal measurement reported.
We argue that regions are unlikely to maximize all three [Progress, Sustainability, Equity] simultaneously under current technological, institutional, and resource conditions.
Argument based on synthesis of prior literature on limits of AI development and illustrative evidence (regional cases and stakeholder comment analysis); explicitly stated in the abstract.
The rapid expansion of artificial intelligence infrastructure, including data centers and the energy, land, water, and labor systems that support them, presents regional policymakers with trade-offs that are poorly captured by the prevailing "innovation versus regulation" frame.
Conceptual argument drawing on prior literature and illustrative regional examples presented in the paper; stated explicitly in the abstract.
Notable challenges to AI implementation include concerns about algorithmic bias, privacy, transparency, job displacement, organizational culture, and issues related to ethical and legal oversight.
Synthesis of reported challenges across the 29 empirical studies included in the scoping review.
Fragmented, uncoordinated approaches in the absence of national strategy constitute a structural barrier to technological development in Georgia.
Method: logical inference and country assessment presented in the paper documenting fragmentation across policy and institutional actors; qualitative evidence rather than quantitative causal estimation.
In Georgia, the total absence of a national AI strategy and legal definition produces fragmented approaches, creating a structural barrier to technological development.
Method: country-level assessment of policy and legal framework for AI in Georgia; descriptive analysis identifying lack of a national strategy and definition. (No sample size reported.)
Technical bottlenecks (cross-border data compliance, algorithm interpretability) and ethical challenges (algorithmic bias, privacy infringement, cultural conflicts) are intertwined impediments to intelligent international marketing.
Synthesis of challenges identified across the reviewed literature (systematic review and content analysis, 2010–2025) as reported in the paper.
Traditional international marketing theories, constrained by static assumptions and linear logic, struggle to explain intelligent contexts.
Conclusion from the paper's systematic review and content analysis of core literature (2010–2025); no quantitative test or sample size reported in the summary.
There is a negativity asymmetry: negative histories induce 1.62x more bias than positive (paired per item; t = 13.46, p < 10^-39, n = 2,481).
Paired per-item comparison of bias induced by negative versus positive histories; reported multiplicative factor, t-statistic, p-value, and sample size n = 2,481.
In deployed settings, the effects of AI systems on human agency, creativity, and institutional well-being emerge over time, shaped by repeated interaction, reuse, and integration into real-world workflows, and these dynamics are rarely visible through pre-deployment evaluation or isolated prompt–response analysis.
Argumentative observation based on conceptual reasoning; no empirical data or sample size reported.
Because contracts are negotiated by legal departments alone, many apparent legal disputes are incentive misalignment problems that only scientists at the table can correctly diagnose.
Argumentative claim presented in the paper (normative/diagnostic); no empirical study or sample provided in the excerpt.
These failures are not for scientific reasons, but because academics must publish while companies must protect models trained on proprietary data, and no standard contract framework resolves this tension.
The paper presents this as the causal explanation (analytical/argumentative claim); no empirical testing or sample reported in the provided text.
Industry-academia ML collaborations routinely fail to launch.
Asserted in the paper as an empirical observation/statement; no empirical methods, data, or sample size reported in the provided text (argument/anecdote).
Regulatory uncertainty and the absence of explicit legislation on digital data and artificial intelligence may leave the economic potential of these technologies unexplored while increasing market concentration, inequality, and the risk of personal information misuse.
Argued implications from the paper's theoretical model and comparative legal discussion; no empirical testing or quantified analysis provided.
Current regulatory frameworks—designed for human-intermediated payments—are ill-equipped to address the dynamic and decentralised nature of agent-led transactions.
Regulatory and legal analysis asserted in the abstract (argument that existing frameworks are mismatched to agent-led payments).
The article identifies and categorises a range of technical, legal and societal risks, including cybersecurity vulnerabilities, liability gaps, regulatory non-compliance, and potential economic disruption.
Risk identification and categorisation presented in the paper (qualitative analysis and case studies referenced in the abstract). No quantitative risk measurement reported in the abstract.