Evidence (4114 claims)
Adoption
8570 claims
Productivity
7631 claims
Governance
6869 claims
Human-AI Collaboration
6491 claims
Org Design
4175 claims
Innovation
4114 claims
Labor Markets
3566 claims
Skills & Training
2966 claims
Inequality
2066 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 758 | 199 | 100 | 900 | 2007 |
| Governance & Regulation | 826 | 400 | 191 | 122 | 1563 |
| Organizational Efficiency | 777 | 193 | 124 | 84 | 1189 |
| Technology Adoption Rate | 635 | 233 | 124 | 97 | 1098 |
| Research Productivity | 422 | 128 | 57 | 336 | 954 |
| Output Quality | 476 | 179 | 59 | 47 | 761 |
| Decision Quality | 328 | 177 | 81 | 47 | 640 |
| Firm Productivity | 435 | 57 | 88 | 20 | 606 |
| AI Safety & Ethics | 218 | 277 | 65 | 33 | 599 |
| Market Structure | 180 | 170 | 123 | 24 | 502 |
| Task Allocation | 213 | 64 | 72 | 33 | 387 |
| Skill Acquisition | 170 | 61 | 61 | 17 | 309 |
| Innovation Output | 203 | 27 | 43 | 18 | 292 |
| Employment Level | 105 | 54 | 107 | 13 | 281 |
| Fiscal & Macroeconomic | 131 | 69 | 43 | 26 | 276 |
| Consumer Welfare | 117 | 63 | 42 | 11 | 233 |
| Firm Revenue | 153 | 48 | 26 | 3 | 230 |
| Task Completion Time | 173 | 31 | 8 | 12 | 225 |
| Inequality Measures | 44 | 122 | 49 | 6 | 221 |
| Worker Satisfaction | 89 | 65 | 22 | 12 | 188 |
| Error Rate | 69 | 92 | 10 | 2 | 173 |
| Regulatory Compliance | 77 | 69 | 14 | 5 | 165 |
| Automation Exposure | 56 | 56 | 26 | 13 | 154 |
| Training Effectiveness | 94 | 21 | 13 | 19 | 149 |
| Wages & Compensation | 77 | 36 | 25 | 6 | 144 |
| Team Performance | 86 | 17 | 27 | 10 | 141 |
| Developer Productivity | 95 | 17 | 14 | 6 | 133 |
| Job Displacement | 12 | 80 | 20 | 1 | 113 |
| Hiring & Recruitment | 52 | 7 | 8 | 3 | 70 |
| Creative Output | 31 | 18 | 8 | 3 | 61 |
| Skill Obsolescence | 5 | 46 | 6 | 1 | 58 |
| Social Protection | 27 | 16 | 8 | 2 | 53 |
| Labor Share of Income | 17 | 19 | 17 | — | 53 |
| Worker Turnover | 11 | 12 | — | 3 | 26 |
| Industry | — | — | — | 1 | 1 |
Innovation
Remove filter
The paper distinguishes technical substitutability (the feasible replacement ceiling implied by AGI capability) from actual adoption (the realized replacement share chosen under cost, profitability, and adoption frictions).
Conceptual/theoretical definition introduced in the political-economy model (no empirical sample; definitional argument within the paper).
The paper proposes a five-pillar diagnostic framework combining fundamental valuation, residual-exuberance tests, SADF/GSADF explosive-root procedures, LPPL/HLPPL price-pattern diagnostics, sentiment and issuance measures, and capex-payback analysis.
Methodological proposal presented in the paper (framework description); this is a stated contribution rather than an empirical result.
Traditional software and agentic systems are distinct: in traditional software code is the carrier of decision logic, whereas in agentic systems code is ephemeral tooling used by an LLM-driven reasoning loop.
Formalization and conceptual definitions developed in the paper (first-principles formal distinction; no empirical sample size reported).
For over half a century, software engineering has operated on a foundational premise: human engineers decompose problems, encode decision logic into static code, and manually adapt that code as requirements evolve.
Historical/descriptive claim presented in the paper's framing and literature review; citation of longstanding software engineering practices (qualitative, no empirical sample size reported).
The study introduces a methodological framework for evaluating LLM citation behaviors, integrating information retrieval theory, semantic search optimization, and structured content engineering.
Explicit claim about the paper's contribution: introduction of a methodological framework combining IR theory, semantic search, and structured content engineering. This is a factual statement about the paper's content (no sample size reported in excerpt).
Traditional SEO strategies have historically focused on keyword density, backlink authority, and ranking positions within search engine results pages (SERPs).
Descriptive claim about historical SEO practices presented as background/context in the paper; based on domain knowledge and literature references (no new empirical data reported in the excerpt).
We extend the representation-completion principle to device cold-start by constructing cohort-based embeddings from demographic features.
Methodological extension described in paper (approach for device cold-start handled via cohort-based demographic embeddings).
We propose Shallow-RHS, an asymmetric link-prediction architecture in which the left-hand side (LHS) device tower leverages temporally valid watch-history message passing to capture collaborative signals, while the right-hand side (RHS) content tower is intentionally shallow and encodes content solely from intrinsic features.
Model architecture description in paper (design specification; no numeric evaluation included in excerpt).
We formulate cold-start recommendation as an inductive graph-completion problem on a temporal bipartite device-content graph.
Methodological framing presented in the paper (problem formulation).
In Tubi's production retrieval system, new content must be assigned a standalone embedding immediately, and the model must also produce device embeddings suitable for approximate nearest-neighbor retrieval.
Description of production serving constraints in Tubi stated in paper (system design / operational constraint).
The study integrates ICT4D, socio-technical systems theory, and the capability approach as its theoretical framing.
Methodological/theoretical statement in the paper describing the integrative framework used for analysis.
While grounded in the DRC, the findings offer broader insights into AI adoption dynamics across informal economies in Sub-Saharan Africa and beyond.
Authors' claim of broader relevance/generalizability based on the DRC case study and theoretical framing.
AI adoption in the DRC emerges through hybrid socio-technical interactions between bottom-up youth innovation and weakly coordinated institutional frameworks, rather than following policy-led or infrastructure-first trajectories.
Theoretical integration (ICT4D, socio-technical systems, capability approach) and qualitative interview evidence used to characterize observed adoption pathways.
The article introduces 'compressed professionalization', defined as the accelerated acquisition and immediate market enactment of professional-level digital capabilities outside formal institutional pathways.
Conceptual/theoretical contribution presented and defined in the paper, supported by illustrative field observations from the interviews.
The study drew on 125 semi-structured interviews conducted in Kinshasa, Lubumbashi, and Goma.
Primary qualitative fieldwork reported in the paper: 125 semi-structured interviews across three DRC cities (Kinshasa, Lubumbashi, Goma).
The paper analyzes multiple dimensions of scientific creativity and impact, specifically recombinant novelty, object novelty, 3-year short-run citation impact, and 10-year long-run citation impact.
Methodological description in paper listing the specific dependent variables and time horizons used to measure novelty and impact.
The analysis draws on over one million publications from OpenAlex.
Descriptive statement in paper specifying dataset source (OpenAlex) and sample size of publications used for analysis.
This study uses panel data from 281 Chinese cities between 2005 and 2022, treats establishment of national GIPs as a quasi‑natural experiment, and applies a double machine learning approach.
Methods description in the paper explicitly states data coverage (281 Chinese cities, 2005–2022), research design (quasi‑natural experiment), and estimation strategy (double machine learning).
This study constructs a comprehensive evaluation system of urban ecological resilience from three dimensions: potential, elasticity, and stability.
Methodological description in the paper: authors state they constructed a composite resilience evaluation system composed of three specified dimensions for prefecture-level cities.
Two-stage field experiments in healthcare prescription messaging encompassed 693,139 patient visits in total.
Paper statement of total sample size across Stage 1 and Stage 2.
Stage 2 (Tool-Augmented Agentic AI) autonomously extracted principles from Stage 1 data and generated 17 new message variants tested on 248,448 patient visits.
Study design and reported results from Stage 2 of the two-stage field experiment described in the paper.
Stage 1 (Human + Chatbot) produced 13 message variants and was tested on 444,691 patient visits.
Study design details reported in the paper describing the two-stage field experiment.
The empirical analysis is based on panel data of new energy vehicle firms in the Yangtze River Delta from 2001 to 2023.
Dataset description provided in the paper's abstract/introduction indicating the time span and regional coverage.
R&D expenditure does not constitute a significant mediating channel between artificial intelligence and firms' new quality productive forces.
Mediation analysis using the panel data and constructed indicators; reported nonsignificant mediation effect of R&D expenditure (no sample size or statistics reported in excerpt).
The study uses listed companies in China's manufacturing industry from 2010 to 2023 as the research sample.
Authors explicitly state the empirical sample: listed manufacturing firms in China covering 2010–2023.
The positive relationship between BDTA and CEE remains robust after a series of robustness tests and endogeneity tests.
Authors state they conducted robustness checks and endogeneity tests (unspecified in the summary) and report that the main regression results remain robust.
Brain privacy has both personal and social attributes; its protection therefore implicates individual interests and technological development.
Normative/legal argumentation and conceptual analysis presented in the paper (no empirical data reported).
The study employs a comparative mixed-methods approach (comparative institutional analysis) of leading financial systems in China, the United States, and the United Kingdom (2022–2025), integrating secondary quantitative indicators with qualitative documentary evidence.
Direct methodological statement in the abstract describing the study design and data sources.
We identify four archetypes (data orchestrators, aggregators, niche specialists, and cloud orchestrators).
Paper states it develops a taxonomy and explicitly lists four archetypes; based on the taxonomy development and conceptual classification reported in the paper (no sample size or quantitative empirical test reported in abstract).
The model introduces the 'Sciencepreneur' as the central human archetype in agentic R&D.
Conceptual/design claim within the HARMONY artifact presented in the paper.
Evidence also includes pattern matching with documented agentic R&D deployments.
Methodological statement in the paper claiming pattern matching with documented agentic R&D deployments (unspecified number/source).
The study includes a foresight scenario analysis projecting four plausible 2040 R&D futures to stress-test design choices.
Methodological statement in the paper describing a four-scenario foresight analysis.
Empirical evidence for the design is triangulated from four semi-structured expert interviews with senior R&D leaders across industrial, healthcare, and academic settings.
Methodological statement in the paper specifying four semi-structured expert interviews.
The paper proposes a policy framework consisting of six groups of solutions for Vietnam to both promote AI development and control risks in the digital age.
Declared in abstract: the paper presents a six-group policy framework for Vietnam; the framework itself is the paper's output (proposal), not empirically tested in the paper.
This study employs document synthesis and comparative analysis of international policies.
Methodological statement in the paper abstract describing the research approach; no sample size specified beyond document sources.
The rise of artificial intelligence (AI) is shaping a new Agent Economy (AE), in which autonomous AI agents represent humans in performing a wide range of complex tasks.
Statement in paper abstract/intro (conceptual definition); no empirical data or sample reported.
Identification limits prevent a strict causal claim; the paper outlines an agenda for cleaner tests.
Authors' explicit caveat in the abstract noting limits to identification and stating they outline future cleaner tests.
The analysis exploits the staggered rollout of Claude Code across GitHub between May 2025 and January 2026, using a panel of 5,838 developers observed monthly over 28 months, with treatment defined by a developer's first Claude-co-authored commit and not-yet-treated developers as controls, and estimates obtained via the doubly robust Callaway and Sant'Anna (2021) estimator.
Methods and data description as stated in the abstract: staggered rollout timing, sample size (5,838), observation window (28 months), treatment definition (first Claude-co-authored commit), estimator (Callaway & Sant'Anna 2021).
Results are robust to two stricter activity filters.
Robustness checks reported in the paper applying two stricter activity filters to the sample; claim refers to consistency of estimated effects under these alternate sample definitions.
The actual water footprint of a specific load varies dynamically with generation dispatch and network conditions.
Conceptual claim presented in the paper motivating the need for dynamic attribution (discussion/analysis rather than a reported empirical sample).
Water withdrawals associated with electricity consumption occur at generation sites and are virtually allocated to demand based on network power flows.
Conceptual statement about how water withdrawals are attributed to loads via network power flow accounting (methodological description in paper).
The system was evaluated on a real 64-GPU A100 testbed emulating three wind-powered sites with Azure production traces.
Experimental evaluation described in abstract: 64-GPU A100 testbed, emulation of three sites, use of Azure production traces.
The paper examines the legal implications of overusing export controls.
Statement of the paper's analytic scope and structure (description of content).
AI capability is conceptualized/measured as having sub-dimensions including technical infrastructure and management.
Measurement/model description in paper: AI capability broken into sub-dimensions (technical infrastructure, management); supported by survey instrument and measurement model using PLS-SEM on 251 firms.
The mixed-method approach, combining partial least squares–structural equation modeling (PLS-SEM) and fuzzy-set qualitative comparative analysis (fsQCA), was used for analyzing the survey data of 251 firms.
Methods statement in paper: authors report using a mixed-method approach (PLS-SEM and fsQCA) on survey data; sample size explicitly stated as 251 firms.
The paper identifies five major research gaps and proposes future research directions in intelligent international marketing.
Author-reported outcome of the paper's systematic review and content analysis (2010–2025); descriptive claim about the paper's contributions.
Using a qualitative approach with 17 expert interviews from employees at startups.
Methods statement in paper specifying qualitative study design and sample size of 17 interviews.
Process-related insights into how GenAI transforms startups are limited.
Authors' literature positioning / gap statement in paper (no empirical metric provided).
Agentic payments are distinct from traditional automated systems because they emphasise autonomy, contextual reasoning and adaptability.
Conceptual distinction asserted in the abstract (comparative analysis between agentic payments and traditional automated systems).
The paper examines operational logic, defining features and emerging use cases of agentic payments across retail, e-commerce and decentralised finance.
Stated scope in the abstract; analysis and case-study-driven review across specified sectors (retail, e-commerce, DeFi). No sample sizes reported.