Evidence (13827 claims)
Adoption
8454 claims
Productivity
7544 claims
Governance
6789 claims
Human-AI Collaboration
6327 claims
Org Design
4126 claims
Innovation
4058 claims
Labor Markets
3520 claims
Skills & Training
2924 claims
Inequality
2057 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 749 | 195 | 97 | 889 | 1979 |
| Governance & Regulation | 815 | 391 | 188 | 121 | 1539 |
| Organizational Efficiency | 771 | 189 | 124 | 83 | 1177 |
| Technology Adoption Rate | 624 | 233 | 123 | 96 | 1084 |
| Research Productivity | 410 | 121 | 56 | 331 | 929 |
| Output Quality | 466 | 177 | 59 | 47 | 749 |
| Decision Quality | 320 | 174 | 75 | 42 | 618 |
| Firm Productivity | 435 | 55 | 88 | 20 | 604 |
| AI Safety & Ethics | 214 | 276 | 65 | 33 | 593 |
| Market Structure | 178 | 166 | 122 | 24 | 495 |
| Task Allocation | 206 | 64 | 70 | 31 | 376 |
| Skill Acquisition | 165 | 57 | 60 | 17 | 299 |
| Innovation Output | 201 | 27 | 41 | 18 | 288 |
| Employment Level | 105 | 51 | 107 | 13 | 278 |
| Fiscal & Macroeconomic | 131 | 69 | 43 | 26 | 276 |
| Consumer Welfare | 116 | 63 | 42 | 11 | 232 |
| Firm Revenue | 149 | 46 | 26 | 3 | 224 |
| Inequality Measures | 44 | 122 | 49 | 6 | 221 |
| Task Completion Time | 169 | 29 | 8 | 12 | 219 |
| Worker Satisfaction | 89 | 61 | 20 | 12 | 182 |
| Error Rate | 69 | 91 | 10 | 2 | 172 |
| Regulatory Compliance | 76 | 68 | 14 | 5 | 163 |
| Training Effectiveness | 92 | 19 | 13 | 19 | 145 |
| Wages & Compensation | 77 | 36 | 25 | 6 | 144 |
| Automation Exposure | 51 | 54 | 22 | 12 | 142 |
| Team Performance | 86 | 17 | 27 | 9 | 140 |
| Developer Productivity | 94 | 17 | 14 | 6 | 132 |
| Job Displacement | 12 | 80 | 20 | 1 | 113 |
| Hiring & Recruitment | 51 | 7 | 8 | 3 | 69 |
| Skill Obsolescence | 5 | 45 | 6 | 1 | 57 |
| Creative Output | 31 | 16 | 7 | 2 | 57 |
| Social Protection | 27 | 16 | 8 | 2 | 53 |
| Labor Share of Income | 17 | 17 | 17 | — | 51 |
| Worker Turnover | 11 | 12 | — | 3 | 26 |
| Industry | — | — | — | 1 | 1 |
FLUID uses a late-fusion, ID-free design that injects slice-level and room-level LUCID as independent tokens, stabilized by a staged warmup under online incremental training.
Methodological/system design description in the paper specifying late-fusion ID-free architecture, token injection strategy, and staged warmup for online incremental training.
FLUID couples a cross-domain multimodal encoder, jointly trained on short videos and livestreams, to produce discrete hierarchical codes (LUCID).
Methodological description in paper: joint training on short videos and livestreams to generate discrete hierarchical codes named LUCID. This is a system/design claim from the methods section.
FLUID is the first framework to fully retire the candidate-side item ID from a production-scale livestreaming ranker.
Authors' claim of novelty and system capability; supported in-document by description of FLUID's ID-free design and production deployment note. No independent verification provided in the excerpt.
Live-agent performance depends on objective tracking, execution conversion, cost, and runtime reliability, supporting evaluation of LLMs as components in bounded workflows rather than as isolated benchmark respondents.
Synthesis of experimental results (cross-provider differences in end-to-end play, planner bakeoff, and trace analyses) that link specific mechanisms (objective tracking, execution conversion, cost, runtime reliability) to performance.
In a replicated 32-game cross-provider championship under frozen rules, gemini-3.1-pro-preview won 20 of 32 games against gpt-5.1, claude-opus-4-7, and kimi-k2.6, and the pooled winner distribution differs strongly from an equal-strength null (p approx 1.5 x 10^-5).
Empirical tournament experiment: 32 games played under frozen rules across four provider models; reported win counts and a statistical test vs an equal-strength null yielding p ≈ 1.5×10^-5.
Reskilling policy should emphasize portfolio breadth and portable competency frameworks rather than deeper single-track specialization, particularly for workers in small, lower-threshold firms.
Policy recommendation in abstract based on empirical findings about skill-demand shifts and heterogeneity across firm types.
Augmentation exposure is positively associated with the nonroutine analytical skill share.
Empirical result stated in abstract: positive association between augmentation exposure and nonroutine analytical share, using the authors' augmentation measure and within-firm identification.
The deep integration of the digital and real economies and the accumulation of human capital are fundamental drivers of sound and rapid development of the overall economy.
Theoretical framing and empirical emphasis in the paper asserting the importance of human capital (digital talent) and digital-real economy integration for economic growth; supported by the paper’s cross-regional empirical analysis linking digitalization and talent to growth outcomes.
Digital talent agglomeration and industrial digitalization are important drivers of regional economic growth.
Overall empirical results from cross-provincial/regional analysis in China reported in the paper, which link measures of digital talent concentration and industrial digitalization to regional economic growth outcomes.
In the Yangtze River Delta region, digital talent agglomeration and industrial digitalization have achieved a positive and interactive relation that promotes regional economic growth.
Regional-case empirical analysis focused on the Yangtze River Delta showing a positive interaction between talent agglomeration and industrial digitalization associated with higher regional economic growth (reported in the paper). Specific sample size for the region is not stated in the excerpt.
Focusing on observation instead of prediction, and governance rather than control, complements existing alignment and safety practices while preserving human judgment, institutional choice, and long-term wellbeing.
Normative argument presented in the paper linking observational monitoring to governance objectives; no empirical evaluation provided.
Interpretable, aggregate behavioral signals (as described) support human-in-the-loop interpretation and enable earlier awareness of when AI use patterns may be drifting from creative augmentation toward automation pressure, authority substitution, or unintended displacement of human agency.
Conceptual claim about intended use of monitoring signals; no empirical test or sample presented.
A system-level framework for externalized behavioral monitoring should treat generative AI systems as participants in socio-technical ecosystems rather than static tools, emphasizing interpretable, aggregate behavioral signals such as shifts in output velocity, semantic and structural reuse, persistence of synthetic roles, and cross-context propagation.
Proposed conceptual framework and list of candidate behavioral signals in the paper (design/specification, no empirical validation).
Post-deployment observability is a foundation for well-being-aligned human–AI co-evolution.
Conceptual argument and system-level framework presented in the paper (no empirical study or sample reported).
The findings carry significant implications for entrepreneurs, policymakers, and educators seeking to leverage AI as a driver of inclusive and sustainable entrepreneurial success in urban India.
Authors' stated implications in the discussion and conclusion sections, derived from thematic findings across the 16 interviews.
An entrepreneur's mindset—specifically cognitive openness, risk tolerance, and iterative experimentation—is the strongest predictor of successful AI adoption outcomes, superseding firm size, sector, and financial capacity.
Cross-cutting finding from thematic analysis of the 16 interview transcripts indicating recurring emphasis on mindset attributes as drivers of successful adoption; comparative qualitative assessment across interviewees suggested these factors mattered more than firm size, sector, or finances.
Overall, AI adoption produces measurable benefits in operational efficiency, strategic decision-making, and customer personalisation among the entrepreneurs studied.
Synthesis of interview findings/themes from the 16-case qualitative study; authors state AI adoption 'produces measurable benefits' across these domains based on participant reports.
AI acts as a competitive equaliser among entrepreneurs in Delhi/NCR.
Theme 'AI as a Competitive Equaliser' produced by thematic analysis of the 16 interviews; participants reported that AI lowered barriers and allowed smaller firms to compete more effectively.
AI adoption transforms customer experience by enabling greater personalisation.
Theme 'Customer Experience Transformation' from thematic analysis of interviews (n=16); entrepreneurs described AI-driven personalisation and improved customer interactions.
AI adoption improves strategic decision-making and market intelligence among entrepreneurs.
One of five thematic findings ('AI-Enabled Decision Making and Market Intelligence') derived from thematic analysis of 16 interviews; participants reported using AI for market insights and better decisions.
AI functions as an operational accelerator for entrepreneurs, producing benefits in operational efficiency.
Thematic analysis of interview data (n=16) generated a theme labelled 'AI as an Operational Accelerator' reporting interviewee accounts of operational efficiency gains.
This study integrates observed GenAI uses into a coherent, processual view of growth hacking by developing first-order concepts, second-order themes and three aggregate dimensions mapped onto a seven-stage growth pipeline.
Methodological claim supported by the study's adopted approach: Gioia methodology applied to 17 semi-structured interviews with founders/growth leaders (nine startups), plus secondary sources.
Generative AI reallocates human attention from asset production to problem framing, inference quality and organizational learning across the seven stages of the growth pipeline.
Interview-derived themes (17 interviews across nine startups) and process mapping of GenAI uses onto the seven-stage growth pipeline.
Generative AI acts as a data orchestrator that automates cleaning, cohorting, variance checks and knowledge capture, tightening feedback loops and institutionalizing learning.
Findings derived from 17 semi-structured interviews with founders and growth leaders across nine startups, supported by secondary sources and Gioia-style thematic analysis.
Generative AI serves as a cognitive sparring partner that reduces bounded rationality and groupthink via premortems, counter-arguments and stakeholder role-plays while preserving human judgment.
Same qualitative data set of 17 interviews across nine startups, with Gioia-method coding producing first-order concepts and themes describing AI-mediated decision practices.
Generative AI functions as an experimentation accelerator, lowering the marginal cost of variation and compressing the idea-to-test cycle, enabling parallel selections of controlled tests.
Exploratory multiple-case qualitative study using 17 semi-structured interviews with founders and growth leaders across nine startups, plus secondary sources; analysis via the Gioia methodology to derive themes mapped onto a seven-stage growth pipeline.
Findings extend digital transformation theory by showing that GenAI moves organizing from human-driven adaptation toward technology-embedded reconfiguration.
Authors' theoretical interpretation linking empirical findings from 17 interviews to broader digital transformation theory.
The paper conceptualizes 'AI-augmented orchestration', where human and algorithmic actors jointly configure work and value creation.
Theoretical contribution / conceptualization derived from analysis of interview data and authors' synthesis.
The study links GenAI-driven organizational changes to four value dimensions: operational, structural, innovation, and market value.
Authors' analytical framework developed from interview data (17 interviews) mapping changes to four value dimensions.
Startups integrate GenAI not as a peripheral tool but as a structural collaborator.
Interpretive finding from interviews and authors' theorization based on the dataset (17 interviews).
Generative AI (GenAI) is influencing how startups form, operate, and create value.
Statement in paper's introduction/abstract; supported by the study's framing and qualitative interview data (17 expert interviews).
Large-scale validation in a production code completion environment shows Echo increased the acceptance rate from 25.7% to 35.7%.
Reported result from 'large-scale validation' in a production code completion environment as stated in the paper's abstract; no sample size, statistical tests, or additional experimental details provided in the excerpt.
User-driven refinement sequences distill agents' flawed proposals into high-quality training signals.
Conceptual/empirical claim in the paper that user refinements produce verified solutions which serve as high-quality signals; supported by the paper's later validation claim but no separate sample size or statistical detail provided in the excerpt.
Echo is a generalized framework that operationalizes the transition from raw experience to learnable knowledge by echoing environmental feedback into the training loop for model optimization.
Methodological contribution described by the authors (framework description); no implementation details or quantitative validation given in the excerpt besides later mention of validation.
Widespread deployment of AI agents provides low-cost access to massive streams of real-world experience data.
Stated observation in the paper; no quantitative deployment statistics or sample sizes provided in the excerpt.
Continuous learning from 'experience data' (interactions between agents and their environments) promises to transcend the scalability and knowledge limitations of static human data.
Conceptual claim in the paper proposing continuous learning from experience data as a solution; no empirical details provided in the excerpt.
The ML community should adopt PBOS as its default contract for such collaborations.
Normative recommendation by the authors; presented as a conclusion/proposal rather than empirically validated policy.
The boundary could not have been drawn correctly without scientists at the negotiating table.
Normative/analytic claim offered by the authors asserting the necessity of scientist involvement in contract design; no empirical evidence provided in the excerpt.
This boundary (pre-training open / post-training proprietary) is technically meaningful, legally clean, and auditable.
Claim about the properties of the PBOS boundary presented as an argument/claim in the paper; no empirical/legal audit data provided in the excerpt.
PBOS: pre-training artifacts (architectures, training code, benchmarks, untrained weights) are open science; post-training artifacts (weights trained on proprietary data) are business IP.
Proposed contract template/definition presented in the paper (prescriptive/design proposal); no empirical validation reported in the provided text.
There is a session-level carryover effect: a participant's prior AI use leads to further AI adoption and entrenches their miscalibration about time savings.
Observed analyses across sessions in the three pre-registered user studies (combined N = 2691) showing that prior within-session AI use predicts subsequent AI adoption and stronger miscalibration.
People display 'efficiency-gain illusions': they overestimate how much time and effort savings AI use provides.
Same three pre-registered user studies (combined N = 2691) that measured participants' perceived time/effort savings from AI versus actual measured time/effort.
People frequently choose to use AI even when doing so is inefficient (i.e., provides no meaningful time or effort savings).
Three pre-registered user studies reported in the paper (combined N = 2691) measuring participants' choices to use AI on cognitively simple tasks and comparing those choices to measured time/effort savings.
Future research should focus on empirical assessments of the economic ramifications of artificial intelligence, particularly regarding productivity enhancement, labour market restructuring, and equitable income distribution.
Recommendation in the paper's discussion/conclusion based on identified gaps and the theoretical model (no empirical study presented to support specific magnitudes).
Regulatory bodies should ensure access to data, support platform markets, and promote that artificial intelligence redistributes wealth among the owners of capital, data and labour.
Normative recommendation grounded in the paper's theoretical-legal model and comparative policy discussion (method: deductive/inductive reasoning; no empirical intervention or evaluation).
The European Union has established a comprehensive legal and regulatory framework for the digital economy and artificial intelligence, including rules on platform usage, digital goods liability, data protection (GDPR), and AI.
Comparative legal review of EU regulations and statutes described in the paper (method: comparative approach).
The rise of digital technologies and artificial intelligence will dramatically improve the way existing economic systems function.
Theoretical synthesis and comparative legal analysis presented in the paper; no empirical data or sample reported (methodology: inductive and deductive reasoning, comparative approach).
Empirically stable pricing near the Nash Bargaining benchmark is observed in testing.
Reported empirical observation from experiments across varying population sizes and a 30-day horizon (abstract statement).
Testing across 6–100 agents over a 30-day horizon confirms scalability across population size.
Reported experimental sweep over agent population sizes from 6 to 100 across a 30-day horizon (as stated in abstract).
Nash-guided price proximity rewards align agent learning toward bargaining-optimal strategies.
Algorithmic design claim from the paper: inclusion of a Nash-guided price-proximity reward to shape agent learning (abstract statement).