Evidence (13870 claims)
Adoption
8467 claims
Productivity
7558 claims
Governance
6805 claims
Human-AI Collaboration
6363 claims
Org Design
4132 claims
Innovation
4065 claims
Labor Markets
3526 claims
Skills & Training
2945 claims
Inequality
2066 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 749 | 196 | 98 | 892 | 1984 |
| Governance & Regulation | 817 | 394 | 188 | 121 | 1544 |
| Organizational Efficiency | 771 | 189 | 124 | 83 | 1177 |
| Technology Adoption Rate | 627 | 233 | 123 | 96 | 1088 |
| Research Productivity | 411 | 123 | 56 | 332 | 933 |
| Output Quality | 467 | 178 | 59 | 47 | 751 |
| Decision Quality | 320 | 174 | 75 | 42 | 618 |
| Firm Productivity | 435 | 55 | 88 | 20 | 604 |
| AI Safety & Ethics | 214 | 276 | 65 | 33 | 593 |
| Market Structure | 178 | 167 | 122 | 24 | 496 |
| Task Allocation | 207 | 64 | 71 | 32 | 379 |
| Skill Acquisition | 165 | 59 | 60 | 17 | 301 |
| Innovation Output | 203 | 27 | 43 | 18 | 292 |
| Employment Level | 105 | 52 | 107 | 13 | 279 |
| Fiscal & Macroeconomic | 131 | 69 | 43 | 26 | 276 |
| Consumer Welfare | 116 | 63 | 42 | 11 | 232 |
| Firm Revenue | 150 | 48 | 26 | 3 | 227 |
| Inequality Measures | 44 | 122 | 49 | 6 | 221 |
| Task Completion Time | 169 | 29 | 8 | 12 | 219 |
| Worker Satisfaction | 89 | 63 | 20 | 12 | 184 |
| Error Rate | 69 | 92 | 10 | 2 | 173 |
| Regulatory Compliance | 76 | 68 | 14 | 5 | 163 |
| Training Effectiveness | 93 | 21 | 13 | 19 | 148 |
| Wages & Compensation | 77 | 36 | 25 | 6 | 144 |
| Automation Exposure | 51 | 54 | 22 | 12 | 142 |
| Team Performance | 86 | 17 | 27 | 9 | 140 |
| Developer Productivity | 94 | 17 | 14 | 6 | 132 |
| Job Displacement | 12 | 80 | 20 | 1 | 113 |
| Hiring & Recruitment | 51 | 7 | 8 | 3 | 69 |
| Creative Output | 31 | 17 | 7 | 3 | 59 |
| Skill Obsolescence | 5 | 46 | 6 | 1 | 58 |
| Social Protection | 27 | 16 | 8 | 2 | 53 |
| Labor Share of Income | 17 | 17 | 17 | — | 51 |
| Worker Turnover | 11 | 12 | — | 3 | 26 |
| Industry | — | — | — | 1 | 1 |
Together, these measures can properly establish a behavioral‑regulation model for brain‑privacy protection.
Concluding synthesis in the paper arguing that combined measures would yield the proposed regulatory model (normative conclusion without empirical validation).
Implement a 'pre‑market regulatory sandbox + post‑market tracking' regime to manage product risks.
Prescriptive policy design proposed in the paper (conceptual recommendation; no empirical pilot data reported).
Establish a compliance filing‑review mechanism for BCI privacy policies.
Policy recommendation in the paper proposing a procedural compliance mechanism (normative proposal without empirical testing).
Apply the principles of lawfulness, legitimacy, necessity and good‑faith to all brain‑privacy processing.
Policy recommendation formulated in the paper (prescriptive legal proposal; no empirical evaluation included).
A behavioral‑regulation model better reflects the multi‑interest, non‑exclusive nature of brain privacy and balances risk control with innovation.
Normative policy argument and conceptual comparison of regulatory models presented in the paper (theoretical, not empirically tested).
The machines are increasingly becoming competent.
Authorial assertion about the trend in AI capability (no metrics or studies provided in the excerpt).
The concept of co-intelligence describes a new cognitive ecology where the human and artificial minds mutually influence one another to come up with ways of comprehending, creating and making choices that neither of them could accomplish individually.
Conceptual claim attributed to Ethan Mollick (2024) and extended by the author — described conceptually rather than demonstrated empirically in the excerpt.
None of the past technologies have spread into so many aspects of human life, so fast.
Author's comparative assertion about the speed and breadth of AI diffusion relative to prior technologies (no empirical comparison provided in the excerpt).
Artificial intelligence has become a partner in our everyday activities: it dictates our emails, diagnoses our diseases, educates our young children, controls our budgets, creates our artworks, and influences the policies made by governments and corporations.
Authorial assertion listing domains of current AI use (no empirical study or quantified data provided in the excerpt).
The internet had to cope with more or less a decade before it could reach one billion users; social media did it in half times.
Comparative historical adoption claim presented by the author (no citation or empirical method given in the excerpt).
Less than a year after its debut, hundreds of millions of individuals on all seven continents were using large language models, in virtually every field of professional activity, and in most languages.
Authorial assertion summarizing global LLM adoption (no specific study, dataset, or methodology provided in the excerpt).
There were now a hundred million ChatGPT users in two months.
Authorial assertion in the text citing a user-count milestone for ChatGPT (no study or data source provided in the excerpt).
This provocation introduces fiduciary design as a guiding principle and argues that conversational AI trust and accountability could be unified into a single design and legal paradigm.
Proposal/argument presented in the paper (conceptual design + legal framing); no empirical evaluation or implementation data provided in the excerpt.
When a client hires a personal lawyer, undergoes surgery, or receives advice from an investment manager, the expert they consult often has a fiduciary duty to act in their client's best interests; conversational agents should be held to a similar standard.
Analogy to existing professional fiduciary duties used as the core normative argument in the paper; no empirical testing of legal applicability reported in the excerpt.
Conversational AI agents, designed to feel and interact anthropomorphically with human users, must be held to a standard of care commensurate with their capabilities and access.
Normative assertion/proposal laid out in the paper (argumentative reasoning); no empirical test or legal analysis with sample size provided in the excerpt.
Conversational agents are increasingly integrated into the most private and intimate aspects of users' lives, from discussions of mental health to financial decisions.
Asserted as descriptive background in the paper (position/argumentative claim); examples provided (mental health, financial decisions); no empirical study or sample size reported in the excerpt.
The scientific results converged in both runs.
Paper statement reporting that the scientific results from both agents converged across the two experimental runs (descriptive outcome of the runs).
SLMs can effectively benefit from LLM assistance.
Empirical finding stated in the abstract based on experiments with hybrid MAS architectures where SLMs were assisted by cloud LLMs.
We adapt two representative MAS architectures to support hybrid inference and study how individual design choices shift the operating point along the Pareto frontier of power, cost, and performance.
Methodological statement from the paper: the authors implemented/adapted two MAS architectures and performed empirical evaluation to map Pareto trade-offs.
Hybrid multi-agent systems (MASs) combining on-device and cloud models offer a promising middle ground between LLMs and SLMs.
Framed as a motivation in the paper; the authors adapt two MAS architectures and study hybrids empirically to support this position.
Small language models (SLMs) are more cost-efficient and amenable to on-device inference.
Background claim in the paper's introduction/abstract; motivated by literature and the paper's focus on on-device SLMs.
Frontier large language models (LLMs), typically hosted in the cloud, offer strong performance across a wide range of tasks at substantially high cost.
Background claim in the paper's introduction/abstract; supported by literature context and framing rather than a specific experiment in this paper.
Audit outputs can be leveraged as red-teaming inputs to stress-test fairness robustness and strengthen AI governance through improved data quality and oversight (proposed intervention).
Proposed methodological/policy recommendation in the paper (proposal, not evaluated empirically in the excerpt).
AI-enabled hiring systems are widely adopted.
Statement in paper (background claim); no empirical sample or citation provided in the excerpt.
The study advances an integrative framework of sustainable AI governance emphasizing regulatory adaptability, institutional coordination, and ethical oversight as mechanisms for aligning AI innovation with long-term financial stability and sustainability objectives, and offers policy-relevant guidance for regulators and financial institutions.
Study conclusion reported in the abstract describing the proposed integrative framework and its policy relevance; based on the study's comparative mixed-methods analysis.
AI-enabled financial innovation is associated with improvements in risk assessment capabilities.
Comparative institutional analysis and integration of secondary quantitative indicators with qualitative documentary evidence across China, the United States, and the United Kingdom (2022–2025) as described in the abstract.
AI-enabled financial innovation is associated with improvements in ESG integration.
Same comparative mixed-methods approach across China, the United States, and the United Kingdom (2022–2025) using secondary quantitative indicators and qualitative documentary evidence, reported in the abstract.
AI-enabled financial innovation is associated with improvements in market efficiency.
Comparative mixed-methods analysis (comparative institutional analysis) across leading financial systems in China, the United States, and the United Kingdom (2022–2025), integrating secondary quantitative indicators with qualitative documentary evidence as reported in the study abstract.
Managers can make both (Agentic Technical Debt and Stochastic Tax) visible through lightweight dashboards and governance controls.
Prescriptive/recommendation in the paper; authors state they 'outline' approaches for managers to surface these concepts using dashboards and governance controls. No empirical evaluation or case study evidence reported in the provided excerpt.
Category leaders are persona-resistant (~80% same-brand consistency across personas).
Measured same-brand consistency across personas in audit; reported approximate consistency level for category-leading brands.
Clustered 95% CIs exclude zero on all three measured cells (the sonnet cell's CI rests on only 4 prompt clusters and is correspondingly wider).
Reported clustered 95% confidence intervals for the three measured model/prompt cells; note about sonnet cell having only 4 prompt clusters (hence wider CI).
Using the three metrics (data product adoption, time-to-find, time-to-insight) ties platform success to measurable business value rather than internal activity.
Argument in the paper about metric selection and their role in assessing platform success (methodological rationale).
A staged framework that shifts ownership from hub to spokes avoids both centralized bottlenecks and uncoordinated decentralization.
Organizational/process recommendation presented in the paper as a way to manage decentralization (design rationale).
Natural-language conversational interfaces democratize access for business users and expose historically underutilized enterprise data.
Proposed UX/interaction benefit asserted in the paper (design claim; no empirical measurement reported in the excerpt).
Large language models (LLMs) that automate governance tasks also lower the barrier for domain practitioners to develop genuine cross-functional expertise spanning business and data engineering, enabling spoke teams to take on greater end-to-end ownership without proportionally increasing their dependence on the hub.
Argument in the paper linking AI/LLM capabilities to skill enablement and reduced hub dependence (conceptual claim; no empirical results in the excerpt).
Domain spokes own business semantics, product backlogs, and local iteration cadence, progressively assuming greater responsibility as they mature (shifting operational ownership outward over time).
Architectural/organizational design element described in the paper (procedural proposal for staged ownership transfer).
A central hub (Center of Excellence) can provide shared platform services, policy automation, and AI-enabled governance that automatically standardizes data products, generates quality rules, drafts data contracts, and reviews changes for regressions.
Functional capabilities described in the proposed architecture; presented as what the hub component will provide (design/specification).
An AI-augmented hub-and-spoke model layered on a modern lakehouse architecture can relax the flexibility-versus-control trade-off inherent in enterprise data platforms.
Proposed architectural solution and theoretical argument in the paper (design proposal; no reported experimental/field results provided in the text excerpt).
Affordance actualization (i.e., the realization of GenAI affordances) can shift strategic choices between replacement and retainment of target systems.
Theoretical contribution supported by empirical illustration from two consecutive acquisitions of the same target in the authors' case study (qualitative evidence).
GenAI reconfigures perceived knowledge challenges, alters integration logics, and expands feasible paths for value capture in M&A IS integration decisions.
Synthesis claim based on the paper's two-case comparative study and theoretical framing using the knowledge-based view and technology affordance lens (qualitative, interpretive evidence).
GenAI affordances reduced prior assumptions about system intransparency, personnel dependence, and conversion costs during IS integration.
Authors' analysis of the comparative case evidence showing changed perceptions and lowered barriers in the second acquisition after GenAI affordance discovery (qualitative evidence).
LLM-supported affordances, such as learning system knowledge through chat, increased knowledge transferability, knowledge aggregation, and efficiency.
Observed and interpreted affordance actualization in the second acquisition within the paper's qualitative case study (authors report that LLM/chat features enabled these improvements).
In the second acquisition the acquirer adopted a 'retain-and-revive' approach for the same target, enabled by newly discovered GenAI affordances.
Empirical observation from the paper's comparative case study of two consecutive acquisitions of the same digital target (qualitative case evidence showing contrasting integration choices across the two acquisitions).
Aggregators and niche specialists employ more open governance and sourcing logics that foster innovation, specialization, and ecosystem diversity.
Presented as a comparative finding from the taxonomy and qualitative examination of non-hyperscaler ML platform providers; supports drawn from conceptual analysis and examples in the paper rather than quantitative measures (no sample size reported in abstract).
These findings suggest a dynamically adaptive LLM-teacher collaboration as student proficiency increases.
Interpretive/recommendation claim in the abstract: authors conclude that collaboration should adapt dynamically with student proficiency based on observed efficacy and ceiling effects.
Both LLM and teacher are critical for student skill improvement.
Abstract statement reporting that both LLM and teacher contributions were important for skill improvement; supported by empirical analysis on the reported dataset (57,954 essays).
Teachers act as pedagogical gatekeepers and bridges to guarantee feedback quality.
Stated in the abstract that within the triadic system teachers ensure feedback quality, implying a complementary role confirmed by the authors' empirical analysis or system design.
The triadic collaboration system is efficacious in improving writing quality.
Empirical claim in the abstract supported by analysis of the large dataset (57,954 essays from 10,195 students across 120 schools over two years). The paper states findings confirm the system's efficacy in improving writing quality.
We introduce a multidimensional evaluation framework grounded in Systemic Functional Linguistics and the suggestion trajectory tracing pipeline.
Methodological contribution explicitly reported in the abstract: a new evaluation framework combining SFL and a suggestion trajectory tracing pipeline.
We developed a triadic collaboration system to support K-12 writing learning that coordinates LLMs, teachers, and students.
Methodological claim stated in the abstract that the authors designed and developed a triadic collaboration system for K-12 writing learning; presumably implemented and evaluated using the dataset.