Evidence (4560 claims)
Adoption
5267 claims
Productivity
4560 claims
Governance
4137 claims
Human-AI Collaboration
3103 claims
Labor Markets
2506 claims
Innovation
2354 claims
Org Design
2340 claims
Skills & Training
1945 claims
Inequality
1322 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 378 | 106 | 59 | 455 | 1007 |
| Governance & Regulation | 379 | 176 | 116 | 58 | 739 |
| Research Productivity | 240 | 96 | 34 | 294 | 668 |
| Organizational Efficiency | 370 | 82 | 63 | 35 | 553 |
| Technology Adoption Rate | 296 | 118 | 66 | 29 | 513 |
| Firm Productivity | 277 | 34 | 68 | 10 | 394 |
| AI Safety & Ethics | 117 | 177 | 44 | 24 | 364 |
| Output Quality | 244 | 61 | 23 | 26 | 354 |
| Market Structure | 107 | 123 | 85 | 14 | 334 |
| Decision Quality | 168 | 74 | 37 | 19 | 301 |
| Fiscal & Macroeconomic | 75 | 52 | 32 | 21 | 187 |
| Employment Level | 70 | 32 | 74 | 8 | 186 |
| Skill Acquisition | 89 | 32 | 39 | 9 | 169 |
| Firm Revenue | 96 | 34 | 22 | — | 152 |
| Innovation Output | 106 | 12 | 21 | 11 | 151 |
| Consumer Welfare | 70 | 30 | 37 | 7 | 144 |
| Regulatory Compliance | 52 | 61 | 13 | 3 | 129 |
| Inequality Measures | 24 | 68 | 31 | 4 | 127 |
| Task Allocation | 75 | 11 | 29 | 6 | 121 |
| Training Effectiveness | 55 | 12 | 12 | 16 | 96 |
| Error Rate | 42 | 48 | 6 | — | 96 |
| Worker Satisfaction | 45 | 32 | 11 | 6 | 94 |
| Task Completion Time | 78 | 5 | 4 | 2 | 89 |
| Wages & Compensation | 46 | 13 | 19 | 5 | 83 |
| Team Performance | 44 | 9 | 15 | 7 | 76 |
| Hiring & Recruitment | 39 | 4 | 6 | 3 | 52 |
| Automation Exposure | 18 | 17 | 9 | 5 | 50 |
| Job Displacement | 5 | 31 | 12 | — | 48 |
| Social Protection | 21 | 10 | 6 | 2 | 39 |
| Developer Productivity | 29 | 3 | 3 | 1 | 36 |
| Worker Turnover | 10 | 12 | — | 3 | 25 |
| Skill Obsolescence | 3 | 19 | 2 | — | 24 |
| Creative Output | 15 | 5 | 3 | 1 | 24 |
| Labor Share of Income | 10 | 4 | 9 | — | 23 |
Productivity
Remove filter
We created a 770-question multiple-choice benchmark dataset of difficult, but realistic questions that a caseworker might receive.
Paper reports creation of a benchmark dataset containing 770 multiple-choice questions described as difficult and realistic; questions and dataset construction described in methods (no sample-of-questions or external validation details provided in the excerpt).
The study's conclusions draw on three complementary evidence bases: (a) task-level evidence on what generative AI can already do in practice; (b) occupational exposure and complementarity analysis using Philippine labor force data; and (c) firm- and worker-level evidence on AI adoption.
Description of methods and data sources in the paper: task-level capability testing/assessment, analysis of national labor force/occupation data for exposure/complementarity, and firm/worker surveys or qualitative adoption evidence.
Extensive experiments were conducted using both synthetic and real hospital datasets to evaluate the framework.
Statement in the paper indicating experiments on synthetic and real datasets; exact sizes, sources, and composition of these datasets are not provided in the excerpt.
The paper explains the main legal frameworks that currently regulate AI in India, as well as proposals for future legislation.
Author's legal and policy analysis / document review of existing statutes and proposed laws (qualitative review). No quantitative sample size; based on review of legal texts and policy proposals cited in the article.
The machine-learning based analytical approach used in the study captures complex, nonlinear relationships among emotional, psychological and economic variables.
Methodological claim: authors used machine learning (including ensembles) to model nonlinear and complex relationships. The excerpt does not provide algorithmic details, tuning, validation strategy, or sample size.
Work environment and digital/AI intensity were incorporated as contextual moderators in the analysis to reflect contemporary labor market conditions.
Methodological description in the excerpt states these variables were included as moderators; no details on measurement, operationalization, or sample size are provided.
Most evidence came from retrospective studies or meta-analyses, with limited prospective or randomized controlled trials.
Summary of study designs across the 40 included studies as reported in the review.
The impact of AI on patient outcomes (e.g., mortality, rebleeding) was rarely addressed.
Statement in results indicating few included studies reported patient-centered outcomes such as mortality or rebleeding.
This systematic review adhered to PRISMA 2020 guidelines.
Methods statement in the paper specifying adherence to PRISMA 2020; the review included 40 studies.
The review focuses on AI applications within small‑scale business environments, with a special focus on women‑owned micro firms in Jaipur, India.
Scope and aim articulated in the paper; geographic and demographic focus explicitly stated by the authors.
The systematic review follows PRISMA 2020 guidelines.
Methodological statement in the paper indicating adherence to PRISMA 2020 for the review process.
After screening and eligibility filtering, 55 open‑access journal articles were included for in‑depth analysis.
PRISMA‑guided screening and eligibility process reported in the review; final included sample explicitly stated as 55 open‑access journal articles.
A Scopus search identified 265 records using keywords related to women’s entrepreneurship and AI.
Systematic literature search reported in the paper following PRISMA 2020; search executed in Scopus with specified keywords; initial yield stated as 265 records.
This study draws on a critical AI media literacy framework to analyze user-generated discussions in the two largest higher education subreddits on Reddit.com.
Author-reported study design: application of a critical AI media literacy theoretical framework to a qualitative dataset consisting of user-generated discussions from the two largest higher-education subreddits. (Sample size/number of posts/threads not specified in the provided excerpt.)
The study used a mixed-methods design incorporating surveys from 150 LEP immigrants, interviews with 50 employers, and interviews with 20 translation service providers in various linguistically diverse U.S. cities, with quantitative analysis performed in SPSS Version 28 and qualitative thematic coding in NVivo 14.
Reported study design and sample: survey n=150 LEP immigrants; employer interviews n=50; translation provider interviews n=20; analytic software specified as SPSS v28 (quantitative) and NVivo 14 (qualitative).
The framework was evaluated on 2,847 queries across 15 task categories.
Paper reports an evaluation dataset consisting of 2,847 queries spanning 15 task categories; used as the sample for reported empirical results.
Non-text processing paths use SLM-assisted modality decomposition.
Paper reports that non-text queries are decomposed using SLM-assisted modality decomposition; described as the non-text routing approach in the framework.
For text-only queries, the framework uses learned routing via RouteLLM.
Paper states text-only routing is handled by a learned model named RouteLLM; presented as part of the system architecture.
A central Supervisor dynamically decomposes user queries, delegates subtasks to modality-appropriate tools (e.g., object detection, OCR, speech transcription), and synthesizes results through adaptive routing strategies rather than predetermined decision trees.
Methodological description in the paper of a Supervisor component that performs dynamic decomposition, delegation to modality-appropriate tools (examples given), and adaptive routing; supported by the framework's implementation details.
We present an agentic AI framework for autonomous multimodal query processing that coordinates specialized tools across text, image, audio, video, and document modalities.
Paper describes the framework design and components (Supervisor, modality-specific tools) and states support for text, image, audio, video, and document modalities; no external benchmark cited for this capability beyond the paper's own implementation.
The study employs an input–output (I–O) modeling framework using IMPLAN 2022 data to estimate direct, indirect, and induced impacts of investments in greenhouse and robotics sectors for Northwest Indiana as part of Project TRAVERSE.
Explicit methodological statement in the paper: use of IMPLAN 2022 I–O model; geographic scope NWI; linkage to EDA Project TRAVERSE.
The essay reviews seven books from the past dozen years by social scientists examining the economic impact of artificial intelligence (AI).
Qualitative book-review performed by the author; sample size explicitly stated as seven books published within the last ~12 years; method = synthesis/assessment of those seven books.
This systematic review follows PRISMA guidelines to examine the evolution, advancements, and state-of-the-art AI applications for GS-BESS optimization.
Methodological statement in the paper indicating the use of PRISMA guidelines for the review process. The excerpt does not include the PRISMA flow diagram or the exact article selection numbers.
The experimental sample underlying the statistical tests comprised 20 observations (implied by ANOVA degrees of freedom: df between = 1, df within = 18).
Interpretation of the reported one-way ANOVA degrees of freedom (F(1,18) for multiple outcomes) indicating total N = 20 observations.
Field experiments at the Al‐Ra'id Research Station in Baghdad during the 2025 season compared conventional diesel‐based irrigation with AI‐assisted irrigation using soil moisture sensors, IoT controllers, and predictive weather algorithms.
Reported field experiment design in the paper (Al‐Ra'id Research Station, Baghdad, 2025 season) specifying two treatments: conventional diesel irrigation vs AI-assisted irrigation using soil moisture sensors, IoT controllers, and predictive weather algorithms.
Definitions and scopes of Material Passports vary among authors.
Content analysis of the 46 included studies showing differing definitions and scope treatments for MPs reported by the authors.
Among the included studies, 65% focused primarily on Material Passports (MPs), while 35% addressed MPs within the broader context of a circular economy (CE).
Quantitative categorization of the 46 included studies reported in the paper (percentages attributed to focus areas).
A total of 54 peer-reviewed articles and book chapters were screened from the Scopus database, of which 46 were included for in-depth analysis in April 2025.
Reported screening and inclusion counts from the Scopus search (54 screened, 46 included); date of in-depth analysis given as April 2025.
This article presents a Systematic Literature Review (SLR) following the PRISMA methodology.
Stated methodology in the paper: SLR using PRISMA; literature search performed in Scopus; review process and inclusion/exclusion described (screening and inclusion counts reported).
Future research could strengthen causal identification by exploiting exogenous policy shocks rather than relying solely on matching methods like PSM.
Authors' methodological suggestion for future work, based on limitations of current causal inference strategy (PSM and observational panel regression).
Propensity Score Matching (PSM) and other robustness checks were used to mitigate selection bias and support the causal interpretation of AI's effects.
Paper reports use of Propensity Score Matching in robustness analyses on the panel of A-share-listed design firms (2014–2023).
The paper operationalizes firm-level AI exposure by constructing an AI lexicon via natural language processing and applying text analysis to annual reports and patents to generate enterprise-level AI indicators.
Described methodology: NLP to generate an AI lexicon and text-analysis of annual reports and patents to build AI measures for each listed design enterprise in the 2014–2023 panel.
The paper presents relevant tradeoffs and design choices across human-LLM archetypes, including decision control, social hierarchies, cognitive forcing strategies, and information requirements.
Qualitative analysis and discussion in the paper synthesizing insights from the literature review and empirical evaluation. Method: thematic synthesis and design analysis. Sample size: based on the review of 113 papers and the clinical-case evaluation (details in full text).
We describe 17 human-LLM archetypes derived from a scoping literature review and thematic analysis of 113 LLM-supported decision-making papers.
Scoping literature review and thematic analysis method; corpus size = 113 LLM-supported decision-making papers (as reported in the paper).
The paper introduces the concept of human-LLM archetypes, defined as re-occurring socio-technical interaction patterns that structure the roles of humans and LLMs in collaborative decision-making.
Conceptual contribution presented in the paper (definition and framing). Method: theoretical/conceptual description in the manuscript. Sample size: not applicable.
Battery and motor performance were evaluated (in laboratory tests).
Laboratory tests assessing battery and motor performance are reported in the methods/results; no quantitative battery/motor metrics provided in the summary.
A composite index capturing concerns about mental health, privacy, climate impact, and labor market disruption was constructed to measure societal risk perceptions of AI.
Author-constructed composite index derived from survey items on mental health, privacy, climate, and labor market disruption concerns in the 2023–2024 UK survey.
The study treats AI-agent populations as a system in which four key variables governing collective behaviour can be independently toggled: nature (innate LLM diversity), nurture (individual reinforcement learning), culture (emergent tribe formation), and resource scarcity.
Study design described in the paper (experimental setup allowing independent manipulation of the four variables: model diversity, individual RL, emergent tribe formation, and resource scarcity).
The analysis is framed through the integrated lens of the Technology-Organization-Environment (TOE) framework and Institutional Theory to provide a multi-faceted understanding of adoption dynamics.
Stated theoretical framing and analytical approach in the study (methodological claim).
The research synthesizes evidence from a wide array of sources, including recent academic literature by Nigerian scholars, NPA official performance reports, policy documents, and international trade facilitation reports (e.g., UNCTAD).
Explicit description of data sources in the study methodology; method: secondary data synthesis (no sample size applicable).
This study investigates the current state of adoption, the prevailing barriers, and the resultant performance outcomes of digital and AI-driven logistics within Nigeria’s maritime supply chain.
Stated study aim and scope; method: rigorous secondary data analysis drawing on multiple documentary sources (Nigerian academic literature, NPA reports, policy documents, UNCTAD).
This study uses a conceptual and analytical approach to examine the impact of AI and automation on work.
Stated methodology in the paper's abstract/introduction: methodological description that the study is conceptual and analytical; no empirical sample or quantitative data reported.
The study uses a recently developed firm-year measure of investment in AI-related human capital, applied to a broad sample of U.S. nontechnology firms between 2010 and 2018.
Methodological statement in the abstract describing the independent variable and the sample years and population (U.S. nontechnology firms, 2010–2018).
The paper's findings are based on a combination of literature review, data analysis, and an empirical study involving HR professionals.
Methodological description given in the paper's summary (no further methodological details, sample size, instruments, or statistical methods provided in the summary).
The adoption and implementation of AI in entrepreneurial firms is an under-studied area of research.
Paper's literature review and motivation statement asserting limited empirical research on AI adoption in entrepreneurial contexts.
The study collected data from 207 entrepreneurial businesses (including SMEs, startups, and knowledge-based businesses) using a structured questionnaire and analyzed the data using Partial Least Squares Structural Equation Modeling (PLS-SEM) with SmartPLS 3.
Structured questionnaire administered to a sample of 207 entrepreneurial businesses; analysis conducted with PLS-SEM (SmartPLS 3) as reported in the paper.
The study analyzes the influence of artificial intelligence, financial technology, economic performance, monetary policy, financial development, and governance quality on the growth of G7 countries over 2000–2024 using the Method of Moments Quantile Regression (MMQR).
Statement in paper specifying use of Method of Moments Quantile Regression on G7 countries during 2000–2024. Implied panel sample: 7 countries × 25 years ≈ 175 country-year observations (if annual, balanced panel).
We conducted preregistered experiments in two tasks (a sentiment-analysis task and a geography-guessing task) to study whether user characteristics influence the effectiveness of AI explanations.
Preregistered experimental studies described in the paper; two distinct tasks (sentiment-analysis and geography-guessing). (Sample sizes and additional procedural details are not provided in the excerpt.)
The paper empirically analyzes the algorithm-automated versus human decision-making debate using the AST and STS theoretical lenses.
Theoretical analysis and empirical synthesis across the reviewed studies (n=85), explicitly stated use of AST and STS frameworks to interpret findings.
To address the duality of benefits and harms, the paper proposes a dynamic Human-in-the-Loop (HITL) model that reconciles algorithmic determinism with normative HRM demands.
Conceptual/theoretical contribution presented in the paper (proposed HITL model based on synthesis of findings and theory).