Evidence (4793 claims)
Adoption
5539 claims
Productivity
4793 claims
Governance
4333 claims
Human-AI Collaboration
3326 claims
Labor Markets
2657 claims
Innovation
2510 claims
Org Design
2469 claims
Skills & Training
2017 claims
Inequality
1378 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 402 | 112 | 67 | 480 | 1076 |
| Governance & Regulation | 402 | 192 | 122 | 62 | 790 |
| Research Productivity | 249 | 98 | 34 | 311 | 697 |
| Organizational Efficiency | 395 | 95 | 70 | 40 | 603 |
| Technology Adoption Rate | 321 | 126 | 73 | 39 | 564 |
| Firm Productivity | 306 | 39 | 70 | 12 | 432 |
| Output Quality | 256 | 66 | 25 | 28 | 375 |
| AI Safety & Ethics | 116 | 177 | 44 | 24 | 363 |
| Market Structure | 107 | 128 | 85 | 14 | 339 |
| Decision Quality | 177 | 76 | 38 | 20 | 315 |
| Fiscal & Macroeconomic | 89 | 58 | 33 | 22 | 209 |
| Employment Level | 77 | 34 | 80 | 9 | 202 |
| Skill Acquisition | 92 | 33 | 40 | 9 | 174 |
| Innovation Output | 120 | 12 | 23 | 12 | 168 |
| Firm Revenue | 98 | 34 | 22 | — | 154 |
| Consumer Welfare | 73 | 31 | 37 | 7 | 148 |
| Task Allocation | 84 | 16 | 33 | 7 | 140 |
| Inequality Measures | 25 | 77 | 32 | 5 | 139 |
| Regulatory Compliance | 54 | 63 | 13 | 3 | 133 |
| Error Rate | 44 | 51 | 6 | — | 101 |
| Task Completion Time | 88 | 5 | 4 | 3 | 100 |
| Training Effectiveness | 58 | 12 | 12 | 16 | 99 |
| Worker Satisfaction | 47 | 32 | 11 | 7 | 97 |
| Wages & Compensation | 53 | 15 | 20 | 5 | 93 |
| Team Performance | 47 | 12 | 15 | 7 | 82 |
| Automation Exposure | 24 | 22 | 9 | 6 | 62 |
| Job Displacement | 6 | 38 | 13 | — | 57 |
| Hiring & Recruitment | 41 | 4 | 6 | 3 | 54 |
| Developer Productivity | 34 | 4 | 3 | 1 | 42 |
| Social Protection | 22 | 10 | 6 | 2 | 40 |
| Creative Output | 16 | 7 | 5 | 1 | 29 |
| Labor Share of Income | 12 | 5 | 9 | — | 26 |
| Skill Obsolescence | 3 | 20 | 2 | — | 25 |
| Worker Turnover | 10 | 12 | — | 3 | 25 |
Productivity
Remove filter
Seed 2.0 Lite achieved 75.7% success rate with-skill, an increase of +18.9 percentage points over baseline.
Model-specific reported result in the paper: Seed 2.0 Lite with-skill success rate (75.7%) and reported improvement (+18.9pp); reported from the benchmark runs.
GLM-5 Turbo achieved 78.4% success rate with-skill, an increase of +5.4 percentage points over baseline.
Model-specific reported result in the paper: GLM-5 Turbo with-skill success rate (78.4%) and reported improvement (+5.4pp); based on the benchmark evaluation.
Nemotron 120B achieved 78.4% success rate with-skill, an increase of +18.9 percentage points over baseline.
Model-specific reported result in the paper: Nemotron 120B with-skill success rate (78.4%) and reported improvement (+18.9pp); results drawn from the benchmark runs.
MiniMax M2.5 achieved 81.1% success rate with-skill, an increase of +13.5 percentage points over baseline.
Model-specific reported result in the paper: MiniMax M2.5 with-skill success rate (81.1%) and reported improvement (+13.5pp); based on subset of the 185 scenario-runs across the evaluated models.
Results across 5 open-weight model conditions and 185 scenario-runs show consistent skill lift across all models.
Aggregate experimental results reported in the paper: evaluation over 5 model conditions and 185 scenario-runs, with cross-model improvement when SKILL is provided.
AI-adopting firms increase R&D expenditures following adoption.
Firm financial data showing higher R&D spending for adopters relative to nonadopters in post-adoption periods using the diff-in-diff framework.
Post-adoption patents by AI adopters receive more citations than those of nonadopters.
Difference-in-differences estimates comparing citation counts per patent before and after AI installation versus nonadopters; patent citation data used as the dependent variable.
Firms that adopt AI subsequently increase patenting relative to nonadopters.
Firm-level analysis using a novel AI adoption measure based on timing of AI product installations and a stacked difference-in-differences design exploiting staggered adoption; dependent variable = firm patent counts (patenting rate). (Sample size and exact time period not specified in the provided text.)
Using distributed systems as a principled foundation is a useful approach for creating and evaluating LLM teams.
Primary methodological proposal of the paper; supported by conceptual argument and (per the paper) mappings between distributed-systems concepts and LLM team design (specific experimental validation not detailed in the excerpt).
Large language models (LLMs) are growing increasingly capable.
Statement in the paper's introduction/abstract summarizing the field; based on observed progress in LLM development cited by the authors (no experimental sample size provided in the excerpt).
Only seven specialized skills produce meaningful gains (up to +30%).
Empirical results showing that 7 out of 49 skills yielded meaningful positive improvements in acceptance-test pass rates, with gains up to 30%.
The average gain from injecting skills is only +1.2% in pass rate.
Aggregated pass-rate differences computed across the benchmark tasks comparing with-skill vs without-skill conditions, reported as an average +1.2% gain.
Analysis of benchmark data (n = 667) reveals substantial synergy effects: Llama-3.1-8B improves human performance by 23 percentage points.
Empirical analysis of the same benchmark dataset (n = 667) using the Bayesian IRT model; reported improvement in human performance with Llama-3.1-8B assistance of +23 percentage points.
Analysis of benchmark data (n = 667) reveals substantial synergy effects: GPT-4o improves human performance by 29 percentage points.
Empirical analysis of a benchmark dataset of n = 667 using the paper's Bayesian IRT framework; reported improvement in human performance with GPT-4o assistance of +29 percentage points.
The work offers a blueprint for converting the ideological potential of AI into implementable, regulator-compatible utilities in pharmaceutical science by synthesizing quantitative measures and practical measures.
Claim about the paper's contribution (blueprint). It is an author claim about the synthesis and guidance provided; the excerpt does not include empirical validation that following the blueprint yields successful implementation.
The paper proposes a systematized framework of integration that emphasizes creating high-impact pilot projects, in-the-wild testing, and ongoing monitoring of models in accordance with FDA, EMA, and EU AI Act guidance.
Described as the paper's proposed framework and recommendations for regulatory-aligned implementation. The excerpt indicates the proposal but does not present validation or empirical testing of the framework.
Grounded in the Resource-Based View (RBV), AI is conceptualized as a strategic intangible resource that can confer a competitive advantage when integrated with complementary capabilities.
Theoretical framing presented in the paper (RBV-based conceptualization); not an empirical finding but an explicit conceptual claim.
Firms with high AI adoption had an average profit growth rate of 9.5%, compared to 5.8% for low adopters.
Reported profit growth rates for high vs. low AI adoption groups from the questionnaire data (N=400); the paper gives the specific averages: 9.5% (high adopters) vs. 5.8% (low adopters).
O artigo discute implicações gerenciais e de políticas públicas para reduzir fricção, acelerar adoção responsável e orientar investimentos em produtividade e inclusão.
Seção de discussão mencionada no resumo abordando encargos gerenciais e políticas públicas; não há avaliação empírica de políticas no resumo.
O artigo entrega instrumentos replicáveis — a escala SCF-30, um checklist de governança mínima de IA e uma matriz 30-60-90 dias — para uso prático.
Afirmação explícita no resumo de que instrumentos replicáveis são disponibilizados; presunção de inclusão dos instrumentos no corpo do artigo.
AI significantly enhances firms' total factor productivity (TFP).
Empirical results from the multidimensional fixed-effects panel model applied to the 2007–2023 sample of agricultural A-share firms; statistical significance reported in the paper.
High-quality chatbots (96–100% accurate) improved caseworker accuracy by 27 percentage points.
Experimental result reported in paper: treatment with chatbots at 96–100% aggregate accuracy produced a 27 percentage-point increase in caseworker accuracy compared to control; based on the randomized experiment on the 770-question benchmark.
Caseworker performance significantly improves as chatbot quality improves.
Aggregated results from the randomized experiment show monotonic improvement in caseworker accuracy as the chatbot suggestion accuracy increases; paper states the improvement is statistically significant (specific p-values/statistical tests not provided in the excerpt).
AI-integrated fuel blending systems achieve very high precision, demonstrated by a coefficient of determination (R2) of 0.99 during validation.
Model validation results reported in the paper (fuel blending system validation, R2 = 0.99), indicating very high explanatory/ predictive fit compared to traditional models.
DARE posits that responsible AI deployment requires the simultaneous and integrated development of Digital readiness, Administrative governance, Resilience & ethics, and Economic equity.
Descriptive claim about the framework's components as reported in the abstract (conceptual proposition).
This paper introduces the DARE Framework, a holistic, four-dimensional model for national AI strategy and international cooperation.
Factual description of paper content in abstract — the framework is introduced by the authors (conceptual/model contribution).
The authors curated a set of guidelines called the Incentive-Tuning Framework to aid researchers in designing effective incentive schemes for human–AI decision-making studies.
Authors' contribution described in the paper: development of a framework (framework content and evaluation details not provided in excerpt).
The intelligent scheduling model incorporates legal, contractual, skill-based, and preference-aware constraints to generate equitable and efficient rosters.
Methodological description of constraints encoded in the optimization model for scheduling; experimental validation of resulting rosters reported (conflict reduction and fairness metrics), but specific constraint formulations and datasets are not detailed in the excerpt.
The performance evaluation framework combines structured metrics (task completion, attendance, punctuality) with unstructured feedback (patient surveys, peer reviews) analyzed using natural language processing.
Methodological description in the paper of the performance evaluation module and use of NLP for unstructured feedback analysis; implementation details and dataset sizes not specified in the excerpt.
The proposed AI-driven HRM framework integrates forecasting, optimization, and performance evaluation to enhance workforce planning, staff scheduling, and continuous assessment.
Methodological contribution described in the paper: framework design with three core modules (demand forecasting, intelligent scheduling, performance evaluation); validated via experiments on synthetic and real hospital datasets (dataset sizes not specified in the text).
The Indian government believes that artificial intelligence (AI) will play an important role in India’s continued economic growth, both through its contribution to productivity in the private sector and through smarter and more data-led government.
Reported position in the paper based on review of government statements and policy documents (policy analysis/legal review). No empirical sample size applies; claim is descriptive of government belief.
The study extends human capital theory by integrating emotional and psychological dimensions into explanations of productivity and employment outcomes.
Theoretical contribution asserted by the authors based on their empirical findings linking emotional intelligence and psychological factors to economic outcomes; this is a conceptual extension rather than a statistical result.
Convolutional neural networks achieved 95.4% accuracy in identifying ulcers and hemorrhages.
Specific result reported from an included study using convolutional neural networks (accuracy = 95.4%) as cited in the review.
Technological innovation is the primary mediating mechanism through which NQPF affects supply chain efficiency, accounting for 84.6% of the effect.
Mediating-effect models applied to the 2012–2022 panel data (Shanghai and Shenzhen A-share listed firms) estimating mediation proportions; technological innovation mediation proportion reported as 84.6%.
New quality productivity forces (NQPF) significantly improve supply chain efficiency.
Empirical analysis using 2012–2022 panel data of Shanghai and Shenzhen A-share listed companies; results robust to robustness tests and reported as statistically significant in main regressions.
AI tools—ranging from machine learning algorithms in inventory management to natural language processing in customer engagement—are applied in micro‑enterprise contexts.
Descriptive synthesis from included articles reporting specific AI applications (ML for inventory management; NLP for customer engagement) across the reviewed literature.
We demonstrate three distinct workflows across five environments.
Paper lists and evaluates five target environments and describes three workflows (direct translation, translation verified against existing performance implementations, and new environment creation). Sample size: five environments.
Mainstreaming shared input and embracing climate-resilient management approaches are fundamental action items for building institutional resilience.
Paper conclusion lists these recommended action items based on its analysis of governance and sustainability linkages grounded in SDG and global governance literature; the summary does not indicate empirical testing of these recommendations.
The study builds and calibrates an integrated system dynamics model that connects demographics, labor supply, economic output, and public finance.
Method: development and calibration of a system dynamics model using official statistics for demographics, labor, output, and fiscal variables (model structure and calibration described in paper).
Extending existing behavioral frameworks (e.g., TAM, JD–R, Organizational Trust) to the AI-augmented workplace constitutes a theoretical contribution of the paper.
Theoretical elaboration and integration presented in the paper; contribution characterized as an extension of pre-existing models to AI contexts (no quantitative validation described in the summary).
The paper proposes a five-phase strategic roadmap for phased organizational implementation that integrates HRM practice redesign, psychological support systems, and evidence-based governance mechanisms.
Prescriptive/strategic proposal based on the paper's theoretical synthesis and applied recommendations (roadmap described in the paper; summary contains no implementation trial data).
The paper develops a comprehensive, multi-dimensional organizational psychology framework for preparing the U.S. workforce for AI integration composed of six interdependent dimensions: human–AI symbiosis, trust and transparency, job redesign, AI-enabled recruitment and selection, learning and adaptation, and ethical AI governance.
Conceptual framework derived from theoretical integration (TAM, Human–AI Symbiosis Theory, JD–R Model, Organizational Trust Theory) and review of AI–HRM literature; framework construction is a theoretical contribution of the paper (no empirical validation reported in the summary).
Grid-Scale Battery Energy Storage Systems (GS-BESS) play a crucial role in modern power grids, addressing challenges related to integrating renewable energy sources (RESs), load balancing, peak shaving, voltage support, load shifting, frequency regulation, emergency response, and enhancing system stability.
Synthesis of prior literature reported in this systematic review (methodology: literature review following PRISMA guidelines). The excerpt does not specify the number or identity of primary studies summarized for this claim.
The growth effect of AI exhibits industry heterogeneity: high‑tech manufacturing industries benefit more significantly.
Heterogeneity/subgroup regressions on the 2003–2017 Chinese industry panel showing larger estimated AI effects in high‑tech manufacturing sectors.
The positive effect of AI on industry growth increases over time.
Dynamic/DID analysis across the 2003–2017 panel showing that the estimated treatment effect grows larger in later periods.
The industry growth rate of the treatment group (industries with intensive AI application or high AI patent concentration) is significantly higher than that of the control group.
DID comparison between treatment and control industry groups in the China 2003–2017 panel, where treatment is defined by intensive AI application or AI patent concentration.
AI technology innovation has a significant positive impact on economic growth.
Industry panel data for Chinese industries from 2003 to 2017 analyzed using a differences-in-differences (DID) approach; main specification estimates effect of AI-related innovation on economic growth.
One-way ANOVA confirmed that observed improvements in yield, water use, WUE, and energy consumption were highly significant.
Statistical validation reported as one-way ANOVA with F and p values for wheat yield (F(1,18)=1335.66, p<0.001), water use (F(1,18)=15228.16, p<0.001), WUE (F(1,18)=13065.49, p<0.001), and energy consumption (F(1,18)=24312.67, p<0.001). Degrees of freedom imply 20 total observations (df between=1, df within=18).
Water-use efficiency (WUE) improved by 109% under AI-assisted irrigation (ANOVA F(1,18) = 13065.49, p < 0.001).
Reported WUE improvement percentage and one-way ANOVA treatment effect for WUE: F(1,18) = 13065.49, p < 0.001 from the field experiments.
AI-assisted irrigation decreased energy consumption by 30% (p < 0.001).
Field experiment results with one-way ANOVA showing treatment effect for energy consumption: F(1,18) = 24312.67, p < 0.001. Percentage change reported in the paper.