Evidence (8066 claims)
Adoption
5586 claims
Productivity
4857 claims
Governance
4381 claims
Human-AI Collaboration
3417 claims
Labor Markets
2685 claims
Innovation
2581 claims
Org Design
2499 claims
Skills & Training
2031 claims
Inequality
1382 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 417 | 113 | 67 | 480 | 1091 |
| Governance & Regulation | 419 | 202 | 124 | 64 | 823 |
| Research Productivity | 261 | 100 | 34 | 303 | 703 |
| Organizational Efficiency | 406 | 96 | 71 | 40 | 616 |
| Technology Adoption Rate | 323 | 128 | 74 | 38 | 568 |
| Firm Productivity | 307 | 38 | 70 | 12 | 432 |
| Output Quality | 260 | 71 | 27 | 29 | 387 |
| AI Safety & Ethics | 118 | 179 | 45 | 24 | 368 |
| Market Structure | 107 | 128 | 85 | 14 | 339 |
| Decision Quality | 177 | 75 | 37 | 19 | 312 |
| Fiscal & Macroeconomic | 89 | 58 | 33 | 22 | 209 |
| Employment Level | 74 | 34 | 78 | 9 | 197 |
| Skill Acquisition | 98 | 36 | 40 | 9 | 183 |
| Innovation Output | 121 | 12 | 24 | 13 | 171 |
| Firm Revenue | 98 | 35 | 24 | — | 157 |
| Consumer Welfare | 73 | 31 | 37 | 7 | 148 |
| Task Allocation | 87 | 16 | 34 | 7 | 144 |
| Inequality Measures | 25 | 76 | 32 | 5 | 138 |
| Regulatory Compliance | 54 | 61 | 13 | 3 | 131 |
| Task Completion Time | 89 | 7 | 4 | 3 | 103 |
| Error Rate | 44 | 51 | 6 | — | 101 |
| Training Effectiveness | 58 | 12 | 12 | 16 | 99 |
| Worker Satisfaction | 47 | 33 | 11 | 7 | 98 |
| Wages & Compensation | 54 | 15 | 20 | 5 | 94 |
| Team Performance | 47 | 12 | 15 | 7 | 82 |
| Automation Exposure | 27 | 26 | 10 | 6 | 72 |
| Job Displacement | 6 | 39 | 13 | — | 58 |
| Hiring & Recruitment | 40 | 4 | 6 | 3 | 53 |
| Developer Productivity | 34 | 4 | 3 | 1 | 42 |
| Social Protection | 22 | 11 | 6 | 2 | 41 |
| Creative Output | 16 | 7 | 5 | 1 | 29 |
| Labor Share of Income | 12 | 6 | 9 | — | 27 |
| Skill Obsolescence | 3 | 20 | 2 | — | 25 |
| Worker Turnover | 10 | 12 | — | 3 | 25 |
Procedural fairness perceptions positively predict employee experience outcomes, including organizational commitment, job satisfaction, and employer trust.
PLS-PM paths from procedural fairness perceptions to employee experience measures (organizational commitment, job satisfaction, employer trust) using survey data from HR professionals' reports.
Algorithmic transparency is a strong predictor of procedural fairness perceptions.
PLS-PM results linking measured algorithmic transparency to procedural fairness perceptions in the survey data (n=523 respondents).
AI adoption is positively associated with improvements in quality-of-hire.
PLS-PM association between AI adoption and reported quality-of-hire improvement from HR respondents across sampled organizations.
AI adoption is positively associated with reductions in cost-per-hire.
PLS-PM association between AI adoption and cost-per-hire reduction reported in the survey (firm-level outcomes across sampled organizations).
AI adoption is positively associated with reductions in time-to-hire (recruitment time).
PLS-PM association between AI adoption and recruitment efficiency metrics reported in the survey (firm-level outcomes across sampled organizations).
Top management support and HR digital readiness are both positively associated with organizational AI adoption, with top management support demonstrating greater explanatory power.
PLS-PM tests of organizational antecedents predicting organizational AI adoption using survey responses aggregated to organization level (184 organizations referenced).
Perceived usefulness and perceived ease of use significantly predict AI adoption intention, with perceived usefulness exhibiting a stronger effect.
PLS-PM results on relationships between TAM constructs (perceived usefulness, perceived ease of use) and adoption intention using survey data (n=523).
A large portion of the interactive activities' AI market value (26%) involves transferring information.
Descriptive subcategory statistic: within interactive activities, authors report 26% of market value pertains to information transfer tasks.
Interactive activities (which include both information-based and physical activities) account for 48% of AI market value.
Descriptive aggregate: authors define an 'interactive' category spanning info and physical activities and report it holds 48% of AI market value.
A substantial portion of AI market value (36%) is used in activities that involve creating information.
Descriptive aggregate: subcategory within information-based activities—authors report 36% of market value allocated to 'creating information'.
Most of the AI market value is used in information-based activities (72%).
Descriptive aggregate: authors categorize activities into information-based vs physical and report that 72% of estimated AI market value maps to information-based activities.
There is a highly uneven distribution of AI market value across activities: the top 1.6% of activities account for over 60% of AI market value.
Descriptive statistical result from mapping estimated AI market values to the ~20K activities; authors report concentration metrics (top 1.6% share >60%).
We use the data about AI software and robotic systems to generate graphical displays of how the estimated units and market values of all worldwide AI systems used today are distributed across the work activities that these systems help perform.
Analytic/mapping procedure: authors combine classifications of software (13,275) and robots (20.8M) with market-value estimates to create visual distributions across activities.
We classify a worldwide tally of 20.8 million robotic systems using the developed work-activity ontology.
Empirical classification/counting: authors report mapping 20.8 million robotic systems worldwide to the activity ontology.
We classify descriptions of 13,275 AI software applications using the developed work-activity ontology.
Empirical classification: authors state they mapped 13,275 AI software application descriptions to the ontology.
We disaggregate and then substantially reorganize the approximately 20K activities in the US Department of Labor's O*NET occupational database to produce a comprehensive ontology of work activities.
Methodological: authors report transforming the O*NET activity taxonomy (~20,000 activity-level records) by disaggregation and reorganization into a new ontology.
Models trained in EnterpriseLab remain robust across diverse enterprise benchmarks, including EnterpriseBench (+10%) and CRMArena (+10%).
Benchmark evaluations reported in the paper showing reported +10% improvements on EnterpriseBench and CRMArena relative to baseline; exact baselines, statistical tests, and sample sizes are not specified in the abstract.
8B-parameter models trained in EnterpriseLab reduce inference costs by 8-10x compared to frontier models (implied GPT-4o).
Empirical cost comparison reported in the paper; the abstract states an 8-10x reduction in inference costs for the 8B models trained in EnterpriseLab versus the referenced frontier model(s). Detailed cost accounting and sample sizes not provided in the abstract.
8B-parameter models trained within EnterpriseLab match GPT-4o's performance on complex enterprise workflows.
Empirical evaluation reported in the paper comparing 8B-parameter models trained in EnterpriseLab to GPT-4o on complex enterprise workflows; specific benchmark tests and metrics are referenced but details (sample sizes, exact metrics) are not provided in the abstract.
We validate the platform through EnterpriseArena, an instantiation with 15 applications and 140+ tools across IT, HR, sales, and engineering domains.
Reported instantiation/experimental setup in the paper: EnterpriseArena contains 15 applications and 140+ tools spanning specified domains.
EnterpriseLab provides integrated training pipelines with continuous evaluation.
System/design claim in paper describing integrated training and evaluation tooling as part of the platform.
EnterpriseLab includes automated trajectory synthesis that programmatically generates training data from environment schemas.
System/design claim described in paper; supported by the authors' description of an automated data-generation component.
EnterpriseLab provides a modular environment exposing enterprise applications via a Model Context Protocol, enabling seamless integration of proprietary and open-source tools.
Feature/design claim in paper; supported by implementation details of the 'Model Context Protocol' and reported integration capabilities in the platform description.
We introduce EnterpriseLab, a full-stack platform that unifies tool integration, data generation, and training into a closed-loop framework.
System/design claim describing the contribution of the paper (platform implementation and architecture); supported by the paper's implementation description rather than independent validation.
AIGQ overcomes limitations of traditional HintQ methods (shallow semantics, poor cold-start performance, and low serendipity) that arise from reliance on ID-based matching and co-click heuristics.
Claimed comparative advantage in the abstract; implied support from the paper's offline and online experiments but no detailed quantitative comparisons provided in the abstract.
Extensive offline evaluations and large-scale online A/B experiments on Taobao demonstrate that AIGQ consistently delivers substantial improvements in key business metrics across platform effectiveness and user engagement.
Empirical claim supported by unspecified offline evaluations and large-scale online A/B testing on Taobao as stated in the abstract. The abstract does not report sample sizes, metric names, or numerical effect sizes.
A hybrid offline-online deployment architecture composed of AIGQ-Direct (nearline personalized user-to-query generation) and AIGQ-Think (reasoning-enhanced trigger-to-query mappings) enables meeting strict real-time and low-latency requirements while enriching interest diversity.
System/architecture description in the paper; the abstract states the two-component architecture and its intended operational benefits (real-time/low-latency and increased diversity). The paper references large-scale online deployment and experiments but no concrete latency numbers in the abstract.
IL-GRPO is enhanced by a model-based reward from the online click-through rate (CTR) ranking model.
Methodological detail in the paper: inclusion of a model-based reward signal derived from an online CTR ranking model to augment the policy optimization; described in abstract as part of IL-GRPO's design.
Interest-aware List Group Relative Policy Optimization (IL-GRPO) is a novel policy gradient algorithm with a dual-component reward mechanism that jointly optimizes individual query relevance and global list properties.
Algorithmic contribution described in the paper (policy gradient design and dual-component reward). The abstract states this design and that it is used in experiments; no numeric effect sizes provided in the abstract.
Interest-Aware List Supervised Fine-Tuning (IL-SFT) is a list-level supervised learning approach that constructs training samples through session-aware behavior aggregation and interest-guided re-ranking to faithfully model nuanced user intent.
Methodological description in the paper: definition of IL-SFT and its training sample construction; supported implicitly by offline evaluations and downstream experiments referenced in the paper (no sample size or numeric results given in abstract).
AIGQ is the first end-to-end generative framework for the HintQ (pre-search query recommendation) scenario.
Explicit novelty/assertion in the paper's introduction/abstract claiming AIGQ as the first end-to-end generative framework for HintQ; no numerical experiment used to support the 'first' claim (methodological/positioning claim).
Organizations can design more effective recruitment strategies by signaling AI adoption to increase attractiveness to prospective applicants.
Practical implication drawn from the combined experimental findings (Study 1 N = 145; Study 2 N = 240; total N = 385) showing AI-adoption signals increase organizational attractiveness via perceived innovation ability, particularly for applicants with high AI self-efficacy.
Conceptualizing AI adoption as an organizational signal extends signaling theory to the context of technology-infused recruitment.
Theoretical argumentation in the paper, supported by the two experimental studies (Study 1 and Study 2) that test signaling mechanisms in recruitment contexts.
The positive indirect effect of AI-adoption signals on organizational attractiveness via perceived innovation ability is stronger for job seekers with high AI self-efficacy (Study 2 moderated mediation).
Study 2: moderated mediation model showing AI self-efficacy moderates the mediated relationship; sample size N = 240; participants were active job seekers.
Perceived innovation ability mediates the positive association between AI-adoption signals and organizational attractiveness (Study 2).
Study 2: moderated mediation analysis in an experiment recruiting active job seekers; sample size N = 240; mediation of AI-signal -> perceived innovation ability -> organizational attractiveness was validated.
AI-adoption signals are significantly positively associated with organizational attractiveness (Study 1).
Study 1: scenario-based experiment comparing AI-adoption signal vs no-signal conditions; sample size N = 145.
The paper reports details from a 100% deployment of DRL with policy regularizations on Alibaba's e-commerce platform, Tmall.
Direct statement in the abstract claiming full deployment across Tmall; implies a real-world, company-scale deployment but the abstract provides no operational metrics or counts.
Imposing policy regularizations improves the final performance of several DRL methods for inventory management.
Empirical claim supported by the paper's synthetic experiments and reported production deployment on Alibaba/Tmall (as stated in the abstract); no quantitative effect sizes provided in the abstract.
Imposing policy regularizations, grounded in classical inventory concepts such as 'Base Stock', can significantly accelerate hyperparameter tuning for DRL methods.
Paper reports synthetic experiments and a production deployment (Alibaba/Tmall) where policy regularizations were applied; abstract claims acceleration in hyperparameter tuning but does not report numeric tuning-time metrics in the abstract.
Deep Reinforcement Learning (DRL) provides a general-purpose methodology for training inventory policies that can leverage big data and compute.
Argument/assertion made in the paper's introduction/abstract (conceptual claim about DRL capabilities); no empirical sample or quantitative test reported in the abstract.
Human-replacing technologies have a strategic role in enhancing industrial productivity and ensuring the long-term resilience of Ukraine’s mining and metallurgical sector amid workforce shortages and structural labour-market changes due to war and demographic decline.
Integrated sectoral assessment in the paper combining current context (workforce shortages, structural changes), literature on technology-driven productivity/resilience, and industry-specific considerations; presented as a high-level conclusion.
Integrating ergonomic assessments and human–systems–interaction approaches into automation projects is important to prevent cognitive overload, occupational stress and operational risks for control‑room operators.
Recommendation and emphasis in the paper, supported by references to ergonomics and human-factors literature; presented as a preventive/mitigative approach rather than a quantified empirical result for the sector.
Successful technological modernization requires continuous investment in human capital, reskilling and the development of digital and engineering competencies.
Policy/recommendation based on the paper's synthesis of the sector analysis and literature on skill requirements and technology adoption; not presented as an original empirical estimate in the summary.
Higher robot density is associated with productivity gains, particularly in low-robotized sectors such as Ukraine’s mining and metallurgical industry.
Empirical evidence cited from international and industry-specific studies reviewed in the paper (literature review/meta-analytic style evidence); no Ukraine-specific causal estimate with sample size reported in the summary.
Human-replacing technologies also have an indirect impact on productivity by increasing total factor productivity (TFP).
Analytical argumentation in the paper supported by references to empirical studies showing TFP effects of automation/digitalization; literature synthesis rather than a new econometric estimate presented for Ukraine.
Human-replacing technologies (mechanization, automation, robotization, digitalization and AI-augmentation) make a direct contribution to labour productivity growth in Ukraine's mining and metallurgical sector.
Sectoral analysis and synthesis in the paper drawing on empirical international and industry-specific studies; literature review of productivity impacts of mechanization/automation/robotization/digitalization/AI in industrial contexts.
Industrial intelligence and the digital economy can be leveraged as a 'dual engine' to boost regional TFCP and advance high-quality green and low-carbon economic development, supporting differentiated regional coordination policies.
Synthesis/implication drawn from the paper's empirical findings (SDM results on 30 provinces, 2010–2023) showing positive total/spillover effects and regional heterogeneity.
Green finance has an insignificant positive effect on regional TFCP.
Coefficient on green finance control variable in the Spatial Durbin Model (30 provinces, 2010–2023) is positive but not statistically significant.
The digital economy presents different regional driving patterns: a 'local-spillover dual drive' in the east, a 'local-dominated drive' in the central region, and a 'spillover-dominated drive' in the west.
Regional/subsample Spatial Durbin Model estimates for digital economy variables across east, central, and west subsamples (30 provinces, 2010–2023) with reported direct and indirect effects.
The digital economy exerts a significantly positive direct effect on local TFCP and a strong positive spatial spillover effect, forming a 'local driving + spatial radiation' promotion pattern.
Spatial Durbin Model estimates on panel data (30 provinces, 2010–2023) showing statistically significant positive direct and indirect (spillover) coefficients for digital economy variables.