Evidence (2966 claims)
Adoption
8570 claims
Productivity
7631 claims
Governance
6869 claims
Human-AI Collaboration
6491 claims
Org Design
4175 claims
Innovation
4114 claims
Labor Markets
3566 claims
Skills & Training
2966 claims
Inequality
2066 claims
Evidence Matrix
Claim counts by outcome category and direction of finding.
| Outcome | Positive | Negative | Mixed | Null | Total |
|---|---|---|---|---|---|
| Other | 758 | 199 | 100 | 900 | 2007 |
| Governance & Regulation | 826 | 400 | 191 | 122 | 1563 |
| Organizational Efficiency | 777 | 193 | 124 | 84 | 1189 |
| Technology Adoption Rate | 635 | 233 | 124 | 97 | 1098 |
| Research Productivity | 422 | 128 | 57 | 336 | 954 |
| Output Quality | 476 | 179 | 59 | 47 | 761 |
| Decision Quality | 328 | 177 | 81 | 47 | 640 |
| Firm Productivity | 435 | 57 | 88 | 20 | 606 |
| AI Safety & Ethics | 218 | 277 | 65 | 33 | 599 |
| Market Structure | 180 | 170 | 123 | 24 | 502 |
| Task Allocation | 213 | 64 | 72 | 33 | 387 |
| Skill Acquisition | 170 | 61 | 61 | 17 | 309 |
| Innovation Output | 203 | 27 | 43 | 18 | 292 |
| Employment Level | 105 | 54 | 107 | 13 | 281 |
| Fiscal & Macroeconomic | 131 | 69 | 43 | 26 | 276 |
| Consumer Welfare | 117 | 63 | 42 | 11 | 233 |
| Firm Revenue | 153 | 48 | 26 | 3 | 230 |
| Task Completion Time | 173 | 31 | 8 | 12 | 225 |
| Inequality Measures | 44 | 122 | 49 | 6 | 221 |
| Worker Satisfaction | 89 | 65 | 22 | 12 | 188 |
| Error Rate | 69 | 92 | 10 | 2 | 173 |
| Regulatory Compliance | 77 | 69 | 14 | 5 | 165 |
| Automation Exposure | 56 | 56 | 26 | 13 | 154 |
| Training Effectiveness | 94 | 21 | 13 | 19 | 149 |
| Wages & Compensation | 77 | 36 | 25 | 6 | 144 |
| Team Performance | 86 | 17 | 27 | 10 | 141 |
| Developer Productivity | 95 | 17 | 14 | 6 | 133 |
| Job Displacement | 12 | 80 | 20 | 1 | 113 |
| Hiring & Recruitment | 52 | 7 | 8 | 3 | 70 |
| Creative Output | 31 | 18 | 8 | 3 | 61 |
| Skill Obsolescence | 5 | 46 | 6 | 1 | 58 |
| Social Protection | 27 | 16 | 8 | 2 | 53 |
| Labor Share of Income | 17 | 19 | 17 | — | 53 |
| Worker Turnover | 11 | 12 | — | 3 | 26 |
| Industry | — | — | — | 1 | 1 |
Skills Training
Remove filter
Empirical evaluation is needed on how AI-induced productivity gains translate into aggregate demand and labor absorption.
Identified research priority in the paper, based on theoretical uncertainty about demand-side labor absorption and lack of conclusive empirical evidence.
AI will not mechanically cause permanent mass unemployment at the aggregate level.
Theoretical framing and synthesis of existing empirical findings across task-based and macro studies; no single new dataset provided (paper draws on literature and conceptual models).
Occupation-level analyses (e.g., BLS OEWS cross-occupation wage regressions) risk misleading conclusions about AI’s distributional effects because they aggregate over the task- and firm-level heterogeneity that drives the mechanism.
Theoretical argument and empirical illustration in the paper showing how aggregation masks within-task compression and firm-level rent capture; example regressions on OEWS used to demonstrate the limitation.
Testing the model requires within-occupation, within-task panel data on task-level performance and wages linked to firm-level AI adoption, ownership of complementary assets, and measures of rent-sharing; such data are not available at scale.
Author statement about data requirements and current data limitations; empirical illustration and discussion note absence of large-scale linked microdata meeting these criteria.
Occupation-level regressions using BLS OEWS (2019–2023) are insufficient for testing the model’s task-level predictions because aggregation across tasks and firms hides the mechanism.
Empirical illustration in the paper using occupation-level regressions on BLS OEWS 2019–2023 showing that such aggregates do not reveal within-occupation, within-task dispersion or firm-level rent concentration effects; paper argues this is a data-adequacy limitation.
A sensitivity decomposition shows five of the moments (the non‑ΔGini moments) identify internal mechanism rates (how AI changes task production, education responses, screening intensity) but do not determine the aggregate sign of inequality change.
Local identification / sensitivity decomposition performed on the calibrated model; decomposition results reported in the paper attribute mechanism-rate identification to five moments and show they leave the sign of ΔGini indeterminate.
AI did not significantly moderate the relationship between workplace stress and job performance.
Moderation test in PLS-SEM (SmartPLS 4.0) on N = 350; reported non-significant AI × Stress → Performance moderator (paper reports no significant moderating effect).
Use of AI raises needs for traceability, explainability, and continuous validation to maintain compliance and avoid error propagation in curricular decisions.
Paper's AI governance recommendations (prescriptive), referencing general AI risk principles rather than empirical study.
Realising DT value requires upfront investment in sensors, integration, standards, and skills; economic viability depends on contract structures and how gains are allocated between investors, owners, contractors, and operators.
Synthesis of cost/benefit discussions and case descriptions in the reviewed literature; policy and procurement examples referenced.
Results are robust across alternative AI index specifications, occupational classifications, and standard controls (country and year fixed effects, macroeconomic covariates).
Paper reports robustness checks across different index constructions and occupational taxonomies, with standard controls included in regressions.
Research priorities include causal studies on productivity gains from AI, firm‑level adoption dynamics, sectoral labor reallocation, long‑run general equilibrium effects, and heterogeneous impacts across regions and demographic groups.
Set of empirical research recommendations drawn from gaps identified in the literature review and limitations section; not an empirical claim but a prioritized research agenda based on secondary evidence.
Growth‑accounting frameworks and measurement approaches must be updated to capture AI/robotics as intangible and embodied capital, including quality improvements and spillovers.
Methodological argument grounded in literature on measurement challenges and examples of intangible capital; no new measurement exercise or empirical re‑estimation is provided in the paper.
Backtesting the proposed models against historical technological transitions (e.g., ATMs, robotics) and recent AI adoption episodes can validate model performance.
Recommended validation strategy; paper does not report backtest results but prescribes holdout/pseudo‑counterfactual experiments and calibration with administrative outcomes.
Scenario modelling in the reviewed literature typically uses counterfactual simulations with different adoption speeds, policy responses, and initial conditions to bound possible employment, wage, and productivity trajectories.
Description and citations of scenario-modelling practices by think tanks and organisations (TBI, IPPR, IMF) and academic work referenced; evidence is methodological and report-based.
NLP/LLM pipelines are used to extract tasks and skills from free-text job ads and to map those tasks to AI capabilities.
Described methods and citations (Xu et al., 2025; Hampole et al., 2025); evidence is methodological application of transformer-based models to job-ad text in recent studies.
Methods increasingly apply advanced NLP and large language models (BERT, LSTM, GPT-4) to parse job descriptions, map skills/tasks, and predict automation risk.
Cited methodological examples in the paper (Xu et al., 2025; Hampole et al., 2025) and discussion of common pipelines using transformer-based models to extract tasks from free-text job ads and to map tasks to AI capabilities; evidence is methodological and based on recent studies rather than a single benchmarked dataset.
Providing optional LLM access without training did not increase average exam scores versus no LLM access.
Intent-to-treat comparisons across randomized arms reported in the study: comparison of optional-access-without-training arm to no-access arm showed no average score improvement (sample n = 164).
The benefits of AI-enabled e-commerce and automated warehousing are conditional on complementary policies (competition policy, data governance, workforce reskilling, automation oversight) to manage concentration, privacy, distributional effects, and safety.
Policy-analysis synthesis supported by sensitivity checks in scenario analyses and discussion of governance risks; recommendations informed by observed distributional and market-concentration patterns in the case material.
AI’s net impact on employment to date is modest — no clear evidence of mass unemployment.
Systematic literature review/meta-synthesis of 17 peer‑reviewed publications (published 2020–2025). Aggregate assessment across those studies found no consistent empirical support for large-scale, economy-wide unemployment attributable to AI to date.
Survey responses and interviews indicate a broader range of emerging competencies, suggesting the spectrum of required advanced digital skills is likely to expand in the near future.
Paper synthesizes survey and interview findings to infer an expanding set of competencies; this is a forward-looking interpretation rather than a strictly observed quantitative trend; no forecast model or time-series data reported.
These findings and institutional lessons extend beyond programming to credentialing systems (medical and legal boards, professional certification) that certify skill in a workforce increasingly shaped by AI.
Generalization / policy claim offered by authors (normative extrapolation from programming contest evidence to other credentialing systems).
Two levers follow from the contrast: (1) how AI is integrated into training, since within the screened pool AI-style practice coincides with stronger non-AI-aided performance; and (2) the design of AI-prohibited evaluation gates as a type-separating institution.
Interpretation and policy implication drawn from empirical results (conceptual recommendation; not a directly tested intervention in the paper).
Inside the AI-prohibited ICPC environment, a shift toward AI-style practice predicts higher non-AI-aided scores for AI-era entrants.
Within-ICPC empirical analysis comparing entrants across eras (pre/post AI) and relating practice signature to ICPC non-AI-aided scores; specific sample size and estimates not provided in abstract.
Subgroup analysis reveals AACT can be particularly beneficial for some decision-makers such as those very familiar with AI technologies.
Subgroup analysis reported in the house price prediction case study indicating heterogenous effects by familiarity with AI (no subgroup sample sizes provided in abstract).
AI assistance shows promise for increasing discretionary but beneficial work (tasks users intend but often skip) while preserving human control over final outcomes.
Synthesis/generalization based on randomized field experiment results (increased feedback provision and length; no negative effects on usefulness or time per character) and supporting qualitative interview findings. Empirical data from a 300-level ML course with 11 TAs and 88 students.
A strategic labor division emerged: the LLM serves as a generative engine to mitigate teacher burnout.
Claim in the abstract describing the role allocation observed in the system; implies LLMs reduced teacher workload/burnout based on the system's deployment and analysis. No numeric measure of burnout provided in the abstract.
Beyond replacing repetitive manual labor, AI has penetrated into complex cognitive labor fields once deemed hard to automate, reshaping industry work paradigms, blurring traditional occupational boundaries, and triggering an unprecedented structural transformation in the labor market.
Framing/background claim in the paper describing observed trends and technological developments; the excerpt does not cite specific empirical tests or data for this broad statement.
The results inform industrial policies focused on workforce adaptation and managing the digital transition in manufacturing.
Policy implication drawn by the authors from the empirical results (positive association between digital transformation and labor demand, plus heterogeneous effects).
Rising employee digital literacy (from digital transformation) promotes both the amount of labor demanded and the intensity of factor input.
Mechanism/mediation analysis reported in the paper linking digital transformation → employee digital literacy → labor demand and factor-input intensity (Chinese A-share manufacturing firms, 2011–2024). (Sample size not stated in provided text.)
A regional integration strategy is critical to achieving coordinated development of digital talent agglomeration and industrial digitalization and thereby promoting regional economic growth.
Policy implication offered by the authors, motivated by regional heterogeneity in empirical results (e.g., positive interaction in Yangtze River Delta versus deviations elsewhere). This is presented as a recommendation rather than a directly tested causal claim.
Depending on context, AI can either complement human skill development by amplifying independent reasoning or act as a substitute that undermines such reasoning; therefore regulating AI access and usage will be important for promoting skill development in the presence of AI assistance.
Interpretation and policy implication drawn from the controlled experiment's observed variation by AI usage intensity and informativeness (experimental details and sample size not provided in abstract).
Effective AI implementation, coupled with employee training and transparent communication, can reduce resistance and anxiety among employees.
Interpretation and conclusion drawn from the observed negative relationship between perceived opportunities and challenges and the pattern of survey responses; presented as a recommended approach in the study.
Wage inequality increased due to differential skill adaptation across workers.
Authors' conclusion drawn from observed effects of AI adoption and skill transformation on wage dynamics in the SEM applied to the survey (n=320); statement presented qualitatively in the results/discussion (no inequality coefficient provided in the summary).
AI created opportunities by increasing demand for high-skilled labor.
Authors' interpretation of SEM results and descriptive analysis from the survey of n=320 employees indicating skill-upgrading effects; specific numerical evidence for 'demand for high-skilled labor' not reported in the summary.
"Augmented Intelligence" models, which combine human contextual judgment with algorithmic precision, reduce attrition by 22% compared with complete automation.
Reported comparative result in the paper's analysis (paper claims comparative attrition rates between augmented and fully automated approaches; exact data source not explicitly tied to one of the stated samples in the abstract).
Technology has increased efficiency in organisations based in large cities in India.
Review result statement claiming observed efficiency gains in urban organisations according to the literature summarized; based on reviewed studies (no single sample size reported in excerpt).
Trade unions have increasingly pursued algorithmic transparency and stronger technology governance rights through collective bargaining, and governments are accelerating legislative initiatives to establish and protect workplace technology rights.
Descriptive review of labor-movement responses and recent government legislative initiatives reported in the literature (case studies and policy reviews).
By capturing complete interaction traces with human vs. agent code authorship attribution, SWE-chat provides an empirical foundation for moving beyond curated benchmarks towards an evidence-based understanding of how AI agents perform in real developer workflows.
Claims about dataset capabilities and intended use: the dataset contains interaction traces and authorship labels enabling empirical research; asserted by authors as an implication of the dataset contents.
Pair programming between students is well studied and known to be beneficial to self-efficacy and academic achievement.
Background literature claim presented in the paper's introduction (cites existing research on pair programming benefits).
Our findings can help practitioners, educators, and policymakers promote responsible and effective use of AI tools.
Authors assert applicability of their qualitative findings and the proposed framework (derived from 22 interviews) to inform stakeholders.
In simulation (chess, using learned human models from large-scale gameplay data), our approach consistently outperforms interventions based on the strongest chess engine (Stockfish) across a wide range of settings.
Simulation experiments in chess using models of human play trained from large-scale gameplay data; comparisons against Stockfish-based interventions (details described in paper).
These patterns suggest personality as a predictor of readiness beyond stage-based tailoring: vulnerable users benefit from targeted rather than comprehensive interventions.
Authors' inference from the clustered outcome patterns observed in the experiment (resilient/overcontrolled/undercontrolled differences) indicating personality moderates responsiveness to different intervention types.
Overcontrolled workers showed outcome-specific improvements with theory-driven AI.
Reported experimental finding: participants in the overcontrolled cluster improved on certain (outcome-specific) measures when assigned to the theory-driven AI (Trucey) condition.
Resilient workers achieved broad psychological gains primarily from the handbook.
Reported experimental result: resilient cluster exhibited broad psychological improvements, with the traditional negotiation handbook (Control-NoAI) producing those gains.
Workplace organization (W) materially modifies the augmentation function so that two firms with identical technology investments can realize 'radically different' augmentation outcomes.
Conceptual claim supported by the paper's theoretical model (phi(D,W)) and cited empirical illustration (Colombia EDIT survey interaction result).
The growth of digital platforms contributes to the decentralization of job creation.
Paper cites contemporary data on the growth of digital platforms as part of its analysis (no specific platform-level datasets or sample sizes cited in the abstract).
Drawing on analysis of agentic investment firm operational models demonstrating 50-70% cost reductions while maintaining fiduciary standards.
Internal analysis/modeling of agentic investment firm operational models reported by the authors; paper states the 50–70% cost reduction result but provides no sample size or detailed empirical validation in the provided text.
Fostering digital transformation alongside workforce reskilling and innovation-ecosystem development is essential for sustainable industrial growth and strengthening Kazakhstan’s global economic position.
Policy and strategic recommendations based on the study's empirical results, case studies, and macro-level index comparisons.
Digital transformation combined with workforce retraining optimizes labor costs and enhances productivity.
Synthesis of enterprise-level case examples and aggregated regression/correlation findings at industry and national levels that link digitalization and retraining programs to labor-cost and productivity indicators.
These findings provide quantitative foundations for AI capability-threshold governance.
Synthesis/interpretation of model results and empirical validation described in the paper (recommendation/implication).