Machine learning for workplace mental health is still mostly a lab exercise: researchers prioritize algorithmic novelty over domain-driven, causal, or deployment studies. Without interdisciplinary co‑design and randomized or quasi‑experimental evaluation linking predictions to wages, absenteeism and productivity, ML’s economic value for workplaces will remain unproven.

Machine learning in the analysis of mental health at work: a scoping review

Pekka Varje, Ari Väänänen, Olli Haavisto, Ilkka Kivimäki, Simo Taimela, Tiina Kalliomäki-Levanto · March 09, 2026 · Journal of Occupational Health

openalex review_meta n/a evidence 7/10 relevance DOI Source PDF

ML work in occupational mental health is currently dominated by methodological innovation with little integration of domain expertise or causal/evaluation designs, and interdisciplinary, evaluation-focused research is needed to link predictive tools to economic outcomes and policy.

The application of machine learning in occupational mental health research remains in its preliminary stages, with a primary focus on methodology and computer science. The review highlights the necessity for interdisciplinary collaboration to fully leverage the potential of machine learning in advancing occupational health research.

Summary

Main Finding

The use of machine learning (ML) in occupational mental health is still nascent and largely centered on methodological and computer-science development. To realize ML’s substantive potential for improving occupational health research and policy, greater interdisciplinary collaboration—bringing together ML specialists, occupational health researchers, economists, clinicians, and policymakers—is required.

Key Points

Current work emphasizes algorithmic and methodological innovation rather than applied, domain-driven questions in occupational mental health.
There is limited integration of domain expertise (psychology, occupational health, economics) into model design, validation, and interpretation.
Practical deployment, external validity, causal identification, and evaluation of interventions remain underexplored.
Ethical, privacy, and fairness concerns (sensitive health data, workplace power dynamics) receive insufficient attention in many methodological studies.
Interdisciplinary collaboration is necessary to move from proof-of-concept models to tools that can inform workplace policy and practice.

Data & Methods

Typical data sources used or needed:
- Self-report surveys and standardized mental-health instruments.
- Administrative/HR records (absences, turnover, performance metrics).
- Passive digital traces and sensors (smartphone usage, wearables, email/text metadata) — high potential but privacy-sensitive.
- Clinical records where available (with strict privacy controls).
- Longitudinal cohort studies to assess dynamics and causal effects.
Common ML methods observed or relevant:
- Supervised learning for prediction (classification/regression of mental-health outcomes).
- Natural language processing (NLP) for text-based signals (emails, chat, free-text survey responses).
- Unsupervised learning for clustering and phenotyping.
- Representation learning and transfer learning to leverage heterogeneous data sources.
- Emerging integration with causal-inference approaches (causal forests, targeted learning) is limited but necessary.
Methodological gaps:
- Few studies combine predictive ML with causal identification or randomized evaluation.
- Model validation on out-of-sample, cross-population, and real-world deployment settings is rare.
- Insufficient attention to measurement error, confounding, and sample selection typical in workplace data.
- Data governance, de-identification, and privacy-preserving techniques (federated learning, differential privacy) are underutilized.

Implications for AI Economics

Measurement improvements: ML can produce higher-frequency, granular measures of worker mental health and stress, improving labor-market models of productivity, absenteeism, job exit, and human capital accumulation.
Policy targeting and evaluation: Predictive models could enable earlier, better-targeted interventions (EAPs, workload adjustments), but must be paired with causal evaluation to assess impact on economic outcomes.
Labor-market dynamics: Better identification of mental-health-related productivity shocks can refine estimates of wage penalties, turnover costs, and the returns to workplace mental-health programs.
Cost–benefit and adoption: Economists can quantify the welfare and firm-level returns to ML-enabled interventions; adoption will hinge on demonstrated effectiveness, legal compliance, and trust.
Risks and distributional effects: ML deployment in workplaces risks bias and adverse selection; economic analysis should assess heterogeneity across occupations, firms, and worker types to avoid exacerbating inequalities.
Research agenda for AI economics:
- Foster co-designed studies: pair ML teams with occupational-health researchers and economists from project inception.
- Prioritize datasets that link mental-health measures to economic outcomes (wages, productivity, employment spells).
- Develop standards for external validation, causal inference, and transparent reporting of algorithmic performance and harms.
- Study governance, privacy, and incentive structures influencing firm and worker uptake.
- Evaluate scalable interventions with randomized or quasi-experimental designs to estimate welfare impacts.

In short, realizing the economic and policy value of ML in occupational mental health requires shifting from isolated methodological work toward interdisciplinary, evaluation-focused research that links predictive tools to causal evidence and economic outcomes.

Assessment

Paper Typereview_meta Evidence Strengthn/a — This is a narrative review/synthesis of the literature rather than an empirical paper making causal claims; it highlights methodological trends and gaps rather than providing new causal evidence. Methods Rigormedium — The review systematically describes common data sources, ML methods, and methodological gaps, but appears to be a narrative synthesis focused on methodological critique rather than a systematic review or quantitative meta-analysis with formal inclusion criteria and pooled estimates. SampleA literature sample of ML studies applied to occupational mental health, typically using self-report surveys and standardized mental-health instruments, administrative/HR records (absences, turnover, performance), passive digital traces and sensors (smartphone usage, wearables, email/text metadata), and clinical records or longitudinal cohorts where available; most studies are methodological/proof-of-concept with limited out-of-sample or deployment data. Themesproductivity labor_markets adoption governance human_ai_collab GeneralizabilityField is nascent with many proof-of-concept/methodological studies rather than large, representative samples, Frequent reliance on self-report or convenience datasets limits external validity, Privacy and access constraints bias available data toward certain firms/sectors and geographies, Heterogeneity across occupations, firm sizes, and countries reduces transferability of findings, Lack of randomized or quasi-experimental evaluations limits ability to generalize causal impacts

Claims (4)

Claim	Direction	Confidence	Outcome	Details
The application of machine learning in occupational mental health research remains in its preliminary stages. Research Productivity	negative	medium	developmental stage/extent of application of machine learning in occupational mental health research	preliminary stage (qualitative assessment) 0.02
Current research in this area has a primary focus on methodology and computer science rather than applied occupational health questions. Research Productivity	negative	medium	topic/focus areas of published research (methodology/computer science vs applied occupational health)	research focus skewed toward methodology/computer science (qualitative) 0.02
Interdisciplinary collaboration is necessary to fully leverage the potential of machine learning in advancing occupational health research. Research Productivity	positive	speculative	capacity to leverage machine learning potential to advance occupational health research (inferred improvement via interdisciplinary collaboration)	interdisciplinary collaboration recommended (qualitative) 0.0
Machine learning has potential to advance occupational health research if its capabilities are fully leveraged through interdisciplinary work. Research Productivity	positive	speculative	advancement of occupational health research attributable to machine learning methods	ML has potential to advance occupational health research (conditional/qualitative) 0.0