Hybrid Deep Learning Approach for Coupled Demand Forecasting and Supply Chain Optimization

Supply chain resilience and efficiency are vital in industries characterized by volatile demand and uncertain supply, such as textiles and personal protective equipment (PPE). Traditional forecasting and optimization approaches often operate in isolation, limiting their real-world effectiveness. This paper proposes a Hybrid AI Framework for Demand-Supply Forecasting and Optimization (HAF-DS), which integrates a Long Short-Term Memory (LSTM)-based demand forecasting module with a mixed integer linear programming (MILP) optimization layer. The LSTM captures temporal and contextual demand dependencies, while the optimization layer prescribes cost-efficient replenishment and allocation decisions. The framework jointly minimizes forecasting error and operational cost through embedding-based feature representation and recurrent neural architectures. Experiments on textile sales and supply chain datasets show significant performance gains over statistical and deep learning baselines. On the combined dataset, HAF-DS reduced Mean Absolute Error (MAE) from 15.04 to 12.83 (14.7%), Root Mean Squared Error (RMSE) from 19.53 to 17.11 (12.4%), and Mean Absolute Percentage Error (MAPE) from 9.5% to 8.1%. Inventory cost decreased by 5.4%, stockouts by 27.5%, and service level rose from 95.5% to 97.8%. These results confirm that coupling predictive forecasting with prescriptive optimization enhances both accuracy and efficiency, providing a scalable and adaptable solution for modern textile and PPE supply chains.

Summary

Main Finding

The paper proposes HAF-DS, a hybrid AI framework that jointly trains an LSTM-based demand forecaster with a supply-side mixed-integer linear programming (MILP) optimizer via a shared, differentiable loss. On textile and PPE datasets the integrated model outperforms statistical and standalone deep-learning baselines: MAE falls from 15.04 to 12.83 (–14.7%), RMSE from 19.53 to 17.11 (–12.4%), MAPE from 9.5% to 8.1%. Operationally, inventory cost decreased by 5.4%, stockouts fell 27.5%, and service level rose from 95.5% to 97.8%. The joint predictive–prescriptive coupling ensures forecast learning is directly informed by downstream cost/service objectives, yielding both better accuracy and more efficient supply decisions.

Key Points

End-to-end differentiable coupling: forecasting parameters receive gradients from a supply-side loss, turning predict-then-optimize into a unified learning problem (Ltotal = λ1Lforecast + λ2Lsupply).
Forecasting module: stacked LSTM (128 units, dropout), embedding layers for categorical features, sliding-window time-series inputs (L = 30).
Optimization module: MILP-based supply/replenishment decision layer that uses predicted demand (and sampled forecasts) to compute operational costs and constraints.
Probabilistic treatment: predicted demand modeled as ˆyt+1 ∼ N(µθ, σ2θ) and sampled during training for robustness to uncertainty.
Data engineering: mean/mode imputation, Min–Max normalization, categorical embeddings, train/val/test chronological split (70/15/15), data augmentation (Gaussian noise, dropout of supplier records).
Empirical gains vs. baselines: notable improvements in predictive metrics (MAE/RMSE/MAPE) and operational metrics (inventory cost, stockout rate, service level).
Ablation: trainable embeddings and joint training materially contribute to performance (embeddings alone reduced MAE by ~4–6%, RMSE by ~5–7%).
Claimed novelty: fully integrated, differentiable predictive–prescriptive architecture (as opposed to standard two-stage or loosely coupled hybrids).

Data & Methods

Datasets: combined textile sales and supply-chain operational datasets (textile and PPE contexts). Exact dataset sizes not specified in the provided excerpt.
Preprocessing:
- Missing values: mean imputation for continuous, mode for categorical.
- Scaling: Min–Max normalization to [0,1].
- Categorical encoding: trainable embeddings (E ∈ R|C|×d).
- Sequence generation: sliding windows of past L=30 time steps to predict next-step demand.
- Data augmentation: Gaussian noise injection and simulated supplier dropouts.
Model architecture:
- Forecasting: stacked LSTM layers (128 units), dense layers (e.g., 32 units ReLU), dropout.
- Optimization: MILP decision layer producing replenishment/allocation actions given demand forecasts and contextual features.
- Hybrid integration: total loss Ltotal = λ1Lforecast + λ2Lsupply; gradients flow into θ (forecast model) and ϕ (optimizer parameters where differentiable). Stochastic sampling of forecasts used to expose optimizer to uncertainty.
Training & evaluation:
- Chronological train/validation/test split (70/15/15).
- Metrics: MAE, RMSE, MAPE for forecasting; inventory cost, stockout rate, service level, total cost for operational evaluation.
- Baselines: classical statistical methods (e.g., ARIMA, smoothing), tree-based and deep-learning baselines (details in paper).
- Ablation analyses: impact of embeddings and joint training measured.
Methodological caveats noted by authors (and inferred):
- Differentiable coupling to MILP likely requires relaxation/approximation (paper asserts differentiable coupling but implementation details matter).
- Use of LSTM rather than transformer architectures (transformers noted in related work as competitive for time series).

Implications for AI Economics

Operational value: Joint predictive–prescriptive learning produces measurable cost savings (inventory cost down ~5.4%) and service improvements (service level +2.3 percentage points), indicating positive firm-level ROI potential from integrated AI investments in forecasting + optimization.
Welfare and resilience: Large reductions in stockouts (–27.5%) are especially important for PPE and essential goods; better allocation reduces externalities from shortages during crises (health, disruption costs).
Resource allocation and capital efficiency: Lower inventories and fewer stockouts imply capital tied in inventory can be reduced and redeployed, affecting working-capital dynamics and firms’ financing needs.
Adoption incentives: The framework strengthens the case for firms to move beyond predict-then-optimize pipelines toward end-to-end systems, justifying investments in data infrastructure, joint-model development, and optimization tooling.
Market structure & competition: Firms that successfully implement such coupled systems may gain service- and cost-based competitive advantages, potentially affecting market concentration in industries where inventory/service matters.
Policy & public goods: For critical supply chains (e.g., PPE), policymakers could promote shared data standards or incentives to adopt predictive–prescriptive systems to increase system-wide resilience.
Risks & limits for economic deployment:
- Generalizability: Gains reported on textile/PPE datasets may not fully transfer to other sectors or multi-echelon global supply chains without adaptation.
- Data quality & distribution shifts: Imputation strategies and historical data-driven models are sensitive to structural breaks (e.g., pandemics, regulatory changes). Economic value depends on robustness to regime change.
- Implementation costs and complexity: Joint training with differentiable optimization may require specialized engineering and compute; smaller firms may face adoption barriers.
- Labor and organizational effects: Automation of replenishment decisions could change roles in planning and procurement, creating re-skilling needs.
Research directions with economic significance:
- Quantify firm-level ROI and payback periods from integrated systems; cost–benefit analyses across firm sizes and supply-chain structures.
- Extend to multi-echelon stochastic optimization and market-level supply resilience analyses.
- Assess distributional impacts: which firms/sectors benefit most, and how does this affect competition and labor demand.
- Robustness to structural breaks: causal or distributionally-robust predictive–prescriptive approaches to reduce downside economic risk.

Concise critique/notes - Strengths: clear empirical improvements linking forecasting gains to operational outcomes; principled joint-loss formulation; attention to categorical embeddings and uncertainty sampling. - Limitations: paper relies on LSTM rather than state-of-the-art transformer/time-series models (though transformers are discussed); implementation details of making MILP differentiable are not fully described in the excerpt (practical relaxations/solvers matter); external validity and dataset scale need clarity for broader economic claims.

Assessment

Paper Typedescriptive Evidence Strengthmedium — The paper provides empirical evidence of improved forecasting and operational metrics by comparing the proposed hybrid LSTM+MILP pipeline to statistical and deep-learning baselines on textile and PPE datasets, with clear out-of-sample performance metrics (MAE, RMSE, MAPE) and supply-chain outcomes (inventory cost, stockouts, service level). However, evidence is limited to retrospective experiments on specific datasets (no randomized or quasi-experimental deployment), with no reported statistical significance testing, limited description of dataset sizes or variability, and no real-world field validation. Methods Rigormedium — The approach combines a plausible ML forecasting module with a prescriptive optimization layer and evaluates multiple relevant metrics versus baselines, which is methodologically sound. But the paper (as summarized) omits key rigor details: dataset sizes and splits, cross-validation/hyperparameter tuning procedures, statistical tests for differences, sensitivity analyses (e.g., demand regime shifts, noisy inputs), and computational/solution-time analysis for the MILP at scale—limiting confidence in robustness and reproducibility. SampleExperimental evaluation on proprietary/public textile sales and PPE supply-chain datasets (reported as a 'combined dataset'); training of an LSTM demand-forecasting model with embedding-based contextual features, followed by a MILP optimization layer for replenishment/allocation. Reported metrics are MAE, RMSE, MAPE for forecasts and inventory cost, stockouts, and service level for operations. Exact sample sizes, time spans, SKU counts, train/validation/test splits, and data provenance are not reported in the summary. Themesproductivity adoption GeneralizabilityEvaluated only on textile and PPE datasets — results may not transfer to other industries or demand patterns, Retrospective/benchmarked experiments — no field deployment or randomized trial to show real-world operational effects, Lack of reported robustness checks for demand regime shifts, data sparsity, or noisy/corrupted inputs, Scalability and solve-times for the MILP not reported — may not generalize to very large or more complex supply networks, Framework requires accurate cost/lead-time parameters and contextual features that may be unavailable or unreliable in some firms

Claims (14)

Claim	Direction	Confidence	Outcome	Details
The paper proposes a Hybrid AI Framework for Demand-Supply Forecasting and Optimization (HAF-DS), which integrates a Long Short-Term Memory (LSTM)-based demand forecasting module with a mixed integer linear programming (MILP) optimization layer. Other	positive	high	design/architecture integration of forecasting and optimization	0.18
The LSTM captures temporal and contextual demand dependencies. Other	positive	high	ability to capture temporal and contextual demand dependencies	0.18
The optimization layer prescribes cost-efficient replenishment and allocation decisions (MILP). Organizational Efficiency	positive	high	cost-efficient replenishment and allocation decisions	0.18
The framework jointly minimizes forecasting error and operational cost through embedding-based feature representation and recurrent neural architectures. Organizational Efficiency	positive	high	combined forecasting error and operational cost	0.18
Experiments on textile sales and supply chain datasets show significant performance gains over statistical and deep learning baselines. Output Quality	positive	high	forecasting performance relative to baselines	0.18
On the combined dataset, HAF-DS reduced Mean Absolute Error (MAE) from 15.04 to 12.83 (14.7%). Output Quality	positive	high	Mean Absolute Error (MAE)	MAE reduced from 15.04 to 12.83 (14.7%) 0.18
On the combined dataset, HAF-DS reduced Root Mean Squared Error (RMSE) from 19.53 to 17.11 (12.4%). Output Quality	positive	high	Root Mean Squared Error (RMSE)	RMSE reduced from 19.53 to 17.11 (12.4%) 0.18
On the combined dataset, HAF-DS reduced Mean Absolute Percentage Error (MAPE) from 9.5% to 8.1%. Output Quality	positive	high	Mean Absolute Percentage Error (MAPE)	MAPE reduced from 9.5% to 8.1% 0.18
Inventory cost decreased by 5.4%. Organizational Efficiency	positive	high	inventory cost	Inventory cost decreased by 5.4% 0.18
Stockouts decreased by 27.5%. Organizational Efficiency	positive	high	stockout frequency	Stockouts decreased by 27.5% 0.18
Service level rose from 95.5% to 97.8%. Organizational Efficiency	positive	high	service level (fill rate / on-time fulfillment)	Service level rose from 95.5% to 97.8% 0.18
Coupling predictive forecasting with prescriptive optimization enhances both accuracy and efficiency in textile and PPE supply chains. Organizational Efficiency	positive	high	forecast accuracy and operational efficiency	0.18
HAF-DS provides a scalable and adaptable solution for modern textile and PPE supply chains. Adoption Rate	positive	high	scalability and adaptability of the solution	0.09
Traditional forecasting and optimization approaches often operate in isolation, limiting their real-world effectiveness in volatile-demand, uncertain-supply industries. Organizational Efficiency	negative	high	effectiveness of isolated forecasting/optimization approaches	0.09

Coupling LSTM demand forecasts with MILP optimization meaningfully improves supply‑chain performance: forecasting errors fall ~12–15%, inventory cost drops 5.4%, stockouts decline 27.5% and service levels rise to 97.8% in textile and PPE datasets.