Feeding MCMC-drawn parameter posterior samples into deterministic neural surrogates reproduces model uncertainty at a fraction of the computational cost and without training on implausible parameterizations. The architecture-agnostic method includes theoretical bounds tying performance loss to distribution mismatch, giving practitioners a practical diagnostic for retraining or reweighting.

MCMC Informed Neural Emulators for Uncertainty Quantification in Dynamical Systems

Heikki Haario, Zhi-Song Liu, Martin Simon, Hendrik Weichel · March 11, 2026

arxiv other n/a evidence 7/10 relevance Source PDF

Train deterministic neural surrogates conditioned on MCMC-drawn parameter posterior samples to cheaply reproduce a model's parameter uncertainty, avoiding Bayesian NNs and unphysical training samples while providing diagnostics for distribution mismatch.

Neural networks are a commonly used approach to replace physical models with computationally cheap surrogates. Parametric uncertainty quantification can be included in training, assuming that an accurate prior distribution of the model parameters is available. Here we study the common opposite situation, where direct screening or random sampling of model parameters leads to exhaustive training times and evaluations at unphysical parameter values. Our solution is to decouple uncertainty quantification from network architecture. Instead of sampling network weights, we introduce the model-parameter distribution as an input to network training via Markov chain Monte Carlo (MCMC). In this way, the surrogate achieves the same uncertainty quantification as the underlying physical model, but with substantially reduced computation time. The approach is fully agnostic with respect to the neural network choice. In our examples, we present a quantile emulator for prediction and a novel autoencoder-based ODE network emulator that can flexibly estimate different trajectory paths corresponding to different ODE model parameters. Moreover, we present a mathematical analysis that provides a transparent way to relate potential performance loss to measurable distribution mismatch.

Summary

Main Finding

Rather than embedding parametric uncertainty inside a neural network (e.g., Bayesian NN or weight sampling), this paper treats the model-parameter distribution itself as an explicit input to the surrogate by drawing parameter samples with MCMC during training. That decoupling of uncertainty quantification from network architecture yields surrogates that reproduce the original model’s uncertainty quantification while dramatically reducing computation and avoiding training on unphysical parameter values. The approach is network-architecture agnostic and is illustrated with a quantile emulator and an autoencoder-based ODE trajectory emulator. A mathematical analysis relates any potential performance loss to measurable distribution mismatch.

Key Points

Problem: standard approaches to parametric uncertainty in surrogates require sampling (or placing priors on) network weights or exhaustively sampling parameter space; this can be computationally prohibitive and may force evaluation at unphysical parameter settings.
Solution: feed the model-parameter distribution into training via MCMC sampling of model parameters; train a deterministic neural surrogate conditioned on those parameter samples.
Advantage: surrogate retains the underlying physical model’s uncertainty quantification (by sampling the parameter posterior and evaluating the cheap surrogate) while reducing training/evaluation cost and avoiding unphysical samples.
Architecture-agnostic: method works with any neural-network architecture because uncertainty is handled through the parameter samples rather than specialized Bayesian layers.
Examples provided: a quantile emulator for prediction tasks, and a novel autoencoder-based ODE network that maps parameter values to different trajectory paths.
Theory: authors provide an analysis bounding or relating expected performance loss to distribution mismatch between the sampled (training) parameter distribution and the target distribution — offering a measurable diagnostic for when performance may degrade.
Practical requirement: access to (or ability to compute) MCMC samples from the model-parameter distribution; the approach shifts some computational burden to MCMC but avoids repeated expensive forward-model evaluations during many-network-sample strategies.

Data & Methods

Parameter sampling: Markov chain Monte Carlo (MCMC) is used to draw samples from the model-parameter distribution/posterior. Those samples constitute the uncertainty input to the surrogate.
Surrogate training: a deterministic neural network is trained conditionally on parameter samples (i.e., the network input includes parameter vectors or a representation thereof). Because the network is deterministic, uncertainty is recovered by repeatedly evaluating the trained network on MCMC parameter draws.
Emulators demonstrated:
- Quantile emulator: network trained to predict conditional quantiles across parameter draws, enabling probabilistic prediction without Bayesian NN training.
- Autoencoder-based ODE emulator: an encoder/decoder architecture that learns a latent representation of ODE trajectories and maps parameter values to corresponding latent trajectories, enabling flexible generation of different solution paths conditioned on parameters.
Theoretical component: mathematical derivations showing how a discrepancy between the distribution used for training (MCMC draws) and the true parameter distribution produces a quantifiable performance loss; provides diagnostics to assess when retraining or reweighting is needed.
Computational evaluation: empirical examples (not detailed here) demonstrate substantial reductions in wall-clock training/evaluation time relative to approaches that sample network weights or exhaustively explore parameter grids.

Implications for AI Economics

Faster policy and scenario analysis: economic models that rely on expensive simulation (e.g., large-scale DSGE, heterogeneous-agent, or agent-based models) can be replaced by cheap surrogates that preserve parameter uncertainty, enabling far more extensive counterfactual and policy-space exploration.
Scalable uncertainty quantification: decoupling UQ from NN architecture lets practitioners use high-performance deterministic networks while still propagating parameter uncertainty transparently via MCMC samples—useful for structural estimation, forecasting with parameter uncertainty, and stress testing.
Avoids unphysical training regimes: by sampling from a calibrated posterior or realistic parameter distribution, training focuses on economically relevant parameter regions and avoids wasted computation on implausible parameterizations.
Diagnostic for distribution shifts: the provided theoretical link between distribution mismatch and performance loss gives economists a practical tool to detect when a surrogate trained on one parameter distribution will underperform after recalibration or policy changes—important for model updating and transfer.
Trade-offs and limitations: requires reliable MCMC sampling (which can be costly for high-dimensional parameter spaces) and assumes the sampled parameter distribution adequately represents relevant uncertainty; if model misspecification or extreme extrapolation is needed, surrogate performance may degrade and retraining or reweighting will be necessary.
Practical adoption: this approach lowers the technical barrier for adopting surrogates in economics because it removes dependence on specialized Bayesian NN techniques while preserving rigorous UQ—potentially accelerating research workflows and real-time policy decision support.

Assessment

Paper Typeother Evidence Strengthn/a — This is a methodological / computational contribution about surrogate modeling and uncertainty quantification, not an empirical study making causal claims about economic outcomes, so conventional evidence-strength ratings for causal inference do not apply. Methods Rigorhigh — The paper combines (i) a clear algorithmic proposal (conditioning deterministic surrogates on MCMC-drawn parameter samples), (ii) empirical demonstrations (quantile emulator and autoencoder-based ODE trajectory emulator) with computational benchmarking, and (iii) theoretical analysis that bounds performance loss due to distribution mismatch; it also discusses practical requirements and diagnostics—indicating thorough methodological and theoretical treatment. Caveats: empirical illustrations appear to be on simulation/ODE examples rather than large-scale economic models, and the method's performance depends on quality and scalability of MCMC in high-dimensional parameter spaces. SampleTraining uses parameter samples drawn by MCMC from the model-parameter posterior; these samples (and corresponding model outputs) are used to train a deterministic neural network conditioned on parameter vectors. Empirical demonstrations include a quantile emulator and an autoencoder-based ODE trajectory emulator, with wall-clock comparisons against alternatives that sample network weights or exhaustively grid parameters. No large-scale DSGE/agent-based empirical dataset is reported—examples appear to be simulated models/ODE systems. Themesproductivity adoption innovation GeneralizabilityRequires access to reliable MCMC samples or ability to compute posterior draws—may be costly or intractable for very high-dimensional parameter spaces., Demonstrations are on simulated/ODE examples; performance on large-scale, computationally expensive economic models (e.g., high-dimensional DSGE or full agent-based models) is not directly validated., Method assumes the sampled parameter posterior adequately represents relevant uncertainty; severe model misspecification or need for extreme extrapolation will degrade surrogate fidelity., Surrogate accuracy depends on network capacity and training; extremely complex mapping from parameters to outputs may require larger networks or more training data., The theoretical bounds hinge on measurable distribution mismatch diagnostics; their tightness and practical utility may vary by application.

Claims (10)

Claim	Direction	Confidence	Outcome	Details
Training a deterministic neural surrogate conditioned on MCMC-drawn parameter samples reproduces the original (forward) model's uncertainty quantification while avoiding embedding parametric uncertainty inside the network weights. Decision Quality	positive	medium	fidelity of uncertainty quantification / posterior predictive distributions produced by the surrogate versus the original model	reproduction of original model's uncertainty via MCMC-conditioned deterministic surrogate (reported) 0.01
This approach dramatically reduces computation (training and/or evaluation wall-clock time) compared to approaches that sample network weights (Bayesian NNs) or exhaustively explore parameter grids. Organizational Efficiency	positive	medium	wall-clock training time and evaluation time	substantial reductions in wall-clock training/evaluation time vs. Bayesian NNs or parameter grids (reported) 0.01
By sampling training parameter vectors from a calibrated posterior (via MCMC), the surrogate avoids training on unphysical or implausible parameter configurations. Training Effectiveness	positive	high	proportion of training samples that fall in implausible/unphysical parameter regions (conceptual / procedural outcome)	reduced proportion of implausible/unphysical training samples (procedural claim) 0.02
The method is architecture-agnostic: uncertainty handling via parameter samples allows use of any deterministic neural-network architecture (e.g., quantile regressors, autoencoders) without specialized Bayesian layers. Adoption Rate	positive	high	applicability across network architectures (demonstrated via example implementations)	architecture-agnostic applicability (qualitative demonstration) 0.02
A quantile emulator trained conditional on MCMC parameter draws can produce conditional quantile predictions without training a Bayesian neural network. Output Quality	positive	high	accuracy of predicted conditional quantiles	conditional quantile predictions produced without Bayesian NN (reported) 0.02
An autoencoder-based ODE emulator that maps parameter values to latent trajectories can flexibly generate different solution paths conditioned on parameters. Output Quality	positive	high	ability to reconstruct/generate ODE solution trajectories conditioned on parameter inputs (e.g., reconstruction error/time series fidelity)	ability to generate/reconstruct ODE trajectories conditioned on parameters (reported) 0.02
A mathematical analysis bounds or relates expected performance loss of the surrogate to measurable distribution mismatch between the training parameter distribution (samples) and the target parameter distribution. Other	negative	high	expected performance loss (e.g., increase in predictive loss) as a function of distributional divergence/mismatch	theoretical bound relating expected performance loss to distribution mismatch (qualitative/theoretical) 0.02
The approach shifts some computational burden to obtaining MCMC samples of the parameter posterior, requiring access to (or ability to compute) MCMC samples before surrogate training. Organizational Efficiency	null_result	high	need for and cost of MCMC sampling (computational requirement)	practical requirement/cost of obtaining MCMC samples (procedural) 0.02
The theoretical diagnostic (linking distribution mismatch to performance loss) gives practitioners a practical tool to detect when a surrogate trained on one parameter distribution will underperform after recalibration or policy changes. Decision Quality	positive	medium	diagnostic effectiveness in detecting performance degradation under distribution shift (conceptual / empirical)	diagnostic tool for detecting surrogate underperformance under distribution shift (theoretical/diagnostic claim) 0.01
The method lowers the technical barrier for adopting surrogates in economics by removing dependence on specialized Bayesian neural-network techniques while preserving rigorous uncertainty quantification. Adoption Rate	positive	medium	ease of adoption / reduction in technical complexity required to obtain UQ-preserving surrogates (qualitative outcome)	lowers technical barrier for surrogate adoption in economics (qualitative) 0.01