Feeding MCMC-drawn parameter posterior samples into deterministic neural surrogates reproduces model uncertainty at a fraction of the computational cost and without training on implausible parameterizations. The architecture-agnostic method includes theoretical bounds tying performance loss to distribution mismatch, giving practitioners a practical diagnostic for retraining or reweighting.
Neural networks are a commonly used approach to replace physical models with computationally cheap surrogates. Parametric uncertainty quantification can be included in training, assuming that an accurate prior distribution of the model parameters is available. Here we study the common opposite situation, where direct screening or random sampling of model parameters leads to exhaustive training times and evaluations at unphysical parameter values. Our solution is to decouple uncertainty quantification from network architecture. Instead of sampling network weights, we introduce the model-parameter distribution as an input to network training via Markov chain Monte Carlo (MCMC). In this way, the surrogate achieves the same uncertainty quantification as the underlying physical model, but with substantially reduced computation time. The approach is fully agnostic with respect to the neural network choice. In our examples, we present a quantile emulator for prediction and a novel autoencoder-based ODE network emulator that can flexibly estimate different trajectory paths corresponding to different ODE model parameters. Moreover, we present a mathematical analysis that provides a transparent way to relate potential performance loss to measurable distribution mismatch.
Summary
Main Finding
Rather than embedding parametric uncertainty inside a neural network (e.g., Bayesian NN or weight sampling), this paper treats the model-parameter distribution itself as an explicit input to the surrogate by drawing parameter samples with MCMC during training. That decoupling of uncertainty quantification from network architecture yields surrogates that reproduce the original model’s uncertainty quantification while dramatically reducing computation and avoiding training on unphysical parameter values. The approach is network-architecture agnostic and is illustrated with a quantile emulator and an autoencoder-based ODE trajectory emulator. A mathematical analysis relates any potential performance loss to measurable distribution mismatch.
Key Points
- Problem: standard approaches to parametric uncertainty in surrogates require sampling (or placing priors on) network weights or exhaustively sampling parameter space; this can be computationally prohibitive and may force evaluation at unphysical parameter settings.
- Solution: feed the model-parameter distribution into training via MCMC sampling of model parameters; train a deterministic neural surrogate conditioned on those parameter samples.
- Advantage: surrogate retains the underlying physical model’s uncertainty quantification (by sampling the parameter posterior and evaluating the cheap surrogate) while reducing training/evaluation cost and avoiding unphysical samples.
- Architecture-agnostic: method works with any neural-network architecture because uncertainty is handled through the parameter samples rather than specialized Bayesian layers.
- Examples provided: a quantile emulator for prediction tasks, and a novel autoencoder-based ODE network that maps parameter values to different trajectory paths.
- Theory: authors provide an analysis bounding or relating expected performance loss to distribution mismatch between the sampled (training) parameter distribution and the target distribution — offering a measurable diagnostic for when performance may degrade.
- Practical requirement: access to (or ability to compute) MCMC samples from the model-parameter distribution; the approach shifts some computational burden to MCMC but avoids repeated expensive forward-model evaluations during many-network-sample strategies.
Data & Methods
- Parameter sampling: Markov chain Monte Carlo (MCMC) is used to draw samples from the model-parameter distribution/posterior. Those samples constitute the uncertainty input to the surrogate.
- Surrogate training: a deterministic neural network is trained conditionally on parameter samples (i.e., the network input includes parameter vectors or a representation thereof). Because the network is deterministic, uncertainty is recovered by repeatedly evaluating the trained network on MCMC parameter draws.
- Emulators demonstrated:
- Quantile emulator: network trained to predict conditional quantiles across parameter draws, enabling probabilistic prediction without Bayesian NN training.
- Autoencoder-based ODE emulator: an encoder/decoder architecture that learns a latent representation of ODE trajectories and maps parameter values to corresponding latent trajectories, enabling flexible generation of different solution paths conditioned on parameters.
- Theoretical component: mathematical derivations showing how a discrepancy between the distribution used for training (MCMC draws) and the true parameter distribution produces a quantifiable performance loss; provides diagnostics to assess when retraining or reweighting is needed.
- Computational evaluation: empirical examples (not detailed here) demonstrate substantial reductions in wall-clock training/evaluation time relative to approaches that sample network weights or exhaustively explore parameter grids.
Implications for AI Economics
- Faster policy and scenario analysis: economic models that rely on expensive simulation (e.g., large-scale DSGE, heterogeneous-agent, or agent-based models) can be replaced by cheap surrogates that preserve parameter uncertainty, enabling far more extensive counterfactual and policy-space exploration.
- Scalable uncertainty quantification: decoupling UQ from NN architecture lets practitioners use high-performance deterministic networks while still propagating parameter uncertainty transparently via MCMC samples—useful for structural estimation, forecasting with parameter uncertainty, and stress testing.
- Avoids unphysical training regimes: by sampling from a calibrated posterior or realistic parameter distribution, training focuses on economically relevant parameter regions and avoids wasted computation on implausible parameterizations.
- Diagnostic for distribution shifts: the provided theoretical link between distribution mismatch and performance loss gives economists a practical tool to detect when a surrogate trained on one parameter distribution will underperform after recalibration or policy changes—important for model updating and transfer.
- Trade-offs and limitations: requires reliable MCMC sampling (which can be costly for high-dimensional parameter spaces) and assumes the sampled parameter distribution adequately represents relevant uncertainty; if model misspecification or extreme extrapolation is needed, surrogate performance may degrade and retraining or reweighting will be necessary.
- Practical adoption: this approach lowers the technical barrier for adopting surrogates in economics because it removes dependence on specialized Bayesian NN techniques while preserving rigorous UQ—potentially accelerating research workflows and real-time policy decision support.
Assessment
Claims (10)
| Claim | Direction | Confidence | Outcome | Details |
|---|---|---|---|---|
| Training a deterministic neural surrogate conditioned on MCMC-drawn parameter samples reproduces the original (forward) model's uncertainty quantification while avoiding embedding parametric uncertainty inside the network weights. Decision Quality | positive | medium | fidelity of uncertainty quantification / posterior predictive distributions produced by the surrogate versus the original model |
reproduction of original model's uncertainty via MCMC-conditioned deterministic surrogate (reported)
0.01
|
| This approach dramatically reduces computation (training and/or evaluation wall-clock time) compared to approaches that sample network weights (Bayesian NNs) or exhaustively explore parameter grids. Organizational Efficiency | positive | medium | wall-clock training time and evaluation time |
substantial reductions in wall-clock training/evaluation time vs. Bayesian NNs or parameter grids (reported)
0.01
|
| By sampling training parameter vectors from a calibrated posterior (via MCMC), the surrogate avoids training on unphysical or implausible parameter configurations. Training Effectiveness | positive | high | proportion of training samples that fall in implausible/unphysical parameter regions (conceptual / procedural outcome) |
reduced proportion of implausible/unphysical training samples (procedural claim)
0.02
|
| The method is architecture-agnostic: uncertainty handling via parameter samples allows use of any deterministic neural-network architecture (e.g., quantile regressors, autoencoders) without specialized Bayesian layers. Adoption Rate | positive | high | applicability across network architectures (demonstrated via example implementations) |
architecture-agnostic applicability (qualitative demonstration)
0.02
|
| A quantile emulator trained conditional on MCMC parameter draws can produce conditional quantile predictions without training a Bayesian neural network. Output Quality | positive | high | accuracy of predicted conditional quantiles |
conditional quantile predictions produced without Bayesian NN (reported)
0.02
|
| An autoencoder-based ODE emulator that maps parameter values to latent trajectories can flexibly generate different solution paths conditioned on parameters. Output Quality | positive | high | ability to reconstruct/generate ODE solution trajectories conditioned on parameter inputs (e.g., reconstruction error/time series fidelity) |
ability to generate/reconstruct ODE trajectories conditioned on parameters (reported)
0.02
|
| A mathematical analysis bounds or relates expected performance loss of the surrogate to measurable distribution mismatch between the training parameter distribution (samples) and the target parameter distribution. Other | negative | high | expected performance loss (e.g., increase in predictive loss) as a function of distributional divergence/mismatch |
theoretical bound relating expected performance loss to distribution mismatch (qualitative/theoretical)
0.02
|
| The approach shifts some computational burden to obtaining MCMC samples of the parameter posterior, requiring access to (or ability to compute) MCMC samples before surrogate training. Organizational Efficiency | null_result | high | need for and cost of MCMC sampling (computational requirement) |
practical requirement/cost of obtaining MCMC samples (procedural)
0.02
|
| The theoretical diagnostic (linking distribution mismatch to performance loss) gives practitioners a practical tool to detect when a surrogate trained on one parameter distribution will underperform after recalibration or policy changes. Decision Quality | positive | medium | diagnostic effectiveness in detecting performance degradation under distribution shift (conceptual / empirical) |
diagnostic tool for detecting surrogate underperformance under distribution shift (theoretical/diagnostic claim)
0.01
|
| The method lowers the technical barrier for adopting surrogates in economics by removing dependence on specialized Bayesian neural-network techniques while preserving rigorous uncertainty quantification. Adoption Rate | positive | medium | ease of adoption / reduction in technical complexity required to obtain UQ-preserving surrogates (qualitative outcome) |
lowers technical barrier for surrogate adoption in economics (qualitative)
0.01
|