Statistical Approaches to Combining Models and Observations
Continual improvements in both computational assets and observational data are revolutionizing science and engineering. However, models, computations, and observations are subject to a variety of sources of uncertainty, mandating the need for quantification and management of uncertainty. Bayesian hierarchical modeling is a framework for combining diverse datasets, mechanistic and statistical models, and computation in a fashion that manages uncertainty (see, for example, [1,5]).
Hierarchical probability models are sequences of conditional distributions that correspond to a joint distribution. Let
where
The data model
that is,
The process model offers the opportunity to incorporate scientific modeling of the quantities of interest. Often, we formulate models from underlying differential equations or discretized versions of them [4,10]. For our wind–pressure example, the geostrophic approximation suggests that winds are proportional to the gradient of the pressure field. We can incorporate this notion in a stochastic geostrophic approximation,
where
Analysis of Bayesian hierarchical modeling is often compute-intensive. Such advances as Markov chain Monte Carlo, sequential Bayes, and particle filtering have made serious BHM applications possible (e.g., [7]). However, use of process models requiring runs of large-scale, supercomputer models for single iterations of a Monte Carlo Bayesian calculation are typically feasible. This suggests the need for approaches that can incorporate ensembles from large models. Let
First, consider modeling
Next, we can use model output to formulate a process model prior in a variety of ways. Much of the literature in the design and analysis of computer experiments (e.g., [9]) begins with a Gaussian process model for model output:
References
[1] L.M. Berliner, Physical–statistical modeling in geophysics, J. Geophy. Res., 108:D24 (2003), 1–10, doi: 10.1029/2002JD002865.
[2] L.M. Berliner and Y. Kim, Bayesian design and analysis for superensemble based climate forecasting, J. Climate, 21 (2008), 1891–1910.
[3] L.M. Berliner, R.A. Levine, and D.J. Shea, Bayesian climate change assessment, J. Climate, 13 (2000), 3805–3820.
[4] L.M. Berliner, R.F. Milliff, and C.K. Wikle, Bayesian hierarchical modeling of air–sea interaction, J. Geophy. Res., 108:C4 (2003), 1–18, doi:10.1029/ 2002JC001413.
[5] N. Cressie and C.K. Wikle, Statistics for Spatio-Temporal Data, John Wiley & Sons, Hoboken, New Jersey, 2011.
[6] D. Higdon, M.C. Kennedy, J. Cavendish, J. Cafeo, and R.D. Ryne, Combining data and computer simulations for calibration and prediction, SIAM J. Sci. Comput., 26:2 (2004), 448–466.
[7] C.P. Robert and G. Casella, Monte Carlo Statistical Methods, 2nd ed., Springer, New York, 2004.
[8] J.A. Royle, L.M. Berliner, C.K. Wikle, and R.F. Milliff, “A hierarchical spatial model for constructing surface winds from scatterometer data in the Labrador Sea,” in Case Studies in Bayesian Statistics IV, C. Gatsonis, R.E. Kass, A. Cariquiry, and B. Carlin, eds., Springer, New York, 1999.
[9] T.J. Santner, B.J. Williams, and W.I. Notz, The Design and Analysis of Computer Experiments, Springer, New York, 2003.
[10] C.K. Wikle, R.F. Milliff, D. Nychka, and L.M. Berliner, Spatiotemporal hierarchical Bayesian blending of tropical ocean surface wind data, J. Amer. Statist. Assoc., 96 (2001), 382–397.
About the Author
L. Mark Berliner
Professor, Ohio State University
L. Mark Berliner is a professor and chair of the Department of Statistics at Ohio State University.
Stay Up-to-Date with Email Alerts
Sign up for our monthly newsletter and emails about other topics of your choosing.