Enhancing Geophysical Modeling with Bayesian Hierarchical Techniques
390 likes | 520 Vues
This presentation explores the integration of various information sources in geophysical modeling through Bayesian hierarchical models. It aims to develop probability distributions for unknown terms of interest by combining observations, theoretical insights, and computer model outputs. Key approaches include modeling priors from data and utilizing Bayesian inference to estimate posterior distributions. Various geophysical examples will be discussed, including applications in glacial dynamics and climate proxies, illustrating how Bayesian models can enhance forecasting and understanding of complex systems.
Enhancing Geophysical Modeling with Bayesian Hierarchical Techniques
E N D
Presentation Transcript
Combining Observations and Models: A Bayesian ViewMark Berliner, OSU Stat Dept • Bayesian Hierarchical Models • Selected Approaches • Geophysical Examples • Discussion
Main Themes Goal: Develop probability distributions for unknowns of interest by combining information sources: Observations, theory, computer model output, past experience, etc. Approaches: Bayesian Hierarchical Models Incorporate various information sources by modeling priors data model or likelihoods
Bayesian Hierarchical Models • Skeleton: • Data Model: [ Y | X , q ] • Process Model Prior: [ X | q ] • Prior on parameters: [ q ] • Bayes’ Theorem: posterior distribution: [ X , q | Y] • Compare to “Statistics”: [ Y | q ] [ q ] “Physics”: [ X | q (Y) ]
Approaches • Stochastic models incorporating science • Physical-statistical modeling (Berliner 2003 JGR) From ``F=ma'' to [ X | q ] • Qualitative use of theory (eg., Pacific SST model; Berliner et al. 2000 J. Climate) • Incorporating large-scale computer models • From model output to priors [ q ] • Model output as samples from process model prior [ X | q ] almost ! • Model output as ``observations'' (Y) • Combinations
Glacial Dynamics (Berliner et al. 2008 J. Glaciol) Steady Flow of Glaciers and Ice Sheets • Flow: gravity moderated by drag (base & sides) & ….stuff…. • Simple models: flow from geometry Data: Program for Arctic Climate Regional Assessments & Radarsat Antarctic Mapping Project • surface topography (laser altimetry) • basal topography (radar altimetry) • velocity data (interferometry)
Modeling: surface – s, thickness – H, velocity - u Physical Model • Basal Stress: t = - rgH ds/dx (+ “stuff”) • Velocities: u = ub + b0 H tn where ub = k tp+ ( rgH )-q Our Model • Basal Stress: t = - rgH ds/dx + h where h is a ``corrector process;” H, s unknown • Velocities: u = ub + b H t n + e where ub = k t p+ ( rgH )-q or a constant; b is unknown, e is a noise process
Paleoclimate (Brynjarsdóttir & Berliner 2009) Climate proxies: Tree rings, ice cores, corals, pollen, underground rock provide indirect information on climate • Inverse problem: proxy f(climate) Boreholes: Earth stores info on surface temp’s • Model: Heat equation Borehole data f(surface temp’s) • Infer boundary condition (initial cond. is nuisance)
Modeling Y r h • Data Model: Y | Tr, q ~ N( Tr + T0 1 + q R(k), s2I) true temp Adjustments for rock types, etc. • Process Model: heat equation applied to Tr with b.cond. surface temp history Th Tr | Th ,q ~ N( BTh , s2I) Th | q ~ N( 0 , s2I)
In progress: • Combining boreholes (parameters and b.cond as samples from a distribution) • Combining with other sources and proxies
Bayesian Hierarchical Models to Augment the Mediterranean Forecast System (MFS) Ralph Milliff CoRA Chris Wikle Univ. Missouri Mark Berliner Ohio State Univ. . Nadia Pinardi INGV (I'Istituto Nazionale di Geofisica e Vulcanologia) Univ. Bologna (MFS Director) Alessandro Bonazzi, Srdjan Dobricic INGV, Univ. Bologna
Bayesian Modeling in Support of Massive Forecast Models • MFS is an Ocean Model • A Boundary Condition/Forcing: Surface Winds • Approach: produce surface vector winds (SVW), for ensemble data assimilation • Exploit abundant, “good” satellite wind data (QuikSCAT) • Samples from our winds-posterior ensemble for MFS (Before us: coarse wind field (ECMWF))
“Rayleigh Friction Model” for winds (Linear Planetary Boundary Layer Equations) (neglect second order time derivative) discretize: Theory Our model
10 members selected from the Posterior Distribution (blue) BHM Ensemble Winds 10 m/s
Approaches • Stochastic models incorporating science • Physical-statistical modeling (Berliner 2003 JGR) From ``F=ma'' to [ X | q ] • Qualitative use of theory (eg., Pacific SST model; Berliner et al. 2000 J. Climate) • Incorporating large-scale computer models • From model output to priors [ q ] • Model output as samples from process model prior [ X | q ] almost ! • Model output as ``observations'' (Y) • Combinations
Part B) Information from Models • Develop prior from model output • Think of model output runs O1, … , On as samples from some distribution • Do data analysis on O’s to estimate distribution • Use result (perhaps with modifications) as a prior for X • Example: O’s are spatial fields: estimate spatial covariance function of X based on O’s. • Example: Berliner et al (2003) J. Climate
Model output as realizations of prior “trends” • Process Model PriorX = O + hq where h is “model error”, “bias”, “offset” • [ Y | X , q ] is measurement error model: Y = X + eq • Substitution yields [ Y | O , hq , q ] Y = O + hq + eq • Modeling his crucial (I have seen h set to 0)
Model output as “observations” • Data Model:[ Y, O | X , q ] ( = [ Y | X, q ] [ O | X , q]) • [ O | X , q ]to include “bias, offset, ..” • Previous approach: start by constructing [ X | O , q ] This approach: construct [ O | X , q] • Model for “bias” a challenge in both cases • This is not uncommon, though not always made clear
A Bayesian Approach to Multi-model Analysis and Climate Projection (Berliner and Kim 2008, J Climate) Climate Projection: • Future climate depends on future, but unknown, inputs. • IPCC: construct plausible future inputs, “SRES Scenarios” (CO2 etc.) • Assume a scenario and get corresponding projection
Hemispheric Monthly Surface Temperatures • Observations (Y) for 1882-2001. Data Model: Gaussian with mean = true temp. & unknown variance (with a change-point) • Two models (O): PCM (n=4), CCSM (n=1) for 2002-2197, and 3 SRES scenarios (B1,A1B,A2). Data Model: assumes O’s are Gaussian with mean = bt + model biast (different for the two models) and unknown, time-varying variances (different for the two models) • All are assumed conditionally independent
Notes (Freeze time) • Data model for kth ensemble member from Model j: Ojk = b + bj + ejk • b is common to both Models • bj is Model j bias • E( ejk ) = 0 and variances of e’s depend on j • Computer model model: b = X + e where E(e) = 0 • Priors for biases, variances, and X • Extensions to different model classes (more b’s) and richer models are feasible.
Discussion: Which approach is best? • Depends on form and quality of observations and models and practicality • Develop prior for X from scientific model (part A)offers strong incorporation of theory, but practical limits on richness of [ X | q ] may arise • Model output as “observations” • Combining models: Just like different measuring devices; • Nice for analysis & mixed (obs’ & comp.) design • Need a prior [ X| q ] • Model output as realizations of prior “trends” • Most common among Bayesian statisticians • Combining models: like combining experts
Discussion: Models versus Reality Need for modeling differences between X’s and O’s. Model “assessment” (“validation”, “verification”) helps, but is difficult in complicated settings: • Global climate models. Virtually no observations at the scales of the models. • Tuning. Modify model based on observations. • Observations are imperfect, and are often output of other physical models. • Massive data. Comparing space-time fields
Discussion, Cont’d • Part C) Combining approaches • Example: Wikle et al 2001, JASA. Combined observations and large-scale model output as data with a prior based on some physics • Usually, many physical models. No best one, so it’s nice to be flexible in incorporating their information Thank You!