1 / 83

Data-model integration: Examples from belowground ecosystem ecology Kiona Ogle

Data-model integration: Examples from belowground ecosystem ecology Kiona Ogle University of Wyoming Departments of Botany & Statistics www.uwyo.edu/oglelab. Today’s Task. What are some ecological questions to which sensor network data could be applied?

sibley
Télécharger la présentation

Data-model integration: Examples from belowground ecosystem ecology Kiona Ogle

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data-model integration: Examples from belowground ecosystem ecology Kiona Ogle University of Wyoming Departments of Botany & Statistics www.uwyo.edu/oglelab

  2. Today’s Task • What are some ecological questions to which sensor network data could be applied? • How would those data be used in models? • Overview modeling of ecological data and processes.

  3. Types of Questions • What are some ecological questions to which sensor network data could be applied? • Spatial & temporal processes • Improved ecological understanding • More accurate prediction & forecasting • Example problems • “Biogeochemical exchanges between the atmosphere & biosphere” • How do environmental perturbations affect carbon & water exchange? • Partitioning ecosystem processes & components • Linking processes & mechanisms operating at multiple temporal & spatial scales

  4. How to Address Such Questions? • Couple data and models • Sensor network data • Very rich • Real-time; large datasets; spatially extensive and/or temporally intensive • Heterogeneous • Different locations, processes, and conditions • Models & data analysis • Less appropriate: • “Classical” analyses that assume linearity and normality of data • Design-based inference about patterns • More appropriate: • Coupling of process-based models with diverse and rich datasets • Model-based inference about patterns and mechanisms

  5. Why Couple Data & Process Models? • Parameter estimation (or “model parameterization”) • Quantification of uncertainty • Improved predictions and forecasts • Decision support, management, conservation • Synthesize multiple types of data • Relate different system components to each other • Learn about important mechanisms • Hypothesis generation • Use data-informed models to generate testable hypotheses • Inform sampling and network design • Data analysis • Go beyond simple “classical” analyses • Explicit integration of multiple data types, diverse scales, and nonlinear and non-Gaussian processes

  6. Process parameters Unknown quantities Observed data Latent (or true) process Data parameters Posterior Likelihood Probabilistic process model Prior(s) How to Couple Data & Process Models? • Multiple approaches, for example: • Maximum likelihood-based models • Least squares, minimization of objective functions • Hierarchical Bayesian models • Hierarchical Bayesian approach • Recall, from Jennifer’s talk …

  7. Outline • The process model: • Types of ecological models • Building process models • Examples from belowground ecosystem ecology: • Motivating issues • Ex 1: Estimating components of soil organic matter decomposition • Ex 2: Deconvolution of soil respiration (i.e., CO2 efflux) • In both examples, highlight: • Data sources • Process models • Data-model integration • Implications of data-model integration for sensor network data & applications

  8. Process parameters Unknown quantities Observed data Latent (or true) process Data parameters Posterior Likelihood Probabilistic process model Prior(s) Hierarchical Bayesian Model Data model (likelihood) Probabilistic process model The “process model”

  9. The Process Model • Model formulation: • Explicit, mathematical eqn’s • Systems equations • State-space equations • Conceptual model: • Systems diagrams • Graphical models Inputs Outputs “Compare” Conceptual model Mathematical model Unobserved quantities (parameters) Observed quantities (data) Analytical output Observed quantities (driving variables) Numerical/ simulation output “Predict” Simulation model Unobserved or latent quantities The “process model”

  10. Types of Process Models Jorgensen (1986) Fundamentals of Ecological Modelling. 389 pp. Elsevier, Amsterdam.

  11. Upcoming Example: Soil Carbon Cycle Model

  12. Example Process Model Pools or state variables Simplified systems diagram of the soil carbon cycle in a temperate forest Flows of carbon Source: Xu et al. (2006) Global Biogeochemical Cycles Vol. 20 GB2007.

  13. Model Formulation A:matrix of flux rates or “carbon transfer coefficients” (parameters) u(t): flux of carbon into the system(e.g., photosynthetic flux) (driving variable or modeled quantity) B:vector of ‘allocation fractions’ (parameters) X:vector of state variables (unobservable latent quantities, outputs) Source: Xu et al. (2006) Global Biogeochemical Cycles Vol. 20 GB2007.

  14. Model Formulation Observable (data) Source: Xu et al. (2006) Global Biogeochemical Cycles Vol. 20 GB2007.

  15. Process parameters Unknown quantities Observed data Latent (or true) process Data parameters Posterior Likelihood Probabilistic process model Prior(s) How to Couple Data & Process Models? • Hierarchical Bayesian approach Data model (likelihood) Probabilistic process model

  16. Outline • The process model: • Types of ecological models • Building process models • Examples from belowground ecosystem ecology: • Motivating issues • Ex 1: Estimating components of soil organic matter decomposition • Ex 2: Deconvolution of soil respiration (i.e., CO2 efflux) • In both examples, highlight: • Data sources • Process models • Data-model integration • Implications of data-model integration for sensor network data & applications

  17. Ecosystem Processes Emphasis on aboveground What about belowground?

  18. Biogeochemical Cycles H20 N H20 N H20 C C P

  19. H20 N H20 N H20 C C P Biogeochemical Cycles Belowground system is critical Tightly linked to aboveground system

  20. Belowground “Issues” Aboveground • Lots of info • Easy to measure Belowground • Little info • Difficult to measure • Aboveground measurements (helpful but limited) Outstanding issues • Partitioning above- & belowground • Quantifying & partitioning belowground • Implications for ecosystem function • Examples: arid & semiarid systems Figure from Kieft et al. (1998) Ecology 79:671-683

  21. Motivating Questions: Soil Carbon Cycle • From where in the soil is CO2 coming from? • What are the relative contributions of autotrophs vs. heterotrophs? • What factors control decomposition rates & heterotrophic activity? • How does pulseprecipitationaffect sourcesof respiredCO2? • Implications ofclimate changefor desert soilcarbon cycling?

  22. Integrative Approach • Diverse data sources • Experimental & observational • Lab & field studies • Multiple scales • Varying “amounts” & “completeness” • Process-based models • Key mechanisms, processes, components • Balance detail & simplicity • Multiple scales & interactions • Statistical models: data-model integration • Hierarchical Bayesian framework • Mark chain Monte Carlo

  23. Examples Presented Today

  24. Ex 1: Soil organic matter decomposition Objectives: • Identify soil & microbial processes affecting decomposition • Learn how vegetation (i.e., microsite) controls these processes

  25. Experimental Design Mesquite shrubland in southern Arizona Microsite types: • bare ground • grass • small mesquite • big mesquite Bare ground Grass Small mesquite Big mesquite 3 cores (reps)

  26. Experimental Design ... Add water ... Add sugar + water Incubate at 25 oC CO2 Measure CO2 efflux (soil respiration rate) at 24 & 48 hours CO2 8 depths (layers) CO2

  27. Experimental Design ... Add water ... Add sugar + water Measure: Microbial biomass Soil organic carbon Soil nitrogen Incubate at 25 oC CO2 Measure CO2 efflux (soil respiration rate) at 24 & 48 hours CO2 8 depths (layers) CO2

  28. Design & Data Overview • Full-factorial design: • Microsite • 4 levels: bare, grass, small mesq, big mesq • Soil layer • 8 levels: 0-2, 2-5, ..., 40-50 cm • Substrate addition type • 2 levels: water only, sugar + water • Incubation time • 2 levels: 24, 48 hrs • Soil core or rep • 3 cores per microsite • Stochastic data: • Soil respiration rate • N = 359 (25 missing) • Microbial biomass • N = 18 (14 missing) • Soil organic carbon • N = 89 (7 missing)

  29. Some Data

  30. Analysis Objectives microbes soil C CO2 flux Estimate microbial respiration (decomposition) parameters (i.e., process parameters) ? ? Soil depth data Respiration biomass & activity Microbial biomass Carbon substrate

  31. Saturating carbon (C) Respiration (R) Low C Microbial biomass (B) Process Model: Soil Respiration Estimate microbial respiration (decomposition) parameters (i.e., process parameters) Respiration Microbial biomass Carbon substrate Michaelis-Menton type model: microbial “base-line” metabolic rate microbial carbon-use efficiency Assume Ac related to “substrate quality”:

  32. Data-Model Integration • Full-factorial design: • Microsite • Soil layer • Substrate addition type • Incubation time • Soil core or rep B C R N • Stochastic data: • Soil respiration rate • Microbial biomass • Soil organic carbon • Things to consider: • Multiple data types • Nonlinear model • Missing data • Experimental design some data some data

  33. Data Model (Likelihood) • Let LR = log(R) • For microsite m, soil depth d, soil core r, substrate-addition type s, and time period t: Mean (“truth”) (latent process) Observationprecision (= 1/variance) Observed rate

  34. Data Model (Likelihood) • Now, for the covariates... • For microsite m, soil depth d, and soil core r: • Note: the likelihoods are for both the observed and missing data Observed Mean (“truth”) (latent process) Observation precision (= 1/variance)

  35. Data Model (Likelihood) Likelihood components Data parameters Latent processes

  36. Probabilistic Process Model Latent processes Deterministic model for soil microbes & carbon contents Stochastic model for latent respiration

  37. Probabilistic Process Model Stochastic model for latent respiration Specify expected process: Michaelis-Menten (process) model

  38. Probabilistic Process Model Process components Process parameters

  39. Parameter Model (Priors) Data parameters Process parameters Conjugate, relatively non-informative priors for precision terms

  40. Parameter Model (Priors) Data parameters Process parameters Non-informative Dirichlet priors for relative distributions of microbes and carbon Multivariate version of the beta distribution (with all parameters set to 1: multidimensional uniform)

  41. Parameter Model (Priors) Data parameters Process parameters Relatively non-informative (diffuse) normal priors for the rest:

  42. The Posterior

  43. The Posterior No analytical solution for the joint posterior distribution No analytical solution for most of the marginal distributions Approximate the posterior: Markov chain Monte Carlo methods,implemented in WinBUGS

  44. Model Implementation: WinBUGS

  45. Model Goodness-of-fit

  46. Example Results C* (total soil carbon, g C/m2) B* (microbial biomass, g dw/m2) Bare Bigmesq. Med.Mesq. Grass Bare Bigmesq. Med.Mesq. Grass

  47. Example Results Bare ground Big mesquite Relative amount ofmicrobial biomass Surface Deep Surface Deep Soil depth (or layer)

  48. Sensitivity to Data Sources

  49. Ex 2: Deconvolution of Soil Respiration • From where in the soil is CO2 coming from? • What are the relative contributions of autotrophs vs. heterotrophs? • What factors control decomposition rates & heterotrophic activity? • How does pulseprecipitationaffect sourcesof respiredCO2? • Multiple datasources • lots • limited data data data data data data data data

More Related