1 / 26

Bayesian Methods for Regional Eutrophication Models

This workshop review from June 16-18, 2004 in Old Town Alexandria, VA, explored Bayesian methods for linking environmental stressors to biological responses, emphasizing model prediction uncertainty. The session covered classification and regression trees (CART), Bayesian CART (BCART), and Bayesian Hierarchical Models for regional eutrophication studies, aiming to identify stressor-response models. The approach utilized tree-based methods to select subsets of predictors and fit linear models to quantify uncertainty. Preliminary findings showcased the effectiveness of Bayesian Treed models in understanding eutrophication dynamics.

magan
Download Presentation

Bayesian Methods for Regional Eutrophication Models

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. STAR Progress Review Workshop Old Town Alexandria, VA June 16-18, 2004 Bayesian Methods for Regional Eutrophication Models E. Conrad Lamon III Dept. of Environmental Studies Louisiana State University and Craig A. Stow Department of Environmental Health Sciences University of South Carolina

  2. Overview • Goals and Objectives • Approach • Preliminary Findings • Significance • Next Steps

  3. Goals and Objectives • Use modern classification and regression trees and hierarchical Bayesian techniques to link multiple environmental stressors to biological responses and quantify uncertainty in model predictions and parameters.

  4. Guidance for TMDL model selection (NRC 2001) • report prediction uncertainty • be consistent with the amount of data available • flexible enough to permit updates and improvements

  5. Approach

  6. Approach Tree based methods • are a flexible approach useful for variable subset selection • when the analyst suspects global non-linearity • and cannot (or does not want to) specify the functional form of possible interactions a priori.

  7. Example tree Approach

  8. Approach Methods • Classification And Regression Trees (CART), • it’s Bayesian analogue, BCART • a recently developed enhancement to the BCART procedure, which includes BCART as a model subclass, known as Bayesian Treed (BTREED) models, and • Bayesian Hierarchical Models

  9. Approach BCART and BTREED Models • Will be used with the EPA Nutrient Criteria Database to identify and estimate regional eutrophication stressor – response models for EPA STAR funded research. • Lamon and Stow, 2004, Water Research, 38(11): 2764-2774.

  10. Approach Bayesian Treed models • Bayesian Hierarchical model to: • Select subsets on XXs • Fit linear models to these subsets Xs • Tree structured models • “ANOVA in Reverse” • “Leaves” contain linear models, not just a mean (like in CART models)

  11. Approach Bayesian Treed modelspecification y|x, with x = (x1, x2, …, xp), where p = number of predictor variables. two components of model 1. tree T with b bottom nodes, 2. parameter vector q = (q1, q2,…, qb), where qi is associated with the ith bottom node. If x is in the ith node, then y|x = f(y|qi), where f is a parametric family indexed by qi.

  12. Approach Bayesian Treed model specification (cont.) Tree is fully specified by (q, T) need a prior, p(q, T). Because q indexes a parametric model for each T, we can use Bayes theorem such that p(q, T) = p(q | T)p(T). So, specify prior in two stages: 1 – on the tree space, p(T), and 2 – on the distribution of Y at the bottom nodes, conditional on T, p(q | T).

  13. Approach Bayesian Treed model search • MCMC used to stochastically search for high posterior probability trees T. • Metropolis –Hastings algorithm simulates a Markov chain with limiting distribution p(T|Y,X) • Chipman, George and McColloch, 2000, JASA. http://gsbwww.uchicago.edu/fac/robert.mcculloch/research/papers/index.html

  14. Approach Data • Response variables may be • either continuous (such as biological indices of abundance) or • discrete (such as designated use attainment classes). EPA NES example: response variable is lake-wide, summer average log10 Chlorophyll a concentration.

  15. Approach Data Predictor variables in tree based methods may also be continuous or discrete, and may include : source agency, basin, sub-watersheds, states, EPA regions, latitude and longitude, and many continuous predictors related to water chemistry, water use, discharges or pollutant loading.

  16. Approach Data For the EPA NES example, Latitude and Longitude were used in the tree portion, and log10Qin, log10 Z Log10 tw In-lake log10 TP In-lake log10 TN For the linear model within each bottom node (leaf)

  17. Preliminary Findings RD-83088701-0 Lamon, E.C., and C.A. Stow, 2004. Bayesian Methods for regional-scale eutrophication models, Water Research, 38(11): 2764-2774.

  18. Preliminary Findings log10(Chla) ~ latitude + longitude 0.432 0.900 1.210 0.734 1.600 1.140 1.240 0.870 1.480 0.798 0.796 0.998 0.925 1.420 MSE=0.1092 14 leaves

  19. log10(Chla)~TP+TN+Lat+Lon+Qin+Z+tw Preliminary Findings TP < 0.0395 | TN < 2.175 TP < 0.0145 Lon < TN < 0.832 - 110.88 TN < 0.815 Z < 0.0495 0.5869 0.2547 0.7827 1.0290 Z < 7.997 Qin < 4.018 0.9766 1.2480 1.2520 0.9045 MSE=0.0802 10 leaves 1.8810 1.3360

  20. Preliminary Findings Results • Bayesian Treed Model

  21. BTREED Model – partitions (T) Preliminary Findings MSE=0.074 4 leaves

  22. Region Int. log10Qin log10Z log10tw In-lake log10TP In-lake log10TN MSE n SW 0.02166 0.0210 (0.0209) -0.0851 -0.0691 (0.0927) -0.4044 -0.4390 (0.1426) 0.1745 0.2107 (0.1534) 0.3319 0.3280 (0.1036) 0.3568 0.4012 (0.1957) 0.027 48 NW -0.0116 -0.0117 (0.0068) 0.0228 0.0241 (0.0478) -0.2752 -0.2763 (0.0647) 0.1074 0.1091 (0.0678) 0.3523 0.3528 (0.0553) 0.3440 0.3449 (0.0715) 0.095 289 SE 0.0290 0.0299 (0.0090) -0.0870 -0.0845 (0.0526) 0.0312 0.0385* (0.0734) 0.3317 0.3325 (0.0862) 0.3787 0.3683 (0.0772) 0.7996 0.8456 (0.1514) 0.037 99 NE 0.0642 0.0653 (0.0111) 0.0169 0.0222 (0.0734) -0.3225 -0.3306 (0.0997) 0.4172 0.4275 (0.0854) 0.7334 0.7398 (0.0653) -0.1586 -0.1665 (0.1048) 0.073 164 total 0.074 600 Preliminary Findings

  23. logQin logZ logtw logTP logTN S W n=48 0.6 0.6 0.6 0.6 0.6 0.4 0.4 0.4 0.4 0.4 0.2 0.2 0.2 0.2 0.2 0.0 0.0 0.0 0.0 0.0 partres partres partres partres partres -0.2 -0.2 -0.2 -0.2 -0.2 -0.6 -0.6 -0.6 -0.6 -0.6 -0.6 -0.2 0.2 0.4 0.6 -0.6 -0.2 0.2 0.4 0.6 -0.6 -0.2 0.2 0.4 0.6 -0.6 -0.2 0.2 0.4 0.6 -0.6 -0.2 0.2 0.4 0.6 logQin logZ logTw logTP logTN N W n=289 0.6 0.6 0.6 0.6 0.6 0.4 0.4 0.4 0.4 0.4 0.2 0.2 0.2 0.2 0.2 0.0 0.0 0.0 0.0 0.0 partres partres partres partres partres -0.2 -0.2 -0.2 -0.2 -0.2 -0.6 -0.6 -0.6 -0.6 -0.6 -0.6 -0.2 0.2 0.4 0.6 -0.6 -0.2 0.2 0.4 0.6 -0.6 -0.2 0.2 0.4 0.6 -0.6 -0.2 0.2 0.4 0.6 -0.6 -0.2 0.2 0.4 0.6 logQin logZ logTw logTP logTN S E n=99 0.6 0.6 0.6 0.6 0.6 Log10(Chla) 0.4 0.4 0.4 0.4 0.4 0.2 0.2 0.2 0.2 0.2 0.0 0.0 0.0 0.0 0.0 partres partres partres partres -0.2 -0.2 -0.2 -0.2 -0.2 -0.6 -0.6 -0.6 -0.6 -0.6 -0.6 -0.2 0.2 0.4 0.6 -0.6 -0.2 0.2 0.4 0.6 -0.6 -0.2 0.2 0.4 0.6 -0.6 -0.2 0.2 0.4 0.6 -0.6 -0.2 0.2 0.4 0.6 logQin logZ logTw logTP logTN 0.6 0.6 0.6 0.6 0.4 0.4 0.4 0.4 0.2 0.2 0.2 0.2 0.0 0.0 0.0 0.0 partres partres partres partres -0.2 -0.2 -0.2 -0.2 -0.6 -0.6 -0.6 -0.6 -0.6 -0.2 0.2 0.4 0.6 -0.6 -0.2 0.2 0.4 0.6 -0.6 -0.2 0.2 0.4 0.6 -0.6 -0.2 0.2 0.4 0.6 slope= 0.356836 slope= -0.085098 slope= -0.404363 slope= 0.174524 slope= 0.331859 slope= 0.107354 slope= 0.352297 slope= 0.022849 slope= 0.343962 slope= -0.275202 slope= -0.086991 slope= 0.031206 slope= 0.331706 slope= 0.378674 slope= 0.799635 N E n=164 0.6 0.4 0.2 0.0 partres -0.2 slope= 0.733359 slope= 0.016941 slope= -0.322522 slope= 0.4172 slope= -0.158648 Preliminary Findings -0.6 -0.6 -0.2 0.2 0.4 0.6

  24. Next Steps

  25. Next Steps • More predictor variables • Apply these methods to the Nutrient Criteria Database • Use resultant tree structures to identify important hierarchical structure • Explore these structures with other Hierarchical Bayesian methods • Non-linear specification? Spline basis functions in leaf model or inclusion of all predictors in tree • Tools

  26. Thanks! RD-83088701-0 • EPA STAR program for funding. • Hugh Chipman, Univ. of Waterloo and Robert McColloch, University of Chicago for BCART/BTREED computer code.

More Related