1 / 19

Comparison of Stratified and Non-stratified Most Similar Neighbor Approaches

Comparison of Stratified and Non-stratified Most Similar Neighbor Approaches. Bianca N.I. Eskelson, PhD Student Department of Forest Resources, OSU Hailemariam Temesgen, Assistant Professor Department of Forest Resources, OSU Tara M. Barrett, Research Forester

buzz
Download Presentation

Comparison of Stratified and Non-stratified Most Similar Neighbor Approaches

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Comparison of Stratified and Non-stratified Most Similar Neighbor Approaches Bianca N.I. Eskelson, PhD Student Department of Forest Resources, OSU Hailemariam Temesgen, Assistant Professor Department of Forest Resources, OSU Tara M. Barrett, Research Forester USDA Forest Service, Pacific Northwest Research Station Presented at the 2006 Western Mensurationists Meeting in Fortuna, CA

  2. Outline I. Background • Data and inference for natural resource planning • Methods for handling incomplete data II. Simulation study • Objectives • Methods • Results • Summary III. Challenges and opportunities IV. References

  3. I. Background – For natural resource planning • Forested land is divided into polygons (stands, age, species, crown closure, site class, etc.) • Photos/maps/remote sensing  complete census for aerial variables (X) • 5 to 10% of land base randomly sampled  ground variables (Y)

  4. Inference for natural resource planning is improved by: • “Populating” the forested land database with detailed ground information • Acquiring data on tree and non-tree variables within and across scales • Generating tree-lists (# of stems/ha by species and diameter) for every polygon  these lists can provide: • Inputs to habitat and/or growth and yield models • Estimates of current and future stand structure

  5. Methods for handling incomplete data (after Little and Rubin 2002) 1. Complete-case analysis/Available-case analysis • Simple, but loss of information 2. Weighting procedures 3. Imputation-based approaches • Imputation = replacing missing item/unit with another item/unit that has similar characteristics 4. Likelihood or Bayesian approaches

  6. Nearest Neighbor Imputation • Retain covariance structure of data • Retain logical relationships among Y variables • Result in estimates that are within the bounds of biological reality • Not efficient for estimating attributes of ‘rare’ polygons • Not necessarily consistent or unbiased

  7. II. Objectives • Improve the estimates of stand tables by species group • Investigate use of stratification into forest type groups and age class prior to using most similar neighbor approach (MSN)

  8. Data • 1092 stands in Western Oregon (FIA data) • Collected in 2001 to 2004

  9. Stratification

  10. Most Similar Neighbor Method (MSN) • Randomly divide complete data set into reference stands (80%) and target stands (20%) • MSN approach (Moeur and Stage 1995) used to impute Y set for target stands (500 replications) • Distance measure: • Same procedure for each stratum

  11. Assessment over 500 replications • Mean of distance measure

  12. Results – Mean of distance measure • Range of mean distance smaller for all strata than for whole population except for “other conifer” stratum

  13. Results – Mean RMSE Range of mean RMSE larger for all strata compared to whole population

  14. Range of mean bias smaller only for ‘tph hardwoods’ in Douglas-fir stratum compared to whole population Results – Mean Bias

  15. Possible reasons for results • No real aerial data was used • Small data set (40% of inventory in Western Oregon)  very small data sets for strata • estimated average age used  not a good basis for stratification

  16. Summary • Stratification did not improve estimates • Imputation techniques are data intensive  large data sets required • Use of ‘estimated aerial variables’ was a bad choice • Better stratification

  17. III. Challenges and opportunities • Transformations of Y and X variables • Use of LIDAR data instead of aerial photographs to refine imputation approaches • Likelihood or Bayesian approaches • Variance estimation for NN imputed data

  18. IV. References LeMay, V. and H. Temesgen. 2005. Comparison of nearest neighbor methods for estimating basal area and stems per hectare using aerial auxiliary variables. For. Sci. 51: 109-119. Little, R.J.A. and D.B. Rubin. 2002. Statistical analysis with missing data. 2nd ed. John Wiley & Sons. NY. 381 pp. Moeur, M. and A.R. Stage. 1995. Most similar neighbor: an improved sampling inference procedure for natural resource planning. For. Sci. 41: 337-359. Särndal, C.-E. and S. Lundström. 2005. Estimation in Surveys with Nonresponse. John Wiley & Sons. USA. 199 pp. Stage, A.R. 2005. Some New Twists in the Art and Science of Imputation. Presented at the "Evaluation of quantitative techniques for deriving National scale data for assessing and mapping risk workshop”, Denver, CO, July 26-28, 2005.

  19. Questions?

More Related