1 / 9

Quantifying Uncertainty

Learn about quantifying uncertainty in statistics through two approaches: statistical theory and bootstrapping. Statistical theory involves uncorrelated and correlated methods, while bootstrapping utilizes resampling to estimate unknown probability distributions. Explore significance statistics, confidence intervals, and the principles behind the bootstrap method, all essential for estimating uncertainty in data analysis. Discover how to calculate uncertainty measures like bias, prediction error, and confidence intervals using these techniques. Practice with real-world data examples to better understand and apply these concepts.

baumgarten
Download Presentation

Quantifying Uncertainty

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Quantifying Uncertainty Two approaches • Use statistical theory • Bootstrapping

  2. Statistical Theory Uncorrelated Correlated Effective sample size

  3. Significance Statistics • Significance statistics use the standard error: • Confidence Intervals:

  4. Bootstrapping • Motivated by the absence of equations for other accuracy measures (bias, prediction error, confidence intervals) for statistics of interest (correlation, regressions, ACF) • Definition: “The bootstrap is a data-based simulation method for statistical inference.” • Principle: resample with replacement from data. After Efron and Tibshirani, An Introduction to the Bootstrap, 1993

  5. BOOTSTRAP WORLD F * x* = {x*1, x *2, …, x *n} REAL WORLD F x = {x1, x2, …, xn} Empirical Distribution Bootstrap Sample Bootstrap Replication Bootstrapping Unknown Probability Distribution Observed Random Sample Sampling with replacement Statistic of Interest After Efron and Tibshirani, An Introduction to the Bootstrap, 1993

  6. Hillsborough River at Zephyr Hills, September flows Mean = 8621 mgal S = 8194 mgal N = 31 Uncertainty on estimates of the mean One and two standard errors 95% CI and interquartile range from 500 bootstrap samples Millions of gallons

  7. Box-Cox Normality Plot for Monthly September Flows on Alafia R. Using Kolmogorov-Smirnov (KS) Statistic Peak at  = -0.39 What is the range of uncertainty on this?

  8. Example for the ks.test • How? • Produce 500 new datasets (x*) of the same length as x by sampling with replacement from x • Find the optimal  value for each • Determine the 10th and 90th percentiles to cover 80% of the  values calculated.

  9. Look back at the original plot and verify that the original “optimal” value was at the far left of the broad top, which is reflected in this confidence interval. 80% confidence interval 10% 50% 90% -0.425 -0.250 -0.068

More Related