1 / 24

Why Statistics are Scary: a personal journey

Why Statistics are Scary: a personal journey. “While nothing is more uncertain than a single life, nothing is more certain than the average duration of a thousand lives” - Elizur Wright. By: Gail Larsen MS4 2011. Background. Part of the statistical analysis for my MPH Thesis project

Download Presentation

Why Statistics are Scary: a personal journey

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Why Statistics are Scary: a personal journey “While nothing is more uncertain than a single life, nothing is more certain than the average duration of a thousand lives” -Elizur Wright By: Gail Larsen MS4 2011

  2. Background Part of the statistical analysis for my MPH Thesis project Implantable cardioverter-defibrillators (ICD) decrease mortality in appropriately selected patients However, ICD shocks have been associated with increased risk of death

  3. The Question Are shocks detrimental per se, or are shocks a marker of a sicker patient population?

  4. Why statistics are scary **Note…the following survival curves are unadjusted comparisons and do not control for covariates, which was of course done in adjusted analysis. • Two examples: • Research Question#1: Do patients that receive shocks do worse than patients that do not receive shocks? • Research Question #2: Do patients that only receive ATP (antitachycardia pacing) do worse than patients that receive shocks or patients that receive no therapy? • ATP has not been found to increase mortality

  5. Research Question# 1: Data analysis take one

  6. ??? Results seem to indicate that shocks are protective? That’s fine but this completely opposite of the expected result Let’s take a second look at the analysis

  7. Analysis Assumptions • Study endpoint = time to death or last follow-up • Follow-up time = Implant date – status date • Each patient = one observation • Shock yes/no • Dead yes/no

  8. But…it’s all time dependent **Note…this was the way this was modeled in other studies so prior analyses were appropriate. • Shock should be modeled as a time-dependent covariate. In other words, a person is not as risk from shocks until they have had their first shock. • Risk changes after the occurrence of first shock

  9. Research Question #1:Data analysis take two

  10. That’s “better” Results now consistent with what has been previously found Patients receiving shocks do worse than patients receiving no shocks

  11. Research Question #2: Data analysis take one **Note…ATP only = 2 = Shock. Sorry I didn’t make nicer graphs.

  12. ??? Results seem to indicate that ATP is harmful (even more so than shocks)…which has never been found before? Again, let’s take a second look at the analysis

  13. Again…it’s all Time Dependent Patients stratified as no therapy, ATP only, or shock in initial analysis Many patients in the shock group received ≥1 ATP episodes before their first shock episode and should be included in that group before becoming part of the shock group

  14. Research Question #2: Data Analysis Take Two

  15. That’s “better” Results now consistent with what has been previously found ATP not associated with an increased risk of death

  16. Statistics: A brief history

  17. Quick note • Probability deals with predicting the likelihood of future events • Probability theory- the variables and the initial state are known • Statistics involves the analysis of the frequency of past events • Statistics- the outcome is known, but the past causes are uncertain

  18. Schools of Statistical Thought

  19. Frequentist Inference • Relies on drawing random samples from a population • Assigns probability to a repeatable event in which the uncertainty is due to randomness • Basis for hypothesis testing and confidence intervals • The type of statistics we are used to seeing • Does not condition on the observed data

  20. Baysian Inference Chooses a probability distribution as the prior, which represents beliefs about the parameters of interest Chooses a probability distribution for the likelihood, which represents beliefs about the data Computes the posterior, which represents an update of our beliefs about the parameters after having observed the data

  21. Take Home Point #1 • This exercise was not to illustrate that findings should be the same as the findings that came before • In fact, it would have been great if shocks weren’t associated with increased risk of death in our study. • This exercise was to illustrate that drastically (polar opposite) conclusions can be reached depending on how the data is set up and modeled

  22. Take Home Point #2 Just like medicine, statistical analysis is an evolving science Just like medicine, there is controversy as to what are the best (most appropriate) methods to use

  23. Take Home Point #3 • It is not necessary to completely understand & critique the methods section of every study • I only pretend to understand the underlying math and concepts • It is worth knowing a little bit about this stuff or at the very least trying to elucidate the underlying assumptions, population included, etc. • It is worth knowing somebody who does understand this stuff

  24. Education is the path from cocky ignorance to miserable uncertainty. - Mark Twain Uncertainty and mystery are energies of life. Don’t let them scare you unduly, for they keep boredom at bay and spark creativity. - R.I. Fitzhenry The Black Swan: The Impact of the HIGHLY IMPROBABLE. Nassim Nicholas Taleb

More Related