1 / 60

Data for Biosurveillance: A tutorial

Data for Biosurveillance: A tutorial. Main Point. The range of data being collected for public health surveillance have expanded considerably. ANIMALS. HUMAN BEHAVIORS. NON TRADITIONAL USES. CLINICAL DATA. Biosurveillance Data Space. LATER DETECTION. EARLY DETECTION.

nerina
Download Presentation

Data for Biosurveillance: A tutorial

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data for Biosurveillance: A tutorial

  2. Main Point The range of data being collected for public health surveillance have expanded considerably

  3. ANIMALS HUMAN BEHAVIORS NON TRADITIONAL USES CLINICAL DATA Biosurveillance Data Space LATER DETECTION EARLY DETECTION GOLD STANDARDS NON TRADITIONAL MEDICAL INTELLIGENCE BIOSENSORS OTC Pharm Test Results Test Results Complaints Diagnosis Sentinel MD Agribusiness Poison Centers Influenza isolates Environmental Investi- gations Web Queries Medical Examiner Survey Nurse Calls Public Transport (bus) Wind Speed/ direct. Cloud cover Limited Utility Some Potential Promising ER Visits Radiograph Reports

  4. Overview of Surveillance Data

  5. Clinical Data

  6. Clinical data are highly relevant to public health-surveillance. Clinicians and health systems are a primary point of data collection about the sick, including data about demographics, risk factors, symptoms, signs, special testing, and diagnoses.

  7. Where are the Clinical Data? • Paper charts • HL7 Message Routers • Registration, Scheduling, and Billing Systems • Clinical Laboratory Systems • Radiology Systems • Pathology • Dictation • Pharmacy • Orders • Data Warehouses • Clinical Event Monitors • Point-of-Care Systems • Patient Web Portals and Call Centers

  8. Laboratory Results and Electronic Lab Reporting (ELR) • There is no need to prove the value of laboratory results for public health surveillance • The main issue is getting them • Studies of ELR of culture proven notifiable diseases • Hawaii ( JAMA 1999) • Pittsburgh (Panackal et al EID 2002) • Findings (and methods) similar • Quicker • More complete reporting

  9. Chief Complaints • Chief complaints entered by a triage nurse upon admission to an emergency facility are available electronically from hospitals in the United States and other countries. • ICD-9 coded versus free text • How to group into syndromic categories is a major question • There exist several categorizations (CDC consensus, RODS, WRAIR)

  10. Detection Performance from ICD-9 coded Chief Complaints • Respiratory Case Detection (Espino 2001) Sensitivity 0.43 Specificity >95% • Respiratory Outbreak Detection (Tsui 2001) Small sample 1/1 detected, 1 false alarm • Diarrhea Case Detection (Ivanov 2002) Similar results to Espino

  11. Using Free text Chief Complaints (and Natural Language Processing) Also needed for Web or call center queries, and radiographs “cough” NLP “respiratory prodrome”

  12. GI prodrome Output CoCo Naive Bayesian Parser Maps free-text chief complaint to one of seven prodrome categories (or an eighth category—none) P(Respiratory|NVD)= .05 P(Botulinic|NVD)= .001 P(Constitutional|NVD)= .01 P(GI|NVD) = .9 P(Hemorrhagic|NVD)= .001 P(Neurologic|NVD)= .001 P(Rash|NVD)= .001 P(None|NVD)= .036 “N/V/D” Chief complaint CoCo Naive BayesClassifier

  13. Sensitivity, specificity and likelihood ratio positive (LR+) measurements for the CoCo classifier using the Utah Department of Health emergency department gold standard. CoCo Syndrome UDOH Syndrome Sensitivity Specificity LR+ Respiratory Respiratory infection with fever* 0.52 0.89 5 Gastrointestinal Gastroenteritis without blood 0.71 0.90 7 Encephalitic Meningitis / encephalitis 0.47 0.93 7 Rash Febrile illness with rash* 0.50 0.99 56 Botulinic Botulism-like syndrome 0.17 0.998 104 *Required documentation of fever in the patient record. Accuracy of Case Detection: CoCo Naïve Bayes vs. UDOH Manual ED log review Courtesy Per Gesteland, MD

  14. Accuracy of Outbreak Detection: Respiratory Outbreaks detected by from respiratory chief complaints Hospital P&I Diagnoses Respiratorychief complaints SDs from Mean 7 Years Ivanov and Gesteland

  15. Detecting Respiratory Outbreaks in Children by monitoring Chief Complaints Detection from CCs precede that from admissions by 9 days (95% CI -5-23) kids respiratory (lower respiratory infections): pneumonia influenza bronchiolitis bronchitis

  16. Detecting GI Outbreaks in Children by monitoring Chief Complaints Detection from CCs precede that from admissions by 23 days (95% CI 12-33) gastroenteritis rotavirus

  17. WHICH IS BETTER, ICD-9 or FREE TEXT? At detecting Cases of Acute Infectious GI (Ivanov, Wagner, Chapman)

  18. WHICH IS BETTER, ICD-9 or FREE TEXT? At detecting Acute Lower Respiratory Illness from Chief Complaints (Espino, Wagner, Dowling, Chapman)

  19. Chest Radiograph Reports • Radiologists dictate a report for most chest radiographs performed in the United States. • Reports are transcribed after dictation and available electronically with a twelve to twenty-four hours latency. • The reports describe specific findings important for detection of infectious diseases of the lower respiratory tract such as SARS, Plague, Tularemia, inhalational Anthrax. • The granularity of the information is quite specific and allows for detection of different patterns of pneumonia, pleural effusions, and mediastinal widening. • The data are identified at the level of the individual patient and can therefore be pinpointed to home location and correlated with other patients to detect clusters of cases,

  20. Detecting Pneumonia on Radiographs

  21. Detecting Febrile Illness • Coded temperature (Possibly best, but rarely recorded electronically and may be normal) • From NLP of chief complaints • By NLP of Emergency Department (ED) dictation • Sensitivity = 0.98 • Specificity = 0.89 • ~1 day delay

  22. Lab Test Ordering • Motivation: What if you saw a large number of blood culture orders for people with home addresses in one zip code? • Availability from national laboratory companies (maybe 10-20% coverage of all tests done, perhaps less for infectious disease testing which is done in hospitals) • Demonstrated value: no published studies!

  23. Summary of clinical data by clinical systems and market penetration (estimated) Legend: ED, emergency department; LTCF, long term care facility; -, not applicable; ?, unknown

  24. Sales of OTC Healthcare Products

  25. Take Home Message: OTC • Availability is better than any other data type • Value also better understood of all unconventional types of data because of research, although still more to do

  26. National Retail Data Monitor: How it Works • OTCs products are UPC bar coded • Retail stores scan purchases • Nine chains (18,000 stores) send daily sales data • NRDM groups the UPC-level sales data into categories like “cough syrup, pediatric liquid” • NRDM makes data available to health departments via • Web interface: 500+ accounts/46 States • Aggregated data feeds to state health depts • A BioSense data source NRDM

  27. OTC Product Categories • There are approximately 7500 products (UPC codes) used for self-treatment of infectious diseases • We group them into 18 analytic classes at present (“categories”) • Antifever Pediatric (274) • Antifever Adult (1340) • Bronchial Remedies (43) • Chest Rubs (78) • Diarrhea Remedies (165) • Electrolytes Pediatric (75) • Hydrocortisones (185) • Thermometer Pediatric (125) • Thermometer Adult (313) • Cold Relief Adult Liquid (709 products) • Cold Relief Adult Tablet (2467) • Cold Relief Pediatric Liquid (323) • Cold Relief Pediatric Tablet (74) • Cough Syrup Adult Liquid (592) • Cough Syrup Adult Tablet (32) • Cough Syrup Pediatric Liquid (24) • Nasal Product Internal (371) • Throat Lozenges (364) Numbers in parenthesis are the number of UPC codes in the category

  28. Detecting Cryptosporidium from Sales of OTC Diarrhea Remedies • Diarrhea remedies = {Kaopectate,Imodium,Pepto} • North Battleford Outbreak 2001 • Large, waterborne outbreak of Cryptosporidium in late March/April 2001 • Convenience sample of three pharmacies in North Battleford, Saskatchewan • Approximately 5-fold increase in all three pharmacies (relative to baseline established from Jan 2001 to early March 2001) • Two pharmacies provided March/April 2000 data and those data showed no similar increase • Sales peaked weeks before precautionary drinking water advisory and days to weeks before peak onset of diarrhea *Stirling R, Aramini J, Ellis A, et al. Waterborne cryptosporidiosis outbreak, North Battleford, Saskatchewan, Spring 2001. Can Commun Dis Rep. Nov 15 2001;27(22):185-192.

  29. 2001 Crypto in North Battleford … Precautionary water advisory issued on 4/26 Detectable peak on 4/2 in sales of over-the-counter antidiarrheals

  30. Detecting Crypto from Sales of OTC Antidiarrheal (cont) • Collingwood Ontario* • Cryptosporidium outbreak in Collingwood, Ontario Feb/March 1996 • 3/12 pharmacies that were asked gave data • Pharmacy 1: 26 fold increase in sales in Feb 1996 as compared to February 1995 • Pharmacy 2: 1Q 1996 sales were 3 fold 1Q 1995 • Pharmacy 3: Reported no change in sales • Outbreak detected 3/5 • Yet another Cryptosporidium outbreak in Canada (Kelowna and Cranbrook, British Columbia) • All pharmacists (10-12 of them in each city) interviewed acknowledged increased sales (but there was no data available for study) *Rodman JS et al. Pharmaceutical sales: A method of disease surveillance. Journal of Environmental Health, Nov 1997:8-14. **Proctor et al. Surveillance data for waterborne illness detection: an assessment following a massive waterborne outbreak of Cryptosporidium infection. Epidemiol Infect. 1998;120(1):43-54.

  31. Detecting Crypto from Sales of OTC Antidiarrheal (cont) • Milwaukee Crypt (Proctor et al) • Studied the famous 1993 Milwaukee Cryptosporidium outbreak • One pharmacy in outbreak area provided monthly counts of unit sales • Sales for month of March showed three-fold increase over baseline (March 1994/March 1995) • Public health awareness of outbreak – April 5 • Identified need for knowledge of geographic distribution of water supply to improve outbreak detection (North Milwaukee vs. South Milwaukee)

  32. Cryptosporidium Outbreak: Collingwood, Ontario 26-fold increase in sales in Feb Outbreak detected March 5 Rodman JS et al. Pharmaceutical sales: A method of disease surveillance. Journal of Environmental Health, Nov 1997:8-14.

  33. Cryptosporidium Outbreak: Milwaukee 3X increase in sales in March 1993 Public health awareness April 5, 1993 Proctor et al. Surveillance data for waterborne illness detection: an assessment following a massive waterborne outbreak of Cryptosporidium infection. Epidemiol Infect. 1998;120(1):43-54.

  34. More Evidence that Crypto May Drive OTC Sales • Corso et al • Reviewed 2000 medical records of patients admitted to Milwaukee EDs • Identified 378 persons who had moderate or severe case of Cryptosporidium during 1993 outbreak • Self treatment with OTCs prior to ED visit was documented in the medical recordin 30%

  35. Open Question? How small of a crypt outbreak is detectable • North Battleford outbreak affected half the population • The Milwaukee outbreak was similarly large (estimated 400,000 infected) • Other outbreaks: no estimates of size avail. • Only one to three drug stores were studied • Bottom line: How small of a Cryptosporidium outbreak can be detected is very hard to know from observational studies

  36. Detecting Pediatric Diarrheal and Respiratory Outbreak from Sales of Pediatric Electrolytes • Pediatric Electrolytes = {Pedialyte, competitors} • Hogan et al* • 18 Wintertime outbreaks (1998-2001, six cities) • Strong correlation (>0.9) between hospital diagnoses of respiratory and diarrheal illness in children < 5 and sales of pediatric electrolytes • Usually uptick in sales preceded uptick in hospital diagnoses. Average 2 weeks • Variation in time lag from year to year and city to city suggests need for additional studies Hogan et al. Detection of Pediatric Respiratory and Diarrheal Outbreaks from Sales of Over-the-counter Electrolyte Products. J Am Med Inform Assoc, 10(6) November 2003

  37. SCRANTON, PA HARRISBURG, PA PITTSBURGH, PA PHILADELPHIA, PA SALT LAKE CITY, UTAH INDIANAPOLIS, IN Detecting Pediatric Diarrheal and Respiratory Outbreak from Sales of Pediatric Electrolytes Data courtesy IRI, Utah DOH, Indianapolis Network for Patient Care, and PA HC4 Council

  38. Detectability of Anthrax? Detecting Influenza from OTC Cold Remedies • Welliver et al* • Studied 1976-1977 Influenza B outbreak in Los Angeles • Data from one distribution center of Ralphs Grocery Company in Los Angeles • OTC cold remedy sales peaked 3 weeks prior to peak in positive Influenza cultures • No association between aspirin (antipyretic) sales and influenza Welliver RC, Cherry JD, Boyer KM, et al. Sales of nonprescription cold remedies: a unique method of influenza surveillance. Pediatr Res. Sep 1979;13(9):1015-1017.

  39. Detecting Influenza from OTC Cold Remedies (cont) • Correlation of Cough/Cold/Flu OTC Categories With Hospital Diagnoses of Pneumonia, Influenza, Bronchitis, and Bronchiolitis

  40. From Permissive Environments • A permissive environment is one that allows types of biosurveillance data to be collected that cannot be otherwise collected. • Survey data • 3-4 days earlier • Telephone calls to medical offices • 3-4 days earlier during Influenza than doctor visits • Web queries to medical sites

  41. Absenteeism • Direct measures: attendance systems e.g., school attendance • Indirect through other measures of a person’s physical presence at a location. • Affected by weekends, holidays, and vacations or recess periods (especially school absenteeism!)

  42. Supplemental Data

  43. Spatial Info • Census tract vs. zip code • ZIP HIPAA and availability • Street address 60% automatic recognition problem • Substreet address (floor of building) Office, especially in vertical cities like HK, NY • Longitude, height, and latitude: maximum flexibility

  44. Time Stamping • It is sort of obvious but worth discussing • Time zones can cause confusion • The meaning of time stamps can cause confusion • Is it the time of the order or the result? • Is it the time of patient registration or of admission to the hospital?

  45. Data Related to Route of Transmission • Water supply data • Airplane passenger lists • Food distribution systems • HVAC systems • Employer • Home address • …

  46. Weather • Weather and climate data currently used in epidemiological analysis include temperature, wind direction and speed (for bioaerosol related analyses), and precipitation. • In the United States, weather data are already highly available. • Temperature • wind speed • wind direction • precipitation. • Up-to-the-minute information for the entire nation is available because data are collected in real-time, in standard formats, and integrated in a central location that is publicly available without any technical or administrative barriers (http://weather.noaa.gov/). • But, they are not stored, so if you need historical data you must capture them daily and store them

  47. Which data are required (and how do we assess their relative importance for R&D?)

  48. Overly Simple Answer • Exactly the data public health departments collect at present. • We just need to speed up the collection and processing of the data, for example, • ELR • Electronic reporting of notifiable diseases Yes, but they sometimes miss outbreaks or detect them late Yes, but maybe the data are inherently late. Plus there is still the problem of undetected outbreaks Also, there may just be a better way. There may be highly useful data that they just do not have the infrastructure to collect

More Related