260 likes | 288 Views
AHPS River Data Quality Control Techniques and Tools. OCWWS/HSD Hydrologic Support Branch December 15, 2011. Outline. Introduction Solution Review of Quality Control (QC) Operations Operational Discussion Identifying ...Problem Data Reporting ...Problem Data Correcting ...Problem Data
E N D
AHPS River DataQuality ControlTechniques and Tools OCWWS/HSD Hydrologic Support Branch December 15, 2011 Note: This presentation is being recorded for training purposes.
Outline • Introduction • Solution • Review of Quality Control (QC) Operations • Operational Discussion • Identifying ...Problem Data • Reporting ...Problem Data • Correcting ...Problem Data • Conclusion Note: This presentation is being recorded for training purposes.
Intro: Problem • Bad observed data at water.weather.gov • Data is very visible • Evident on AHPS maps/graphs/tables • Increased use of web info brings greater scrutiny • Data used by public, countless partners at all levels • Bad data characteristics • Usually involves data spikes • Focus is on observed river data • Forecast and precipitation data on web not covered • Current WHFS tools provide only limited support Note: This presentation is being recorded for training purposes.
Intro: Goals • Eliminate bad data presented to public • Ensure reliable observations, best forecasts • Provide decision support at local, regional, and national levels using web info • Minimize burden on WFO staff - provide automated tools where possible Note: This presentation is being recorded for training purposes.
Solution: Summary QC process modeled as three fundamental steps: • Identify bad data automatically - Minimizes burden of manual review • Report/notice detected bad data • Reported to forecaster • Automated, still requires user to take action • Noticed by forecaster - Manual review can simultaneously identify/”report”... • Correct reported bad data - For AHPS, manually set data to missing (not delete) Note: This presentation is being recorded for training purposes.
Solution: Principles • Focus on data spike problems • Most “bad” data spikes detectable via rate-of-change (ROC) check • But not all ROC-failed data are actually bad or questionable • Focus on spikes assumes range checks locally configured • Maintain separation of QC and Alert-Alarm (AA) functions • ROC thresholds exist for both QC and AA • QC judges data quality, AA judges threat • Difficulty in trusting precise thresholds for ROC complicates automated QC process Note: This presentation is being recorded for training purposes.
Solution: Details • Identify using rate-of-change (ROC) check • Configure WHFS ROC_checker • Ensure thresholds are appropriate – Done by local office • Set token so ROC-failed data set as questionable, not bad (qc_questorbad) – HSD Support will do • Report using new QC report utility • Complements Alert-Alarm report utility • Configure new utility operations • Set up new utility in existing cron -HSD Support will do • Set up trigger in textdb operations - Done by local office • Define QC product id in report script - Done by local office • Correct using WHFS TimeSeries application Note: This presentation is being recorded for training purposes.
QC Review: Info Available • Web QC part of general hydro data QC and Alert-Alarm operations • Previous WHFS QC/Alert-Alarm webinar • Given October 6, 2009 • Powerpoint and recording available at: http://www.nwstc.noaa.gov/Hydrology/hydromain/webinars/index.html • Available WHFS documentation https://ocwws.weather.gov/intranet/whfs Note: This presentation is being recorded for training purposes.
QC Review: QC Checks • All value set to one of three QC levels: • Good, Questionable, Bad (G, Q, B) • QC Checks in WHFS • Rate-of-change • Only one ROC threshold • If exceeded, set to Q or B (token controlled) • Range • Separate thresholds for Q and B • Obs-Forecast Difference - set to Q • Station Consistency • New report utility on LAD • Compares data from multiple sources at same station • Does not set QC flag New focus Assume already setup ok Note: This presentation is being recorded for training purposes.
QC Review: Processing • Hydro data organized by type (obs, fcst, other) and physical element (PE) • River Data Access for AHPS • Height / FcstHeight tables (stage) • Discharge / FcstDischarge tables • RejectedData table *not* read • Written by SHEFdecode for out-of-range data (per token shef_post_baddata) • ROC Bad data not moved there! (regardless of token) • HydroGen Data Products • Tables read by genXML function within HydroGen operations • HML product file generated and sent every 15 minutes! • HydroGen does NOT filter Bad data [say what!? change???] Note: This presentation is being recorded for training purposes.
QC Review: HydroBase Controls • QC thresholds defined as: • Default values for physical element • Location-specific values override defaults • ROC thresholds • ROC QC threshold should be set • ROC Alert-Alarm thresholds are separate • Help identify hydromet threats, not bad data • If sensor continually sends bad data, consider disabling the station or specific data elements • Location->Modify Location-> Additional Info: disable station for AHPS and WHFS • Data Ingest->IngestFilter: disable specific element for AHPS and WHFS • Setup->HydroGen Configuration: disable station • Can also use AHPS CMS to... • Define annotated hydro note to describe data QC problem • Disable station able using “Sensor in Service” option Note: This presentation is being recorded for training purposes.
Operational Discussion Identify: Hydro Database ---- Good Questionable Bad NEW ROC_checker Report: QC report product report_hydroQC Text Database Alert/Alarm report product report_alarm Correct: trigger TimeSeries Web HML product HydroGen Note: This presentation is being recorded for training purposes.
Identify Bad Data: ROC Checks • Automated ROC checks • Detects data spikes (QC) or quick changes indicating threat (AA) • Compares successive time series values • Only second value in value pair being checked is flagged • Ignores B data, ignores Q data if program argument set • Data exceeding QC ROC threshold • Flagged as Q or B based on token qc_questorbad • Default behavior is to set to B • Even if B, not moved to the RejectedData table so AHPS will still use it since AHPS does not filter bad data!!! • HSD Support will change token to set to Q Note: This presentation is being recorded for training purposes.
Report Bad Data: Options Options for becoming aware of data QC problems: • Internal text reports using textdb notification: • New QC summary product • Existing AA summary product • Could set up dedicated report for only report ROC alert-alarms • Use –fRMT in run_roc_checker script • Must consider this since some “bad” data may actually only be flagged for alert-alarm purposes • Manual review using WHFS applications • Manual review using AHPS web pages themselves • Manual review using local CRH app • Other ideas? Note: This presentation is being recorded for training purposes.
Report Bad Data: New QC Report • Name: report_hydroQC • Arguments: • Table – Height, Discharge, or both • Lookback time – number of minutes (MM) • Location filter – all, HSA only, HSA AHPS only • Other switches/filters to be added per request • Function: • Reads database table(s) • Considers all questionable or bad values • Filters on location filter • Filters on time period ending at current time and beginning MM minutes earlier • Formats text product • Sends text product to textdb if Q/B values found Note: This presentation is being recorded for training purposes.
Report Bad Data: New QC Report Sample QC Report: ** Report of Questionable/Bad Hydro Obs Data **Period (Z): Tue 12-13 06:04:09 to Tue Dec 13 18:04:09BOGW1 HG RP Dec 13 10:30 [ROC 1/3] (Bogachiel River near La Push) CENW1 HG RP Dec 13 12:00 [RNG 1/2] (Chehalis River at Centralia)CRNW1 HG RP Dec 13 10:30 [ROC 1/1] (Snoqualmie River near Carnation)UIL HZ RZ Dec 13 12:00 [SHF 0/1] (Quillayute)ALRW1 QR RZ Dec 13 15:00 [ROC 1/0] (Alder Dam)Created: 12-13-2011 18:04:09 GMTReading Height, Discharge data.Shows latest obs time [QCreason questionable/bad value count] Note: This presentation is being recorded for training purposes.
Notice Bad Data: WHFS Applications • HydroView LiveData-> • Questionable/Bad window • Rejected Data (‘trash”) window • TimeSeries • Tabular/graphical display • River Monitor • Automatically updates Note: This presentation is being recorded for training purposes.
Sample HydroView Questionable/Bad Window Note: This presentation is being recorded for training purposes.
“A” means alert/alarm In effect Sample RiverMonitor Main Window Note: This presentation is being recorded for training purposes.
Sample RiverMon AlertAlarm Window Note: This presentation is being recorded for training purposes.
Notice Bad Data: AHPS Pages • AHPS Page features • “Area HydroGraph” feature • Under “Additional Resources” heading • Allows quick, multi-station review • Monitor AHPS map on screen in office • Snapshots only - must refresh • Local CRH app is compact, auto-refresh of “area hydrograph: feature • http://intra.crh.noaa.gov/metdat/DDC/AHPS/ahps_qc.html • On METDAT server, under DDC/AHPS/ Note: This presentation is being recorded for training purposes.
Correcting Bad Data:TimeSeries • General functions • Primary tool to edit data • Also very helpful for reviewing data • Using the "pre-defined groups" mode, can be pre-configured to show select station data • Correcting of data • Data already sent to AHPS must be set to missing to correct it at the web farm database • Next HydroGen run will retransmit data to web • Presentation, titled "Data QC Procedural Change for WHFS", describing this: <http://www.werh.noaa.gov/Hyd/ahps/ahps_resource_phase6_final.htm#9._Data_QC_Procedural_change_for_WHFS_> Note: This presentation is being recorded for training purposes.
Sample TimeSeries Tabular Window Note: This presentation is being recorded for training purposes.
Conclusion: HQ To-Do • HSD Support To-Do • Will update token • Will update script to create hydroQC product • Will provide instructions for local changes • May do this as early as next week • Future Changes • Will push for future AWIPS work to improve displays and tools for QC and situational awareness Note: This presentation is being recorded for training purposes.
Conclusion: WFO To-Do • Configure • Review QC/AA thresholds in HydroBase • Define triggers in textDB • Define product id in report_hydroQC script • Review data regularly • RiverMon/PrecipMon, AA reports, HydroView, TimeSeries… • Correct • Set AHPS data to missing...don’t delete Note: This presentation is being recorded for training purposes.
Conclusion • Questions? • Contact: whfs@noaa.gov • Thanks Note: This presentation is being recorded for training purposes.