660 likes | 671 Views
This presentation provides an overview of the setup and first results of the COSMO-DE-EPS pre-operational phase, including verification and forecasters' feedback. It also discusses the enlargement of the sample at low cost and looks into past production cycles.
E N D
COSMO-DE-EPS Susanne Theis Christoph Gebhardt, Zied Ben Bouallègue, Michael Buchhold
Presentation Overview • setup of COSMO-DE-EPS • first results of pre-operational phase • verification • forecasters‘ feedback • enlarging the sample at low cost • looking into past production cycles
Presentation Overview • setup of COSMO-DE-EPS • first results of pre-operational phase • verification • forecasters‘ feedback • enlarging the sample at low cost • looking into past production cycles new new
COSMO-DE-EPS status • pre-operational phase has started: Dec 9th, 2010 • pre-operational setup: • 20 members • grid size: 2.8 km convection-permitting • lead time: 0-21 hours, 8 starts per day (00, 03, 06,... UTC) • variations in physics, initial conditions, lateral boundaries model domain
Generation of Ensemble Members • plus variations of • initial conditions • model physics Ensemble Chain COSMO-DE-EPS 2.8km COSMO 7km BC-EPS GME, IFS, GFS, GSM
Generation of Ensemble Members • plus variations of • initial conditions • model physics Ensemble Chain COSMO-DE-EPS 2.8km COSMO 7km BC-EPS is running as a time-critical application at ECMWF BC-EPS GME, IFS, GFS, GSM
Generation of Ensemble Members Perturbation Methods Peralta, C., Ben Bouallègue, Z., Theis, S.E., Gebhardt, C. and M. Buchhold, 2011: Accounting for initial condition uncertainties in COSMO-DE-EPS. Submitted to Journal of Geophysical Research. Peralta, C. and M. Buchhold, 2011: Initial condition perturbations for the COSMO-DE-EPS, COSMO Newsletter 11, 115–123. Gebhardt, C., Theis, S.E., Paulat, M. and Z. Ben Bouallègue, 2011: Uncertainties in COSMO-DE precipitation forecasts introduced by model perturbations and variation of lateral boundaries. Atmospheric Research 100, 168-177. (contains status of 2009)
COSMO-DE-EPS plans (2011-2014) 2011 • upgrade to 40 members, redesign • statistical postprocessing • initial conditions by LETKF • lateral boundary conditions by ICON EPS reach operational status 2012 2013
new First Results of Pre-operational Phase- verification - forecasters‘ feedback
new First Results of Pre-operational Phase- verification - forecasters‘ feedback
SYNOP Verification Method RADAR • Ensemble Members • Probabilities of Precipitation
PREC 1h accumulation, threshold: 0.1 mm DETERMINISTIC SCORES for Individual Members • Do the ensemble members • have different long-term statistics? • (multi-model / multi-configuration) • Are there many cases with • the same „best member“ • or „wettest member“? • look at Equitable Threat Score • - look at Frequency Bias Index • (results similar, not shown) Equitable Threat Score 0.5 0.4 0.3 0.2 0.1 0.0 JUNE 2011 IFS GME GFS GSM }20 members 0 5 10 15 20 Forecast Time [h]
PREC 1h accumulation, threshold: 0.1 mm DETERMINISTIC SCORES for Individual Members • Do the ensemble members • have different long-term statistics? • (multi-model / multi-configuration) • Are there many cases with • the same „best member“ • or „wettest member“? • look at Equitable Threat Score • - look at Frequency Bias Index • (results similar, not shown) Equitable Threat Score 0.5 0.4 0.3 0.2 0.1 0.0 JUNE 2011 IFS GME GFS GSM }20 members 0 5 10 15 20 Forecast Time [h] Only small differences in long-term statistics Members may be treated as equally probable
PREC 1h accumulation • observation... • …treated as „Ensemble Member“ • …ranked according to prec amount • at each grid point and forecast hour • How frequent is each rank? • If ensemble underdispersive • U-shaped rank histogram RANK HISTOGRAM 0.05 0.00 JUNE 2011 Frequency 1 6 11 16 21 Observation Rank
PREC 1h accumulation • observation... • …treated as „Ensemble Member“ • …ranked according to prec amount • at each grid point and forecast hour • How frequent is each rank? • If ensemble underdispersive • U-shaped rank histogram RANK HISTOGRAM 0.05 0.00 JANUARY 2011 Frequency 1 6 11 16 21 Observation Rank
PREC 1h accumulation • observation... • …treated as „Ensemble Member“ • …ranked according to prec amount • at each grid point and forecast hour • How frequent is each rank? • If ensemble underdispersive • U-shaped rank histogram RANK HISTOGRAM 0.05 0.00 JANUARY 2011 Frequency 1 6 11 16 21 Observation Rank a) Underdispersiveness relatively small b) Four groups Many cases with large influence by global models
PREC 1h accumulation How good are the probabilities derived from the ensemble? compared to the deterministic COSMO-DE (always forecasting 0% or 100%) Look at Brier Skill Score (no skill: zero) - for different precipitation thresholds (colors) (probabilites of exceeding a certain threshold) - for different forecast lead times (x-axis) BRIER SKILL SCORE JANUARY 2011 > 0.1 mm > 1 mm > 2 mm 0 5 10 15 20 Forecast Time [h]
PREC 1h accumulation How good are the probabilities derived from the ensemble? compared to the deterministic COSMO-DE (always forecasting 0% or 100%) Look at Brier Skill Score (no skill: zero) - for different precipitation thresholds (colors) (probabilites of exceeding a certain threshold) - for different forecast lead times (x-axis) BRIER SKILL SCORE JANUARY 2011 > 0.1 mm > 1 mm > 2 mm 0 5 10 15 20 Forecast Time [h] Always positive! Ensemble provides additional value to COSMO-DE Additional value grows with lead time (less deterministic predictability)
PREC 1h accumulation How good are the probabilities derived from the ensemble? compared to the deterministic COSMO-DE (always forecasting 0% or 100%) Look at Brier Skill Score (no skill: zero) - for different precipitation thresholds (colors) (probabilites of exceeding a certain threshold) - for different forecast lead times (x-axis) BRIER SKILL SCORE JUNE 2011 > 0.1 mm > 1 mm > 2 mm 0 5 10 15 20 Forecast Time [h] Always positive! Ensemble provides additional value to COSMO-DE Additional value grows with lead time (less deterministic predictability)
PREC 1h accumulation How good are the probabilities derived from the ensemble? compared to the deterministic COSMO-DE (always forecasting 0% or 100%) Look at Brier Skill Score (no skill: zero) - for different precipitation thresholds (x-axis) (probabilites of exceeding a certain threshold) - for all foreast lead times BRIER SKILL SCORE MAY - JULY 2011 0.112 5 10 20 Threshold [mm/h] For larger precipitation amounts (summer): even more additional value
PREC 1h accumulation Are the probabilities already well calibrated? (without extra calibration) If we isolate all cases with a forecast probability of -say- 75-85% … did the event occur in 80% of these cases? diagonal line: optimal - for different prec thresholds (colors) (probs of exceeding a threshold) RELIABILITY DIAGRAM JUNE 2011 log (# fcst) > 0.1 mm > 1 mm > 2 mm
PREC 1h accumulation Are the probabilities already well calibrated? (without extra calibration) If we isolate all cases with a forecast probability of -say- 75-85% … did the event occur in 80% of these cases? diagonal line: optimal - for different prec thresholds (colors) (probs of exceeding a threshold) RELIABILITY DIAGRAM JUNE 2011 log (# fcst) > 0.1 mm > 1 mm > 2 mm Reliability diagram shows some bias and underdispersiveness Lines are not flat additional calibration has good potential
Summary of Verification (Precipitation) • Ensemble provides additional value to COSMO-DE (for all accumulations, lead times, precipitation thresholds,…) • Ensemble underdispersiveness is relatively small • Ensemble members may be treated as equally probable • Additional calibration has good potential
Summary of Verification (Precipitation) • Ensemble provides additional value to COSMO-DE (for all accumulations, lead times, precipitation thresholds,…) • Ensemble underdispersiveness is relatively small • Ensemble members may be treated as equally probable • Additional calibration has good potential Pre-operational COSMO-DE ensemble prediction system already meets fundamental quality requirements for precipitation
Other Variables • T_2M and VMAX have been verified • ensemble spread is far too small • nevertheless, ensemble provides additional value to COSMO-DE
Other Variables • T_2M and VMAX have been verified • ensemble spread is far too small • nevertheless, ensemble provides additional value to COSMO-DE COSMO-DE ensemble prediction system has been developed with focus on precipitation Upgrade to 40 members will also look at other variables
new First Results of Pre-operational Phase- verification - forecasters‘ feedback
Forecasters‘ Feedback • available products: see figure precipitation, snow, wind gusts, T_2m probability thresholds: warning criteria • all products on grid-scale (2.8km) • in addition: precipitation probabilities for larger areas (10x10 grid boxes) „probability that the precipitation event will occur anywhere within the region“ probabilities, quantiles, ensemble mean, spread, min, max, …
Forecasters‘ Feedback • evaluate „full package“ • including the visualization tool • consistency of products • select interesting cases • consider forecasters‘ interpretation • perception as intended? • is there any value in the forecast, additional to forecasters‘ knowledge?
Forecasters‘ Feedback • what they prefer to use: • 90%-quantile of precipitation • precipitation probabilities for an area (10x10 grid points)
Forecasters‘ Feedback • what they prefer to use: • 90%-quantile of precipitation • precipitation probabilities for an area (10x10 grid points) • what they appreciate: • early signals for heavy precipitation • indication that deterministic run may be wrong
Forecasters‘ Feedback • what they prefer to use: • 90%-quantile of precipitation • precipitation probabilities for an area (10x10 grid points) • what they appreciate: • early signals for heavy precipitation • indication that deterministic run may be wrong • what they criticize: • jumpiness between subsequent runs • lack of spread in T_2M and VMAX
Forecasters‘ Feedback • what they prefer to use: • 90%-quantile of precipitation • precipitation probabilities for an area (10x10 grid points) • what they appreciate: • early signals for heavy precipitation • indication that deterministic run may be wrong • what they criticize: • jumpiness between subsequent runs • lack of spread in T_2M and VMAX • what they are learning: • dealing with low probabilities (10% probability for extreme weather issue a warning?)
new Enlarging the Sample at Low Cost - looking into past production cycles
new Enlarging the Sample at Low Cost - looking into past production cycles - Introduction - Verification Study - Recommendation
COSMO-DE-EPS Overview December 2010 • start of pre-operational phase / evaluation • 20 members probabilities, quantiles, etc • runs at 00 UTC, 03 UTC, 06 UTC,…
COSMO-DE-EPS Overview December 2010 • start of pre-operational phase / evaluation • 20 members probabilities, quantiles, etc • runs at 00 UTC, 03 UTC, 06 UTC,… • start of pre-operational phase with 40 ensemble members • reach operational status • statistical postprocessing of probabilities of precipitation Q1 2012 End of 2012 2013
COSMO-DE-EPS Overview December 2010 • start of pre-operational phase / evaluation • 20 members probabilities, quantiles, etc • runs at 00 UTC, 03 UTC, 06 UTC,… • start of pre-operational phase with 40 ensemble members • reach operational status • statistical postprocessing of probabilities of precipitation further improvements ? Q1 2012 End of 2012 2013
21UTC 00UTC Enlarging the Sample at Low Cost • looking into past production cycles 0 UTC 15 UTC 18UTC } 20 members } 20 members } 20 members
21UTC 00UTC Enlarging the Sample at Low Cost • looking into past production cycles 0 UTC 15 UTC 18UTC } 20 members } 20 members } 20 members } 60 members
Important Check • How good are forecasts from past production cycles?
Important Check • How good are forecasts from past production cycles? June 2011 precipitation 1h accumulations threshold: 1 mm/h Forecast Start: 00UTC 21UTC 18UTC FBI ETS • similar results for • different months • different starting times • different thresholds • 6h accumulations 0 2 4 6 8 10 12 14 0 2 4 6 8 10 12 14 time of day [UTC] time of day [UTC]
Important Check • How good are forecasts from past production cycles? June 2011 precipitation 1h accumulations threshold: 1 mm/h Forecast Start: 00UTC 21UTC 18UTC FBI ETS • similar results for • different months • different starting times • different thresholds • 6h accumulations 0 2 4 6 8 10 12 14 0 2 4 6 8 10 12 14 time of day [UTC] time of day [UTC] The quality is similar. Exception: Most recent run with forecast lead time 0-3 hours is the best.
Probability of Precipitation > 10 mm/h Resulting Probabilities 20 + 20 + 20 members 20 members % 2011-05-22 15UTC Forecast Start: 09 UTCForecast Start: 09 UTC, 06 UTC, 03 UTC
Summary of Study • looking into past production cycles (20+20+20) • quality gain for precipitation (except first 2 forecast hours) • does not harm quality of T_2M and VMAX_10M • also applicable to „area probabilities“ Recommendation: Look into past production cycles
18UTC 21UTC 00UTC Technical aspect:how to derive probabilities from 20+20+20 members 0 UTC 15 UTC 21UTC 21h forecast in (pre-)operational mode + 6h additional lead time required