230 likes | 317 Views
Assessing the skill of decadal predictions Reidun Gangst ø, Andreas P. Weigel, Mark A. Liniger. EMS Annual Meeting, Berlin, 13 September 2011. Outline. The ENSEMBLES decadal predictions Impact of drift correction on skill Is there any skill apart from the trend ?
E N D
Assessing the skill of decadal predictionsReidun Gangstø, Andreas P. Weigel, Mark A. Liniger EMS Annual Meeting, Berlin, 13 September 2011
Outline • The ENSEMBLES decadal predictions • Impact of drift correction on skill • Is there any skill apart from the trend? • Impact of cross-validation on skill • Evaluating skill with the Jackknife bias corrector • Summary and conclusions
Global average T2 temperature(ENSEMBLES decadal predictions vs ERA-40/Interim re-analysis data) ECMWF UKMO T2 (°C) IFM-GEOMAR CERFACS T2 (°C) Hindcast year Hindcast year
Problem: sample size too small (8-9) to obtain robust bias estimates for each year separately Drift correction methods: • Subtracting the lead-time dependent bias (CONV) • Fitting a 3rd degree polynomial fit to the lead-time dependent bias (FIT) Example of drift evolutionwith lead-time Crosses: CONV solid lines: FIT T2 temperature (°C) Lead-time (year) The drift correction is done in a leave-one-out cross-validation mode
Global mean T2, after drift correction ECMWF UKMO T2 (°C) Multi-model IFM-GEOMAR CERFACS T2 (°C) Hindcast year Hindcast year
Correlation after drift correction(in cross-validation) Correlation, FIT (T2 mean over years 1-5) Mean of grid point-wise correlation Latitude Correlation Lead-time (year) Longitude
Removing the model trend 1-5 y 1-5 y All lead-times 1-10 y 1-5 y T2 temperature (°C) 6-10 y 6-10 y 6-10 y Year
Removing the observed trend 1-5 y 1-5 y 1-5 y - 6-10 y 6-10 y 6-10 y -
Correlation after detrending (drift correction with CONV, in cross-validation) Correlation, model trend removed (yrs 1-5) Mean of grid point-wise correlation Latitude Correlation Lead-time (year) Longitude Why is the skill predominantly negative???
Cross-validation … then correlate with observations 1960, 1965, 1970, …
Toy model: bias from cross-validation Correlation as measured Prescribed correlation: 0 Number of experiments: 10’000 Var.fcst / Var.obsv 1:12 Drift-correction (method: CONV) in cross-validation
Illustration of cross-validation bias,example: CONV drift correction Forecasts Obsv. Not bias corrected NO CORRELATION
Illustration of cross-validation biasexample: CONV drift correction Forecasts Obsv. Not bias corrected Bias corrected NO CORRELATION
Illustration of cross-validation biasexample: CONV drift correction Forecasts Obsv. Not bias corrected Bias corrected NO CORRELATION
Illustration of cross-validation biasexample: CONV drift correction Forecasts Obsv. Not bias corrected Bias corrected NO CORRELATION
Illustration of cross-validation biasexample: CONV drift correction Forecasts Obsv. Not bias corrected Bias corrected NEGATIVE CORRELATION NO CORRELATION
Consequences for verification • Estimates of actual prediction skill of decadal forecasts problematic because: • Issues of data situation in hindcasts (e.g. ocean data before 1980s) • small sample size induces bias in cross-validation procedure • It may be better to look at potential predictability, i.e. the skill we would have with an infinite number of training data, and assuming that there are no limitations in data quality
Jackknifing as a pragmatic solution • Empirical approach frequently used to quantify sample size related biases • Related to bootstrapping • The idea is that the estimator is computed from the full sample, then recomputed n times, leaving a different observation out each time • Reference: B. Efron (1982). The Jackknife, the Bootstrap and other resampling plans. J.W. Arrowsmith, Ltd., Bristol, England, 92 pp.
Toy model: bias from cross-validation Jackknife estimate Correlation as measured Prescribed correlation: 0 Number of experiments: 10’000 Var.fcst / Var.obsv 1:12 Drift-correction (method: CONV) in cross-validation
Local correlation after drift correction,with the Jackknife bias corrector (JK) applied Correlation, CONV, with CV Correlation with JK (T2 mean over years 1-5) Latitude Mean of grid point-wise correlation Correlation Longitude Lead-time (year)
Local correlation after detrending, with the Jackknife bias corrector applied Correlation CONV, with CV Correlation with JK, model trend removed (yrs 1-5) Latitude Mean of grid point-wise correlation Correlation Longitude Lead-time (year)
Difference in correlation between the detrending methods Uncertainties related to the choice of detrending method are of the same order of magnitude as remaining fluctuations
Summary and conclusions • Predicted near-surface temperature from the ENSEMBLES decadal model forecasts are compared to ERA-40/Interim re-analysis data • Drift correction: • Reduction of noise by fitting suitable polynomial through annual bias estimates • Verification: • Unbiased estimate of forecasts problematic due to small sample sizes • It may be more useful to focus on potential predictability (e.g. Jackknife method) • Trend: • By far most of the skill is related to reproduction of linear trend • Skill of predicting remaining (interannual) fluctuations close to zero • Exact quantification difficult due to uncertainties in detrending methods