1 / 34

G2 Crop CIS meeting Ispra, May 14 – 15, 2012

G2 Crop CIS meeting Ispra, May 14 – 15, 2012. Presented by: Institute of Geodesy and Cartography. Utility assessment of BioPAR products for wheat yield forecasting in Europe. Crop yield estimation. Detailed description of methods and comparison of results on MARSOP and BioPar data.

chin
Download Presentation

G2 Crop CIS meeting Ispra, May 14 – 15, 2012

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. G2 Crop CIS meetingIspra, May 14 – 15, 2012 Presented by: Institute of Geodesy and Cartography

  2. Utility assessment of BioPAR products for wheat yield forecasting in Europe. Crop yield estimation. Detailed description of methods and comparison of results on MARSOP and BioPar data 2012-01-25 ISPRA

  3. Objective 10400 - Utility Assessment – IGiK contribution The objective of the work is to test the performance of MARS and BioPar indicators for yield forecast on an European window. The purpose is to show and assess their practical use in crop monitoring/yield forecasting. The work is aimed at comparing the differences in yield estimation accuracy, based on the two data sets. 2012-01-25 ISPRA

  4. European agro-climatic zones Iglesias, A., Garrote, L., Quiroga, S., Moneo, M.: Impacts of climate change in agriculture in Europe. PESETA-Agriculture study. EUR 24107 EN; DOI 10.2791/33218; EC 2009. 2012-01-25 ISPRA

  5. Another grouping of regions mean ordinal number of the decade in which the annual maximum of NDVI occurred 2012-01-25 ISPRA

  6. Statistical model Partial Least Squares Regression(PLSR) - to choose a few components being linear combinations of explanatory variablesXand to perform linear regression of response variableY on these variables instead of performing regression with use of all X-variables Y - response variable (yield value); Xn - explanatory variables (values of vegetation indices); n - sequential number of ten-day period taken into account; d_beg, d_end – number of ten-day period corresponding to the beginning and the end of growing season, respectively (different for different agro-climatic zones); cNn - function f – coefficients generated by the PLS regression algorithm. 2012-01-25 ISPRA

  7. Statistical model Partial Least Squares Regression(PLSR) - generalization of multiple regression - many (correlated) predictor variables - few observations - to derive orthogonal components using the cross-covariance matrix between the response variable and the explanatory variables - dimension reduction technique similar to Principal Component Regression (PCR) • PCR - the coefficients reflect the covariance structure between the predictor variables X • PLSR – the coefficients reflect the covariance structure between the predictor X and response Y variables 2012-01-25 ISPRA

  8. Model evaluation One-leave-out cross-validation: - for each year of data the PLS regression model was built with this yearexcluded - the yield prediction for excluded year was performed - predicted and actual yield values were compared 2012-01-25 ISPRA

  9. Model evaluation One-leave-out cross-validation: Performances were evaluated in terms of cross-validation mean errors: Mean Percentage Error (MPE) Mean Absolute Percentage Error (MAPE) Root Mean Square Error (RMSE) Yield_obsi– actual yield in year i, Yield_predi –yield prediction made for year i, N – number of observations (years) taken into account 2012-01-25 ISPRA

  10. Results - cross validation Agro-climatic zones M A R S B i o P a r 2012-01-25 ISPRA

  11. Results - cross validation maxNDVI M A R S B i o P a r 2012-01-25 ISPRA

  12. Chosen regions For each european NUTS region WA - wheat area harvested (from Eurostat, mean value of 11 considered years) TA - total arable land area (from arable land mask) Ispra, May 14 – 15, 2012

  13. Chosen regions DK0 153.02 Atlantic Central ES41 133.69 Mediterranean North DE2 128.95 Continental North DEE87.51 Continental North ES24 87.31 Mediterranean North mean29.69 lowest0.09 Ispra, May 14 – 15, 2012

  14. Prediction errors Ispra, May 14 - 15, 2012

  15. Prediction errors Ispra, May 14 - 15, 2012

  16. Year 2009 yield prognosis Ispra, May 14 - 15, 2012

  17. Year 2009 yield prognosis Ispra, May 14 - 15, 2012

  18. Year 2009 yield prognosis Ispra, May 14 - 15, 2012

  19. Year 2009 yield prognosis Ispra, May 14 - 15, 2012

  20. Year 2009 yield prognosis Ispra, May 14 - 15, 2012

  21. Year 2009 prediction errors Ispra, May 14 - 15, 2012

  22. Year 2009 prediction errors Ispra, May 14 - 15, 2012

  23. Year 2009 prediction errors Ispra, May 14 - 15, 2012

  24. Year 2009 regression coefficients Ispra, May 14 - 15, 2012

  25. Year 2009 regression coefficients Ispra, May 14 - 15, 2012

  26. Year 2009 regression coefficients Ispra, May 14 - 15, 2012

  27. Models for aggregated data • A strategy to increase the number of observations by grouping the NUTS • As the number of years of yield data is small, the possibility of building PLS Regression models for aggregated data was investigated. • Levels of NUTS-2 regions aggregation considered: • agro-climatic zone, • country, • country / agro-climatic zone, • NUTS-1 / agro-climatic zone. Ispra, May 14 - 15, 2012

  28. Models for aggregated data For each NUTS-2 region, yield data was standardized. yield standardized = (yield – mean) / standard deviation Standardized yield values and values of vegetation indices from all NUTS-2 regions constituting one aggregated region were used to build PLS regression model for aggregated region. Ispra, May 14 - 15, 2012

  29. Models for aggregated data Cross-validation The predictive ability of the model for aggregated region was assessed with cross-validation. For each year of the data: The PLS regression model was built on the basis of data that did not contain data for year considered (the standardization procedure for each NUTS-2 region was repeated). For each NUTS-2 region constituting the aggregated region, the prediction of standardized yield for year considered was performed and the destandardized yield value was calculated. This predicted yield value was compared with observed yield. Cross-validation MAPE, MPE, Nash-Sutcliffe coefficient were calculated. Ispra, May 14 - 15, 2012

  30. Models for aggregated data Nash–Sutcliffe model efficiency coefficient Ispra, May 14 - 15, 2012

  31. Nash–Sutcliffe efficiencies can range from −∞ to 1. An efficiency of 1 (E = 1) corresponds to a perfect match of modeled discharge to the observed data. An efficiency of 0 (E = 0) indicates that the model predictions are as accurate as the mean of the observed data, whereas an efficiency less than zero (E < 0) occurs when the observed mean is a better predictor than the model or, in other words, when the residual variance (described by the numerator in the expression above), is larger than the data variance (described by the denominator). Essentially, the closer the model efficiency is to 1, the more accurate the model is. Models for aggregated data NSC = 1 - a perfect match of modeled to the observed data. NSC = 0 - the model predictions are as accurate as the mean of the observed data NSC < 0 - the observed mean is a better predictor than the model The closer the model efficiency is to 1, the more accurate the model is. Ispra, May 14 - 15, 2012

  32. Nash–Sutcliffe efficiencies can range from −∞ to 1. An efficiency of 1 (E = 1) corresponds to a perfect match of modeled discharge to the observed data. An efficiency of 0 (E = 0) indicates that the model predictions are as accurate as the mean of the observed data, whereas an efficiency less than zero (E < 0) occurs when the observed mean is a better predictor than the model or, in other words, when the residual variance (described by the numerator in the expression above), is larger than the data variance (described by the denominator). Essentially, the closer the model efficiency is to 1, the more accurate the model is. Aggregation for agro-climatic zones Ispra, May 14 - 15, 2012

  33. Nash–Sutcliffe efficiencies can range from −∞ to 1. An efficiency of 1 (E = 1) corresponds to a perfect match of modeled discharge to the observed data. An efficiency of 0 (E = 0) indicates that the model predictions are as accurate as the mean of the observed data, whereas an efficiency less than zero (E < 0) occurs when the observed mean is a better predictor than the model or, in other words, when the residual variance (described by the numerator in the expression above), is larger than the data variance (described by the denominator). Essentially, the closer the model efficiency is to 1, the more accurate the model is. Country / agro-climatic zone Ispra, May 14 - 15, 2012

  34. THANK YOU VERY MUCH

More Related