1 / 11

Evaluation of autoregressive time series prediction using validity of cross-validation

Cross-Validation is a validation technique used to explain how well the estimated values from a fitted statistical model will generalize to the explanatory variables under study. The standard procedure for model validation is the K-fold CV as in Regression Analysis and classification problem. This blog discusses a note on the validity of Cross-Validation for evaluating autoregressive Time Series Prediction. Statswork offers statistical services as per the requirements of the customers. When you Order statistical Services at Statswork, we promise you the following u2013 Always on Time, outstanding customer support, and High-quality Subject Matter Experts. <br>Contact Us:<br><br>Website: www.statswork.com<br><br>Email: info@statswork.com<br><br>UnitedKingdom: 44-1143520021<br><br>India: 91-4448137070t<br>tt<br>WhatsApp: 91-8754446690<br>

statsworkfb
Download Presentation

Evaluation of autoregressive time series prediction using validity of cross-validation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Research paper EVALUATION OF AUTOREGRESSIVE TIME SERIES PREDICTION USING VALIDITY OF CROSS-VALIDATION Tags: Statswork | Cross-Validation | Time Series Prediction | Time Series Analysis | Statistical Model | Regression Analysis | Serial Correlation | Statistical Models | Time Series Data Research Planning | Data Collection | Semantic Annotation | Business Analytics | Bio Statistics | Econometrics Copyright © 2019 Statswrok. All rights reserved

  2. CV is a validation technique used to explain how well the estimated values from a fitted statistical model will generalize to the explanatory variables under study.  Cross-Validation (CV) Research Planning | Data Collection | Semantic Annotation | Business Analytics | Bio Statistics | Econometrics Copyright © 2019 Statswrok. All rights reserved

  3. Why Cross Validation technique is different for time series data? However, for predicting the time series data, this K-fold is not a straightforward because of the non-stationarity and serial correlation present in the data. K-fold CV is applicable for the autoregressive modelswith uncorrelated errors and have wide scope in predicting the model using Machine Learning methods. Standard procedure for model validation is the K-fold CV as in Regression Analysis and classification problem. Research Planning | Data Collection | Semantic Annotation | Business Analytics | Bio Statistics | Econometrics Copyright © 2019 Statswrok. All rights reserved

  4. Types of Cross Validation K-fold Cross Validation 01 K-fold Cross Validation for time series for purely autoregressive models particularly for the dependent case instead of applying to the independent case. Hold-out Cross Validation 02 Hold-out cross-validation technique is performed bysplitting the data into training and test set. Research Planning | Data Collection | Semantic Annotation | Business Analytics | Bio Statistics | Econometrics Copyright © 2019 Statswrok. All rights reserved

  5. ILLUSTRATIVE EXAMPLE Research Planning | Data Collection | Semantic Annotation | Business Analytics | Bio Statistics | Econometrics Copyright © 2019 Statswrok. All rights reserved

  6. Model selection adopted in that paper is 5-fold Cross Validation using leave-one-out, non-dependent, and out-of-sample evaluation methods using AR(1) to AR(5) models and residuals have been taken care for the uncorrelated errors using Serial Correlation. 01 02 Three different experiments are carried out to forecast the AR model using Monte Carlo simulation. 03 The monthly seasonality from the ACF and PACF plots for the data considered for Monte-Carlo simulation is depicted in the following figure and the results of the experiment is discussed. 04 Applicability of the procedure using the yearly sun spot dataavailable in R. The data involves 289 observations recorded from 1700 to 1988 and are depicted in the following figure. Contd... Research Planning | Data Collection | Semantic Annotation | Business Analytics | Bio Statistics | Econometrics Copyright © 2019 Statswrok. All rights reserved

  7. Results Research Planning | Data Collection | Semantic Annotation | Business Analytics | Bio Statistics | Econometrics Copyright © 2019 Statswrok. All rights reserved

  8. Discussions Method RMSE • Relevant auto correlation is captured using Ljung-Box testwith the residual series of 1560 model configurations. • Out of 1560, 763 satisfies the Box test and the configuration which have minimum root mean square error is chosen. • The 5-fold CV yields an error rate of 2.247 andout-of-sample (OOS) methodyields 2.281 and the CV procedure is not an over fitting model and the results are tabulated. 5 fold CV 2,247 005 2,281 Auto correlation Research Planning | Data Collection | Semantic Annotation | Business Analytics | Bio Statistics | Econometrics Copyright © 2019 Statswrok. All rights reserved

  9. Summary Discussed the applicability of the K-fold Cross Validation procedure for the purely AR models beyond the usual practice and it is found to be useful when the residuals in the model is uncorrelated. 01 From the Monte-Carlo simulation, the K-fold classification yields better result than out-of-sample method for time series data and the real-time data explains the application of this using AR model with uncorrelated errors. 02 The over-fitting and under-fitting of the model is accessed using the Ljung-Box test and it results in there is no such issues occurred in the model. 03 Examines the applicability of the K-fold cross validation using neural network model with a monte-carlo simulations, & it can also be extended to the other regression models such as random forest and other machine learning techniques. 04 The interesting point is that if we use whatever regression model for these kinds of Time Series Data, this method still proves the K-fold cross validation is the best method. 05 Research Planning | Data Collection | Semantic Annotation | Business Analytics | Bio Statistics | Econometrics Copyright © 2019 Statswrok. All rights reserved

  10. www.statswork.com Statswork Lab @ Statswork.com Research Planning | Data Collection | Semantic Annotation | Business Analytics | Bio Statistics | Econometrics Copyright © 2019 Statswrok. All rights reserved

  11. Freelancer PHONE NUMBER Consultant UK : +44-1143520021 INDIA : +91-4448137070 Guest Blog Editor EMAIL ADDRESS info@statswork.com GET IN TOUCH WITH US CONTACT hr@workfoster.com Research Planning | Data Collection | Semantic Annotation | Business Analytics | Bio Statistics | Econometrics Copyright © 2019 Statswrok. All rights reserved

More Related