Univariate time series forecasting of temperature and precipitation with a focus on machine learning algorithms: a multiple-case study from Greece

G. Papacharalampous, H. Tyralis, and D. Koutsoyiannis, Univariate time series forecasting of temperature and precipitation with a focus on machine learning algorithms: a multiple-case study from Greece, Water Resources Management, 32 (15), 5207–5239, doi:10.1007/s11269-018-2155-6, 2018.

[doc_id=1920]

[English]

We provide contingent empirical evidence on the solutions to three problems associated with univariate time series forecasting using machine learning (ML) algorithms by conducting an extensive multiple-case study. These problems are: (a) lagged variable selection, (b) hyperparameter handling, and (c) comparison between ML and classical algorithms. The multiple-case study is composed by 50 single-case studies, which use time series of mean monthly temperature and total monthly precipitation observed in Greece. We focus on two ML algorithms, i.e. neural networks and support vector machines, while we also include four classical algorithms and a naïve benchmark in the comparisons. We apply a fixed methodology to each individual case and, subsequently, we perform a cross-case synthesis to facilitate the detection of systematic patterns. We fit the models to the deseasonalized time series. We compare the one- and multi-step ahead forecasting performance of the algorithms. Regarding the one-step ahead forecasting performance, the assessment is based on the absolute error of the forecast of the last monthly observation. For the quantification of the multi-step ahead forecasting performance we compute five metrics on the test set (last year’s monthly observations), i.e. the root mean square error, the Nash-Sutcliffe efficiency, the ratio of standard deviations, the coefficient of correlation and the index of agreement. The evidence derived by the experiments can be summarized as follows: (a) the results mostly favour using less recent lagged variables, (b) hyperparameter optimization does not necessarily lead to better forecasts, (c) the ML and classical algorithms seem to be equally competitive.

Full text is only available to the NTUA network due to copyright restrictions

Additional material:

Preprint (4300 KB)

Our works referenced by this work:

D. Koutsoyiannis, H. Yao, and A. Georgakakos, Medium-range flow prediction for the Nile: a comparison of stochastic and deterministic methods, Hydrological Sciences Journal, 53 (1), 142–164, doi:10.1623/hysj.53.1.142, 2008.

H. Tyralis, and D. Koutsoyiannis, Simultaneous estimation of the parameters of the Hurst-Kolmogorov stochastic process, Stochastic Environmental Research & Risk Assessment, 25 (1), 21–33, 2011.

G. Papacharalampous, H. Tyralis, and D. Koutsoyiannis, Forecasting of geophysical processes using stochastic and machine learning algorithms, European Water, 59, 161–168, 2017.

H. Tyralis, and G. Papacharalampous, Variable selection in time series forecasting using random forests, Algorithms, 10, 114, doi:10.3390/a10040114, 2017.

Our works that reference this work:

G. Papacharalampous, H. Tyralis, A. Langousis, A. W. Jayawardena, B. Sivakumar, N. Mamassis, A. Montanari, and D. Koutsoyiannis, Probabilistic hydrological post-processing at scale: Why and how to apply machine-learning quantile regression algorithms, Water, doi:10.3390/w11102126, 2019.

G. Papacharalampous, H. Tyralis, D. Koutsoyiannis, and A. Montanari, Quantification of predictive uncertainty in hydrological modelling by harnessing the wisdom of the crowd: A large-sample experiment at monthly timescale, Advances in Water Resources, 136, 103470, doi:10.1016/j.advwatres.2019.103470, 2020.

Works that cite this document: View on Google Scholar or ResearchGate