Data visualization and forecast combination for probabilistic load forecasting in GEFCom2017 final match

Julian de Hoog; Khalid Abdulla

doi:10.1016/j.ijforecast.2019.02.004

International Journal of Forecasting

Paper

01 Oct 2019

Data visualization and forecast combination for probabilistic load forecasting in GEFCom2017 final match

View publication

Abstract

This paper describes the methods used by Team Cassandra, a joint effort between IBM Research Australia and the University of Melbourne, in the GEFCom2017 load forecasting competition. An important first phase in the forecasting effort involved a deep exploration of the underlying dataset. Several data visualisation techniques were applied to help us better understand the nature and size of gaps, outliers, the relationships between different entities in the dataset, and the relevance of custom date ranges. Improved, cleaned data were then used to train multiple probabilistic forecasting models. These included a number of standard and well-known approaches, as well as a neural-network based quantile forecast model that was developed specifically for this dataset. Finally, model selection and forecast combination were used to choose a custom forecasting model for every entity in the dataset.

Conference paper