TY - JOUR
T1 - Are forecasting competitions data representative of the reality?
AU - Spiliotis, Evangelos
AU - Kouloumos, Andreas
AU - Assimakopoulos, Vassilios
AU - Makridakis, Spyros
PY - 2019/1/1
Y1 - 2019/1/1
N2 - Forecasters typically evaluate the performances of new forecasting methods by exploiting data from past forecasting competitions. Over the years, numerous studies have based their conclusions on such datasets, with mis-performing methods being unlikely to receive any further attention. However, it has been reported that these datasets might not be indicative, as they display many limitations. Since forecasting research is driven somewhat by data from forecasting competitions, it becomes vital to determine whether they are indeed representative of the reality or whether forecasters tend to over-fit their methods on a random sample of series. This paper uses the data from M4 as proportionate to the real world and compares its properties with those of past datasets commonly used in the literature as benchmarks in order to provide evidence on that question. The results show that many popular benchmarks of the past may indeed deviate from reality, and ways forward are discussed in response.
AB - Forecasters typically evaluate the performances of new forecasting methods by exploiting data from past forecasting competitions. Over the years, numerous studies have based their conclusions on such datasets, with mis-performing methods being unlikely to receive any further attention. However, it has been reported that these datasets might not be indicative, as they display many limitations. Since forecasting research is driven somewhat by data from forecasting competitions, it becomes vital to determine whether they are indeed representative of the reality or whether forecasters tend to over-fit their methods on a random sample of series. This paper uses the data from M4 as proportionate to the real world and compares its properties with those of past datasets commonly used in the literature as benchmarks in order to provide evidence on that question. The results show that many popular benchmarks of the past may indeed deviate from reality, and ways forward are discussed in response.
KW - Forecasting competitions
KW - Forecasting methods
KW - M4
KW - Time series features
KW - Time series visualization
UR - http://www.scopus.com/inward/record.url?scp=85062656389&partnerID=8YFLogxK
U2 - 10.1016/j.ijforecast.2018.12.007
DO - 10.1016/j.ijforecast.2018.12.007
M3 - Article
AN - SCOPUS:85062656389
SN - 0169-2070
JO - International Journal of Forecasting
JF - International Journal of Forecasting
ER -