Algorithm selection on data streams

Abstract

We explore the possibilities of meta-learning on data streams, in particular for algorithm selection. In a first experiment, we calculate the characteristics of a small sample of a data stream and try to predict which classifier performs best on the entire stream. This yields promising results and interesting patterns. In a second experiment, we build a meta-classifier that predicts, based on measurable data characteristics in a window of the data stream, the best classifier for the next window. The results show that this meta-algorithm is very competitive with state-of-the-art ensembles such as OzaBag, OzaBoost and Leveraged Bagging. The results of all experiments are made publicly available in an online experiment database, for the purpose of verifiability, reproducibility and generalizability.
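The second experiment can be illustrated with a minimal sketch. It is not the authors' implementation: the three meta-features, the two fixed stand-in base classifiers (`always_zero`, `above_half`) and the 1-nearest-neighbour meta-learner are all assumptions chosen to keep the example short. The idea shown is only the loop itself: characterise the current window, then use earlier (window characteristics, best classifier on the following window) pairs to choose a classifier for the next window.

```python
from math import dist
from statistics import mean, pvariance

def meta_features(window):
    """Characterise a window of (x, y) pairs (assumed features, for illustration)."""
    xs = [x for x, _ in window]
    ys = [y for _, y in window]
    return (mean(xs), pvariance(xs), sum(ys) / len(ys))

# Hypothetical stand-ins for real stream classifiers.
def always_zero(x):
    return 0

def above_half(x):
    return int(x > 0.5)

BASE = {"always_zero": always_zero, "above_half": above_half}

def accuracy(clf, window):
    return sum(clf(x) == y for x, y in window) / len(window)

def best_classifier(window):
    """Name of the base classifier with the highest accuracy on this window."""
    return max(BASE, key=lambda name: accuracy(BASE[name], window))

def select_per_window(windows):
    """Choose a classifier for each window after the first."""
    meta_data = []  # rows: (meta-features of window i, best classifier on window i+1)
    choices = []
    for prev, nxt in zip(windows, windows[1:]):
        feats = meta_features(prev)
        if meta_data:
            # 1-NN meta-learner (an assumption): reuse the label of the most similar past window.
            _, choice = min(meta_data, key=lambda row: dist(row[0], feats))
        else:
            # Cold start: no meta-data yet, fall back to the best classifier on the current window.
            choice = best_classifier(prev)
        choices.append(choice)
        # Once the next window's labels are seen, record what would have been best.
        meta_data.append((feats, best_classifier(nxt)))
    return choices

window_a = [(0.1, 0), (0.2, 0), (0.3, 0)]
window_b = [(0.1, 0), (0.9, 1), (0.8, 1)]
choices = select_per_window([window_a, window_b, window_a, window_b])
```

On this toy stream, the third choice is made by matching the current window against the first window's meta-features, so the classifier that worked best after that earlier window is selected again.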

Citation

van Rijn, J. N., Holmes, G., Pfahringer, B., & Vanschoren, J. (2014). Algorithm selection on data streams. In S. Džeroski, P. Panov, D. Kocev, & L. Todorovski (Eds.), Proceedings of 17th International Conference on Discovery Science (Vol. LNAI 8777, pp. 325–336). Springer International Publishing. http://doi.org/10.1007/978-3-319-11812-3_28

Publisher

Springer International Publishing