Bifet, A., Holmes, G. & Pfahringer, B. (2010). Leveraging bagging for evolving data streams. In J.L. alcazer et al. (Eds.): ECML PKDD 2010, Part I, LNZI 6321 (pp. 135-150). Heidelberg: Springer-Verlag.
Permanent Research Commons link: http://hdl.handle.net/10289/4638
Bagging, boosting and Random Forests are classical ensemble methods used to improve the performance of single classifiers. They obtain superior performance by increasing the accuracy and diversity of the single classifiers. Attempts have been made to reproduce these methods in the more challenging context of evolving data streams. In this paper, we propose a new variant of bagging, called leveraging bagging. This method combines the simplicity of bagging with adding more randomization to the input, and output of the classifiers. We test our method by performing an evaluation study on synthetic and real-world datasets comprising up to ten million examples.
© 2010 Springer-Verlag Berlin Heidelberg. This is the author's accepted version. The final publication is available at Springer via dx.doi.org/10.1007/978-3-642-15880-3_15