Publication:
Batch-incremental versus instance-incremental learning in dynamic and evolving data

Abstract

Many real world problems involve the challenging context of data streams, where classifiers must be incremental: able to learn from a theoretically- infinite stream of examples using limited time and memory, while being able to predict at any point. Two approaches dominate the literature: batch-incremental methods that gather examples in batches to train models; and instance-incremental methods that learn from each example as it arrives. Typically, papers in the literature choose one of these approaches, but provide insufficient evidence or references to justify their choice. We provide a first in-depth analysis comparing both approaches, including how they adapt to concept drift, and an extensive empirical study to compare several different versions of each approach. Our results reveal the respective advantages and disadvantages of the methods, which we discuss in detail.

Citation

Read, J., Bifet, A., Pfahringer, B. & Holmes, G. (2012). Batch-incremental versus instance-incremental learning in dynamic and evolving data. In 11th International Symposium on Intelligent Data Analysis, IDA 2012, Helsinki, October 25-27 2012, (pp. 313-323). Helsinki, Finland.

Series name

Date

Publisher

Springer

Degree

Type of thesis

Supervisor

Link to supplementary material

Research Projects

Organizational Units

Journal Issue