Loading...
Leveraging plasticity in incremental decision trees
Abstract
Commonly used incremental decision trees for mining data streams include Hoeffding Trees (HT) and Extremely Fast Decision Trees (EFDT). EFDT exhibits faster learning than HT. However, due to its split revision procedure, EFDT suffers from sudden and unpredictable accuracy decreases caused by subtree pruning. To overcome this, we propose PLASTIC, an incremental decision tree that restructures the otherwise pruned subtree. This is possible due to decision tree plasticity: one can alter a tree’s structure without affecting its predictions. We conduct extensive evaluations comparing PLASTIC with state-of-the-art methods on synthetic and real-world data streams. Our results show that PLASTIC improves EFDT’s worst-case accuracy by up to 50% and outperforms the current state of the art on real-world data. We provide an open-source implementation of PLASTIC within the MOA framework for mining high-speed data streams.
Type
Conference Contribution
Type of thesis
Series
Citation
Heyden, M., Gomes, H. M., Fouché, E., Pfahringer, B., & Böhm, K. (2024). Leveraging plasticity in incremental decision trees. Lecture Notes in Computer Science, 14945, 38-54. https://doi.org/10.1007/978-3-031-70362-1_3
Date
2024
Publisher
Springer
Degree
Supervisors
Rights
This is an author’s accepted version of a conference paper published in the series: Lecture Notes in Computer Science (LNAI, volume 14945). © 2024 Springer