Loading...
Thumbnail Image
Publication

Tie-breaking in Hoeffding trees

Abstract
A thorough examination of the performance of Hoeffding trees, state-of-the-art in classification for data streams, on a range of datasets reveals that tie breaking, an essential but supposedly rare procedure, is employed much more than expected. Testing with a lightweight method for handling continuous attributes, we find that the excessive invocation of tie breaking causes performance to degrade significantly on complex and noisy data. Investigating ways to reduce the number of tie breaks, we propose an adaptive method that overcomes the problem while not significantly affecting performance on simpler datasets.
Type
Conference Contribution
Type of thesis
Series
Citation
Holmes, G., Richard, K., Pfahringer, B. (2005). Tie-breaking in Hoeffding trees. In proceedings of the Second International Workshop on Knowledge Discovery from Data Streams, Porto, Portugal, 2005.
Date
2005
Publisher
ECML/PKDD
Degree
Supervisors
Rights