Classifier chains for multi-label classification

Loading...
Thumbnail Image

Publisher link

Rights

This is an author’s accepted version of a conference paper published in the Proceedings of the joint European Conference on Machine Learning and Knowledge Discovery in Databases (ECML PKDD 2009). © 2009 Springer.

Abstract

The widely known binary relevance method for multi-label classification, which considers each label as an independent binary problem, has been sidelined in the literature due to the perceived inadequacy of its label-independence assumption. Instead, most current methods invest considerable complexity to model interdependencies between labels. This paper shows that binary relevance-based methods have much to offer, especially in terms of scalability to large datasets. We exemplify this with a novel chaining method that can model label correlations while maintaining acceptable computational complexity. Empirical evaluation over a broad range of multi-label datasets with a variety of evaluation metrics demonstrates the competitiveness of our chaining method against related and state-of-the-art methods, both in terms of predictive performance and time complexity.

Citation

Read, J., Pfahringer, B., Holmes, G. & Frank, E. (2009). Classifier chains for multi-label classification. In Proceedings of European conference on Machine Learning and Knowledge Discovery in Databases 2009 (ECML PKDD 2009), pp. 254-269. Berlin: Springer.

Series name

Date

Publisher

Springer

Degree

Type of thesis

Supervisor