Landwehr, N., Hall, M.A. & Frank, E. (2005). Logistic model trees. Machine Learning, 59(1-2), 161-205.
Permanent Research Commons link: http://hdl.handle.net/10289/1445
Tree induction methods and linear models are popular techniques for supervised learning tasks, both for the prediction of nominal classes and numeric values. For predicting numeric quantities, there has been work on combining these two schemes into `model trees', i.e. trees that contain linear regression functions at the leaves. In this paper, we present an algorithm that adapts this idea for classification problems, using logistic regression instead of linear regression. We use a stagewise fitting process to construct the logistic regression models that can select relevant attributes in the data in a natural way, and show how this approach can be used to build the logistic regression models at the leaves by incrementally refining those constructed at higher levels in the tree. We compare the performance of our algorithm to several other state-of-the-art learning schemes on 36 benchmark UCI datasets, and show that it produces accurate and compact classifiers.
This is an author’s version of an article published on the journal: Machine Learning. The original publication is available at www.springerlink.com.