Sumner, M., Frank, E. & Hall, M.A. (2005). Speeding up logistic model tree induction. In A. Jorge et al.(Eds.), Proceedings of 9th European Conference on Principles and Practice of Knowledge Discovery in Databases, Porto, Portugal, October 3-7, 2005(pp. 675-683). Berlin, Germany: Springer
Permanent Research Commons link: http://hdl.handle.net/10289/1446
Logistic Model Trees have been shown to be very accurate and compact classifiers . Their greatest disadvantage is the computational complexity of inducing the logistic regression models in the tree. We address this issue by using the AIC criterion  instead of cross-validation to prevent overfitting these models. In addition, a weight trimming heuristic is used which produces a significant speedup. We compare the training time and accuracy of the new induction process with the original one on various datasets and show that the training time often decreases while the classification accuracy diminishes only slightly.