Loading...
Abstract
The two dominant schemes for rule-learning, C4.5 and RIPPER, both operate in two stages. First, they induce an initial ruleset and then they refine it using a rather complex optimization stage that discards (C4.5) or adjusts (RIPPER) individual rules to make them work better together. In contrast, this paper shows how good rule sets can be learned one rule at a time, without any need for global optimization. We present an algorithm for inferring rules by repeatedly generating partial decision trees, thus combining the two major paradigms for rule generation—creating rules from decision trees and the separate-and-conquer rule-learning technique. The algorithm is straightforward and elegant: despite this, experiments on standard datasets show that it produces rulesets that are as accurate as and of similar size to those generated by C4.5, and more accurate than RIPPER’s. Moreover, it operates efficiently, and because it avoids postprocessing, does not suffer the extremely slow performance on pathological example sets for which the C4.5 method has been criticized.
Type
Conference Contribution
Type of thesis
Series
Citation
Frank, E., & Witten, I. H. (1998). Generating accurate rule sets without global optimization. ICML'98 15th International Conference on Machine Learning, 144-151.
Date
1998-07-24
Publisher
Morgan Kaufmann Publishers Inc.
Degree
Supervisors
Rights
This is an author’s accepted version of a conference paper published in ICML'98 15th International Conference on Machine Learning. © 1998 ACM.