Research Commons

      Making better use of global discretization

      Frank, Eibe; Witten, Ian H.
      Files
      making better use of global discretization.pdf
      184.4Kb
      Link
       www-ai.ijs.si
      Citation
      Frank, E. & Witten, I.H. (1999). Making better use of global discretization. In Proceedings of the 16th International Conference on Machine Learning, Bled, Slovenia (pp. 115-123). San Francisco: Morgan Kaufmann Publishers.
      Permanent Research Commons link: https://hdl.handle.net/10289/1507
      Abstract
      Before applying learning algorithms to datasets, practitioners often globally discretize any numeric attributes. If the algorithm cannot handle numeric attributes directly, prior discretization is essential. Even if it can, prior discretization often accelerates induction, and may produce simpler and more accurate classifiers.

      As it is generally done, global discretization denies the learning algorithm any chance of taking advantage of the ordering information implicit in numeric attributes. However, a simple transformation of discretized data preserves this information in a form that learners can use. We show that, compared to using the discretized data directly, this transformation significantly increases the accuracy of decision trees built by C4.5, decision lists built by PART, and decision tables built using the wrapper method, on several benchmark datasets. Moreover, it can significantly reduce the size of the resulting classifiers.

      This simple technique makes global discretization an even more useful tool for data preprocessing.
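
      The abstract does not spell out the transformation. As an illustration only, and as an assumption rather than the authors' stated method, one way to keep ordering information after discretization is to replace an attribute that has k ordered intervals with k-1 binary attributes, where the i-th one records whether the value lies at or above the i-th cut point. The cut points, sample values, and function names below are hypothetical.

```python
from bisect import bisect_right


def discretize(value, cut_points):
    """Map a numeric value to an interval index in 0..k-1.

    cut_points must be sorted; a value equal to a cut point falls
    into the interval that starts at that cut point.
    """
    return bisect_right(cut_points, value)


def ordered_binary_encoding(interval, k):
    """Encode an interval index (0..k-1) as k-1 binary attributes.

    Attribute i is 1 when the original value is at or above cut point i,
    so neighbouring intervals differ in exactly one attribute.
    """
    return [1 if interval > i else 0 for i in range(k - 1)]


# Hypothetical cut points, e.g. produced by some global discretizer.
cut_points = [2.0, 5.0, 9.0]
k = len(cut_points) + 1  # number of intervals

values = [1.2, 4.7, 5.0, 12.3]
intervals = [discretize(v, cut_points) for v in values]
encoded = [ordered_binary_encoding(j, k) for j in intervals]
```

      With this encoding, a learner that tests a single binary attribute is in effect testing a threshold on the original numeric attribute, which a plain nominal coding of the intervals does not allow.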
      Date
      1999
      Type
      Conference Contribution
      Publisher
      Morgan Kaufmann Publishers Inc., San Francisco, CA, USA
      Rights
      This article has been published in Proceedings of the 16th International Conference on Machine Learning, Bled, Slovenia (pp. 115-123). ©1999 Morgan Kaufmann.
      Collections
      • Computing and Mathematical Sciences Papers [1455]
      The University of Waikato - Te Whare Wānanga o Waikato