Research Commons
      • Browse 
        • Communities & Collections
        • Titles
        • Authors
        • By Issue Date
        • Subjects
        • Types
        • Series
      • Help 
        • About
        • Collection Policy
        • OA Mandate Guidelines
        • Guidelines FAQ
        • Contact Us
      • My Account 
        • Sign In
        • Register
      View Item 
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computer Science Working Paper Series
      • 2002 Working Papers
      • View Item
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computer Science Working Paper Series
      • 2002 Working Papers
      • View Item
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      Benchmarking attribute selection techniques for discrete class data mining

      Hall, Mark A.; Holmes, Geoffrey
      Thumbnail
      Files
      uow-cs-wp-2002-02.pdf
      1.024Mb
      Find in your library  
      Citation
      Export citation
      Holmes, G., Pfahringer, B., Kirkby, R., Frank, E. & Hall, M. (2002). Benchmarking attribute selection techniques for discrete class data mining. (Working paper 02/02). Hamilton, New Zealand: University of Waikato, Department of Computer Science.
      Permanent Research Commons link: https://hdl.handle.net/10289/1013
      Abstract
      Data engineering is generally considered to be a central issue in the development of data mining applications. The success of many learning schemes, in their attempts to construct models of data, hinges on the reliable identification of a small set of highly predictive attributes. The inclusion of irrelevant, redundant and noisy attributes in the model building process phase can result in poor predictive performance and increased computation.

      Attribute selection generally involves a combination of search and attribute utility estimation plus evaluation with respect to specific learning schemes. This leads to a large number of possible permutation and has led to a situation where very few benchmark studies have been conducted.

      This paper presents a benchmark comparison of several attribute selection methods for supervised classification. All the methods produce an attribute ranking, a useful devise for isolating the individual merit of an attribute. Attribute selection is achieved by cross-validating the attribute rankings with respect to a classification learner to find the best attributes. Results are reported for a selection of standard data sets and two diverse learning schemes C4.5 and naïve Bayes.
      Date
      2002-04
      Type
      Working Paper
      Series
      Computer Science Working Papers
      Report No.
      02/02
      Publisher
      University of Waikato, Department of Computer Science
      Collections
      • 2002 Working Papers [12]
      Show full item record  

      Usage

      Downloads, last 12 months
      355
       
       

      Usage Statistics

      For this itemFor all of Research Commons

      The University of Waikato - Te Whare Wānanga o WaikatoFeedback and RequestsCopyright and Legal Statement