Research Commons
      • Browse 
        • Communities & Collections
        • Titles
        • Authors
        • By Issue Date
        • Subjects
        • Types
        • Series
      • Help 
        • About
        • Collection Policy
        • OA Mandate Guidelines
        • Guidelines FAQ
        • Contact Us
      • My Account 
        • Sign In
        • Register
      View Item 
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computing and Mathematical Sciences Papers
      • View Item
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computing and Mathematical Sciences Papers
      • View Item
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      Comparing classical criteria for selecting intra-class correlated features in Multimix

      Hunt, Lynette Anne; Basford, Kaye E.
      Thumbnail
      Files
      J-Computational Statistics and Analysis.pdf
      Published version, 548.0Kb
      DOI
       10.1016/j.csda.2016.05.018
      Find in your library  
      Citation
      Export citation
      Hunt, L. A., & Basford, K. E. (2016). Comparing classical criteria for selecting intra-class correlated features in Multimix. Computational Statistics & Data Analysis, 103, 350–366. https://doi.org/10.1016/j.csda.2016.05.018
      Permanent Research Commons link: https://hdl.handle.net/10289/12951
      Abstract
      The mixture approach to clustering requires the user to specify both the number of components to be fitted to the model and the form of the component distributions. In the Multimix class of models, the user also has to decide on the correlation structure to be introduced into the model. The behaviour of some commonly used model selection criteria is investigated when using the finite mixture model to cluster data containing mixed categorical and continuous attributes. The performance of these criteria in selecting both the number of components in the model and the form of the correlation structure amongst the attributes when fitting the Multimix class of models is illustrated using simulated data and a real medical data set. It is found that criteria based on the integrated classification likelihood have the best performance in detecting the number of clusters to be fitted to the model and in selecting the form of the component distributions. The performance of the Bayesian information criterion in detecting the correct model depends on the partitioning structure among the attributes while the Akaike information criterion and classification likelihood criterion perform in a less satisfactory way.
      Date
      2016
      Type
      Journal Article
      Publisher
      Elsevier
      Rights
      This is an author’s accepted version of an article published in the journal: Computational Statistics & Data Analysis. © 2016 Elsevier.
      Collections
      • Computing and Mathematical Sciences Papers [1385]
      Show full item record  

      Usage

      Downloads, last 12 months
      109
       
       
       

      Usage Statistics

      For this itemFor all of Research Commons

      The University of Waikato - Te Whare Wānanga o WaikatoFeedback and RequestsCopyright and Legal Statement