Research Commons
      • Browse 
        • Communities & Collections
        • Titles
        • Authors
        • By Issue Date
        • Subjects
        • Types
        • Series
      • Help 
        • About
        • Collection Policy
        • OA Mandate Guidelines
        • Guidelines FAQ
        • Contact Us
      • My Account 
        • Sign In
        • Register
      View Item 
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computer Science Working Paper Series
      • 1999 Working Papers
      • View Item
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computer Science Working Paper Series
      • 1999 Working Papers
      • View Item
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      Clustering with finite data from semi-parametric mixture distributions

      Wang, Yong; Witten, Ian H.
      Thumbnail
      Files
      uow-cs-wp-1999-14.pdf
      569.2Kb
      Find in your library  
      Citation
      Export citation
      Wang, Y. & Witten, H. (1999). Clustering with finite data from semi-parametric mixture distributions. (Working paper 99/14). Hamilton, New Zealand: University of Waikato, Department of Computer Science.
      Permanent Research Commons link: https://hdl.handle.net/10289/1043
      Abstract
      Existing clustering methods for the semi-parametric mixture distribution perform well as the volume of data increases. However, they all suffer from a serious drawback in finite-data situations: small outlying groups of data points can be completely ignored in the clusters that are produced, no matter how far away they lie from the major clusters. This can result in unbounded loss if the loss function is sensitive to the distance between clusters.

      This paper proposes a new distance-based clustering method that overcomes the problem by avoiding global constraints. Experimental results illustrate its superiority to existing methods when small clusters are present in finite data sets; they also suggest that it is more accurate and stable than other methods even when there are no small clusters.
      Date
      1999-11
      Type
      Working Paper
      Series
      Computer Science Working Papers
      Report No.
      99/14
      Publisher
      Dept. of Computer Science, University of Waikato
      Collections
      • 1999 Working Papers [16]
      Show full item record  

      Usage

      Downloads, last 12 months
      74
       
       

      Usage Statistics

      For this itemFor all of Research Commons

      The University of Waikato - Te Whare Wānanga o WaikatoFeedback and RequestsCopyright and Legal Statement