Show simple item record  

dc.contributor.authorWang, Yong
dc.contributor.authorWitten, Ian H.
dc.date.accessioned2008-10-17T03:51:19Z
dc.date.available2008-10-17T03:51:19Z
dc.date.issued1999-11
dc.identifier.citationWang, Y. & Witten, H. (1999). Clustering with finite data from semi-parametric mixture distributions. (Working paper 99/14). Hamilton, New Zealand: University of Waikato, Department of Computer Science.en_US
dc.identifier.issn1170-487X
dc.identifier.urihttps://hdl.handle.net/10289/1043
dc.description.abstractExisting clustering methods for the semi-parametric mixture distribution perform well as the volume of data increases. However, they all suffer from a serious drawback in finite-data situations: small outlying groups of data points can be completely ignored in the clusters that are produced, no matter how far away they lie from the major clusters. This can result in unbounded loss if the loss function is sensitive to the distance between clusters. This paper proposes a new distance-based clustering method that overcomes the problem by avoiding global constraints. Experimental results illustrate its superiority to existing methods when small clusters are present in finite data sets; they also suggest that it is more accurate and stable than other methods even when there are no small clusters.en_US
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.publisherDept. of Computer Science, University of Waikatoen_NZ
dc.relation.ispartofseriesComputer Science Working Papers
dc.subjectcomputer scienceen_US
dc.subjectMachine learning
dc.titleClustering with finite data from semi-parametric mixture distributionsen_US
dc.typeWorking Paperen_US
uow.relation.series99/14
pubs.elements-id55034
pubs.place-of-publicationHamiltonen_NZ


Files in this item

This item appears in the following Collection(s)

Show simple item record