Clustering with finite data from semi-parametric mixture distributions

dc.contributor.authorWang, Yong
dc.contributor.authorWitten, Ian H.
dc.date.accessioned2008-10-17T03:51:19Z
dc.date.available2008-10-17T03:51:19Z
dc.date.issued1999-11
dc.description.abstractExisting clustering methods for the semi-parametric mixture distribution perform well as the volume of data increases. However, they all suffer from a serious drawback in finite-data situations: small outlying groups of data points can be completely ignored in the clusters that are produced, no matter how far away they lie from the major clusters. This can result in unbounded loss if the loss function is sensitive to the distance between clusters. This paper proposes a new distance-based clustering method that overcomes the problem by avoiding global constraints. Experimental results illustrate its superiority to existing methods when small clusters are present in finite data sets; they also suggest that it is more accurate and stable than other methods even when there are no small clusters.en_US
dc.format.mimetypeapplication/pdf
dc.identifier.citationWang, Y. & Witten, H. (1999). Clustering with finite data from semi-parametric mixture distributions. (Working paper 99/14). Hamilton, New Zealand: University of Waikato, Department of Computer Science.en_US
dc.identifier.issn1170-487X
dc.identifier.urihttps://hdl.handle.net/10289/1043
dc.language.isoen
dc.publisherDept. of Computer Science, University of Waikatoen_NZ
dc.relation.ispartofseriesComputer Science Working Papers
dc.subjectcomputer scienceen_US
dc.subjectMachine learning
dc.titleClustering with finite data from semi-parametric mixture distributionsen_US
dc.typeWorking Paperen_US
pubs.elements-id55034
pubs.place-of-publicationHamiltonen_NZ
uow.relation.series99/14
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
uow-cs-wp-1999-14.pdf
Size:
569.21 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.8 KB
Format:
Item-specific license agreed upon to submission
Description: