Clustering mixed data

Abstract

Mixture model clustering proceeds by fitting a finite mixture of multivariate distributions to data, the fitted mixture density then being used to allocate the data to one of the components. Common model formulations assume that either all the attributes are continuous or all the attributes are categorical. In this paper, we consider options for model formulation in the more practical case of mixed data: multivariate data sets that contain both continuous and categorical attributes.

Citation

Hunt, L.A. & Jorgensen, M.A. (2011). Clustering mixed data. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 1(4), 352-361.

Series name

Date

Publisher

Wiley

Degree

Type of thesis

Supervisor