Carnegie Mellon University
file.pdf (439.48 kB)

Von Mises-Fisher Clustering Models

Download (439.48 kB)
journal contribution
posted on 2014-05-01, 00:00 authored by Siddarth Gopal, Yiming Yang

This paper proposes a suite of models for clustering high-dimensional data on a unit sphere based on von Mises-Fisher (vMF) distribution and for discovering more intuitive clusters than existing approaches. The proposed models include a) A Bayesian formulation of vMF mixture that enables information sharing among clusters, b) a Hierarchical vMF mixture that provides multiscale shrinkage and tree structured view of the data and c) a Temporal vMF mixture that captures evolution of clusters in temporal data. For posterior inference, we develop fast variational methods as well as collapsed Gibbs sampling techniques for all three models. Our experiments on six datasets provide strong empirical support in favour of vMF based clustering models over other popular tools such as K-means, Multinomial Mixtures and Latent Dirichlet Allocation.


Publisher Statement

Copyright 2014 by the author(s).



Usage metrics