Carnegie Mellon University
Parallel Markov Chain Monte Carlo for Pitman-Yor Mixture Models

journal contribution
posted on 2014-07-01, 00:00 authored by Avinava Dubey, Sinead Williamson, Eric P Xing

The Pitman-Yor process provides an elegant way to cluster data that exhibit power-law behavior, where the number of clusters is unknown or unbounded. Unfortunately, inference in Pitman-Yor process-based models is typically slow and does not scale well with dataset size. In this paper we present new auxiliary-variable representations for the Pitman-Yor process and a special case of the hierarchical Pitman-Yor process that allow us to develop parallel inference algorithms that distribute inference both on the data space and the model space. We show that our method scales well with increasing data while avoiding any degradation in estimate quality.
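
For readers unfamiliar with the clustering behavior the abstract refers to, the following is a minimal sketch of the standard sequential (Chinese restaurant) view of the Pitman-Yor process. It is not the parallel auxiliary-variable sampler presented in the paper. The function name `sample_pitman_yor_crp` and the parameter names `discount` and `concentration` are assumptions chosen for this illustration. Each new point joins an existing cluster k with probability proportional to (size of k minus discount) and opens a new cluster with probability proportional to (concentration plus discount times the current number of clusters).

```python
import random


def sample_pitman_yor_crp(n_points, discount, concentration, seed=0):
    """Assign n_points sequentially to clusters; return the list of cluster sizes.

    Illustrative only: a serial sampler from the Pitman-Yor prior,
    not the parallel inference scheme described in the paper.
    """
    if not (0.0 <= discount < 1.0) or concentration <= -discount:
        raise ValueError("need 0 <= discount < 1 and concentration > -discount")
    rng = random.Random(seed)
    sizes = []  # sizes[k] = number of points currently in cluster k
    for _ in range(n_points):
        if not sizes:
            # The first point always opens the first cluster.
            sizes.append(1)
            continue
        # Unnormalized weights: existing cluster k gets (size_k - discount),
        # a new cluster gets (concentration + discount * number_of_clusters).
        weights = [s - discount for s in sizes]
        weights.append(concentration + discount * len(sizes))
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(sizes):
            sizes.append(1)  # open a new cluster
        else:
            sizes[k] += 1  # join existing cluster k
    return sizes
```

Calling, for example, `sample_pitman_yor_crp(1000, 0.5, 1.0)` returns one cluster size per cluster. Setting `discount` to 0 recovers the Dirichlet process case, while larger discounts tend to produce more small clusters, which is the power-law behavior mentioned above. The loop is inherently sequential, which gives some intuition for why scaling inference in such models is difficult.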
