Carnegie Mellon University

Online Inference for the Infinite Topic-Cluster Model: Storylines from Streaming Text

journal contribution
posted on 2011-04-01, 00:00 authored by Amr Ahmed, Qirong Ho, Choon Hui Teo, Jacob Eisenstein, Alex Smola, Eric P. Xing

We present the time-dependent topic-cluster model, a hierarchical approach that combines Latent Dirichlet Allocation with clustering via the Recurrent Chinese Restaurant Process. It inherits the advantages of both constituents, namely interpretability and concise representation. We show how it can be applied to streaming collections of objects, such as real-world feeds in a news portal. We detail a parallel Sequential Monte Carlo algorithm for inference in the resulting graphical model; the algorithm scales to hundreds of thousands of documents.
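
As a rough illustration of the clustering prior named in the abstract, the sketch below computes Recurrent Chinese Restaurant Process assignment probabilities for the next document in the current epoch. It is not the authors' implementation. The function name, the exponential decay kernel, and the parameters gamma (concentration), lam (decay width) and window (number of past epochs considered) are assumptions chosen for illustration, and the sketch covers the prior only, without the topic model or the Sequential Monte Carlo sampler.

```python
import math
from collections import Counter

NEW = "new"  # hypothetical key marking "open a new cluster"


def rcrp_prior(past_counts, current_counts, gamma=1.0, lam=1.0, window=3):
    """Prior over cluster assignments for the next document in this epoch.

    past_counts: list of Counter objects (cluster id -> size), oldest first.
    current_counts: Counter of cluster sizes seen so far in this epoch.
    Returns a dict mapping each cluster id, plus NEW, to a probability.
    Cluster ids are assumed never to equal the NEW key.
    """
    weights = Counter()

    # Mass inherited from the last `window` epochs, decayed by age
    # (assumed exponential kernel exp(-age / lam)).
    recent = past_counts[-window:] if window > 0 else []
    for age, counts in enumerate(reversed(recent), start=1):
        decay = math.exp(-age / lam)
        for cluster, size in counts.items():
            weights[cluster] += decay * size

    # Documents already assigned in the current epoch count at full weight.
    for cluster, size in current_counts.items():
        weights[cluster] += size

    # A new cluster is opened with weight proportional to gamma.
    weights[NEW] = gamma

    total = sum(weights.values())
    return {cluster: w / total for cluster, w in weights.items()}


if __name__ == "__main__":
    past = [Counter({0: 4}), Counter({0: 2, 1: 3})]
    current = Counter({1: 1})
    for cluster, p in rcrp_prior(past, current, window=2).items():
        print(cluster, round(p, 3))
```

In this sketch a cluster that was popular in recent epochs keeps attracting documents even before it has members in the current epoch, while clusters that fall outside the window, or whose decayed mass becomes negligible, effectively die out.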

History

Publisher Statement

Copyright 2011 by the authors

Date

2011-04-01
