A Scalable Asynchronous Distributed Algorithm for Topic Modeling

16 Dec 2014Hsiang-Fu YuCho-Jui HsiehHyokun YunS. V. N VishwanathanInderjit S. Dhillon

Learning meaningful topic models with massive document collections which contain millions of documents and billions of tokens is challenging because of two reasons: First, one needs to deal with a large number of topics (typically in the order of thousands). Second, one needs a scalable and efficient way of distributing the computation across multiple machines... (read more)

PDF Abstract

Evaluation Results from the Paper

  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.