Abstract

This paper describes a high-performance sampling architecture for inference of latent topic models on a cluster of workstations. Our system is over an order of magnitude faster than previous work and scales to hundreds of millions of documents and thousands of topics. The algorithm relies on a novel communication structure: a distributed (key, value) store is used to synchronize the sampler state between computers. This architecture entirely obviates the need for separate computation and synchronization phases; instead, disk, CPU, and network are used simultaneously to achieve high performance. We show that the architecture is fully general and extends easily to more sophisticated latent variable models such as n-grams and hierarchies.
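
The abstract's central idea, continuously reconciling sampler state through a shared (key, value) store rather than pausing for a synchronization phase, can be illustrated with a short sketch. The Python below is not the authors' implementation: the KVStore class, NUM_TOPICS, and the snapshot-based merge rule are illustrative assumptions, and a lock-guarded dict stands in for a distributed service.

```python
# Illustrative sketch only, not the paper's implementation.
# Each machine samples topics locally and accumulates count changes.
# A synchronization step pushes the local delta into a shared
# (key, value) store and pulls back the merged global counts, so
# sampling never has to stop for a cluster-wide barrier.
import threading

NUM_TOPICS = 4  # hypothetical; the paper reports thousands of topics

class KVStore:
    """Stand-in for a distributed (key, value) store."""

    def __init__(self):
        self._lock = threading.Lock()
        self._data = {}

    def merge(self, key, delta):
        """Atomically add a delta to the stored per-topic counts and
        return the merged result (an atomic read-modify-write)."""
        with self._lock:
            current = self._data.get(key, [0] * NUM_TOPICS)
            merged = [c + d for c, d in zip(current, delta)]
            self._data[key] = merged
            return merged

def synchronize(store, local, snapshot, word):
    """One out-of-band sync step for one word: push the counts
    accumulated since the last snapshot, then adopt the global view."""
    delta = [l - s for l, s in zip(local[word], snapshot[word])]
    merged = store.merge(word, delta)
    local[word] = list(merged)     # local state now reflects all machines
    snapshot[word] = list(merged)  # baseline for the next reconciliation

# Two machines contribute counts for the same word; after a second
# round of synchronization, machine 1 sees the combined totals.
store = KVStore()
m1 = {"local": {"river": [3, 0, 1, 0]}, "snap": {"river": [0] * NUM_TOPICS}}
m2 = {"local": {"river": [0, 2, 0, 0]}, "snap": {"river": [0] * NUM_TOPICS}}
synchronize(store, m1["local"], m1["snap"], "river")
synchronize(store, m2["local"], m2["snap"], "river")
synchronize(store, m1["local"], m1["snap"], "river")
print(m1["local"]["river"])  # [3, 2, 1, 0]
```

In the architecture the abstract describes, this reconciliation runs continuously in the background while sampling proceeds, which is what allows disk, CPU, and network to be used simultaneously instead of in alternating phases.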

Keywords

Computer science, Synchronization, Architecture, Computation, Distributed computing, Inference, Workstation, Key, Parallel computing, Theoretical computer science, Artificial intelligence, Algorithm, Operating system, Computer network

Publication Info

Year: 2010
Type: article
Volume: 3
Issue: 1-2
Pages: 703-710
Citations: 424
Access: Closed

Citation Metrics

424 citations (OpenAlex)

Cite This

Alexander J. Smola, Shravan Narayanamurthy (2010). An architecture for parallel topic models. Proceedings of the VLDB Endowment, 3(1-2), 703-710. https://doi.org/10.14778/1920841.1920931

Identifiers

DOI: 10.14778/1920841.1920931