Efficient Estimation of Word Representations in Vector Space

Abstract

We propose two novel model architectures for computing continuous vector representations of words from very large data sets. The quality of these representations is measured in a word similarity task, and the results are compared to the previ-ously best performing techniques based on different types of neural networks. We observe large improvements in accuracy at much lower computational cost, i.e. it takes less than a day to learn high quality word vectors from a 1.6 billion words data set. Furthermore, we show that these vectors provide state-of-the-art perfor-mance on our test set for measuring syntactic and semantic word similarities.

Keywords

Word (group theory)Computer scienceSimilarity (geometry)Set (abstract data type)Artificial intelligenceVector spaceNatural language processingTask (project management)Test setSemantic similaritySpace (punctuation)Vector space modelArtificial neural networkMathematics

Affiliated Institutions

Related Publications

Glove: Global Vectors for Word Representation

Jeffrey Pennington , Richard Socher , Christopher D. Manning

Recent methods for learning vector space representations of words have succeeded in capturing fine-grained semantic and syntactic regularities using vector arithmetic, but the o...

2014 32840 citations

Finding Structure in Time

Jeffrey L. Elman

Time underlies many interesting human behaviors. Thus, the question of how to represent time in connectionist models is very important. One approach is to represent time implici...

1990 Cognitive Science 10427 citations

Parallel networks that learn to pronounce English text

Terrence J. Sejnowski

This paper describes NETtalk, a class of massively-parallel network systems that learn to convert English text to speech. The memory representations for pronunciations are learn...

1987 1556 citations

Learning the hidden structure of speech

Jeffrey L. Elman , David Zipser

In the work described here, the backpropagation neural network learning procedure is applied to the analysis and recognition of speech. This procedure takes a set of input/outpu...

1988 The Journal of the Acoustical Society... 269 citations

Exemplar-Based Sparse Representation Features: From TIMIT to LVCSR

Tara N. Sainath , Bhuvana Ramabhadran , Michael Picheny +2 more

The use of exemplar-based methods, such as support vector machines (SVMs), k-nearest neighbors (kNNs) and sparse representations (SRs), in speech recognition has thus far been l...

2011 IEEE Transactions on Audio Speech and... 65 citations

Publication Info

Year: 2013
Type: preprint
Citations: 11710
Access: Closed

External Links

Citation Metrics

11710

OpenAlex

Cite This

APA Style

                            
                                    Tomáš Mikolov, 
                                
                                    Kai Chen, 
                                
                                    Greg S. Corrado
                                
                                et al.
                            
                            (2013). 
                            Efficient Estimation of Word Representations in Vector Space. 
                            arXiv (Cornell University)
                            
                            .