Transformer-XL: Attentive Language Models beyond a Fixed-Length Context
Transformers have the potential to learn longer-term dependency, but are limited by a fixed-length context in the setting of language modeling. We propose a novel neural architecture, Transformer-XL, that enables learning dependency beyond a fixed length without disrupting temporal coherence.
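To make the fixed-length-context limitation concrete, below is a minimal, hypothetical PyTorch sketch of the segment-level recurrence idea: hidden states from the previous segment are cached and reused as extra attention context, with gradients stopped at the segment boundary. The class name `SegmentRecurrentAttention` and all hyperparameters are illustrative assumptions, not the paper's implementation, and the sketch omits the relative positional encodings and causal masking used in the full model.

```python
from typing import Optional

import torch
import torch.nn as nn


class SegmentRecurrentAttention(nn.Module):
    """Self-attention over the current segment plus a cached memory of
    hidden states from the previous segment, so the effective context
    can grow beyond a single fixed-length segment."""

    def __init__(self, d_model: int = 64, n_heads: int = 4, mem_len: int = 32):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.mem_len = mem_len

    def forward(self, x: torch.Tensor, memory: Optional[torch.Tensor] = None):
        # x: (batch, seg_len, d_model); memory: (batch, <=mem_len, d_model) or None.
        context = x if memory is None else torch.cat([memory, x], dim=1)
        # Queries come from the current segment only; keys/values also cover the memory.
        out, _ = self.attn(query=x, key=context, value=context)
        # Cache the most recent hidden states for the next segment, detached so
        # no gradient flows across the segment boundary.
        new_memory = context[:, -self.mem_len:].detach()
        return out, new_memory


if __name__ == "__main__":
    layer = SegmentRecurrentAttention()
    mem = None
    for _ in range(3):                     # process three consecutive segments
        segment = torch.randn(2, 16, 64)   # (batch, seg_len, d_model)
        out, mem = layer(segment, mem)
    print(out.shape, mem.shape)            # (2, 16, 64) and (2, 32, 64)
```

In this sketch, each new segment attends to up to `mem_len` cached states from earlier segments, so information can propagate further than the segment length even though each forward pass still processes a fixed-size chunk.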