Abstract

Teaching machines to read natural language documents remains an elusive challenge. Machine reading systems can be tested on their ability to answer questions posed on the contents of documents that they have seen, but until now large scale training and test datasets have been missing for this type of evaluation. In this work we define a new methodology that resolves this bottleneck and provides large scale supervised reading comprehension data. This allows us to develop a class of attention based deep neural networks that learn to read real documents and answer complex questions with minimal prior knowledge of language structure.

Keywords

Computer scienceBottleneckReading (process)Reading comprehensionArtificial intelligenceClass (philosophy)Natural language processingScale (ratio)Test (biology)Question answeringComprehensionArtificial neural networkMachine learningLinguisticsProgramming language

Affiliated Institutions

Related Publications

Publication Info

Year
2015
Type
preprint
Citations
1527
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1527
OpenAlex

Cite This

Karl Moritz Hermann, Tomáš Kočiský, Edward Grefenstette et al. (2015). Teaching Machines to Read and Comprehend. arXiv (Cornell University) . https://doi.org/10.48550/arxiv.1506.03340

Identifiers

DOI
10.48550/arxiv.1506.03340

Data Quality

Data completeness: 77%