Text understanding with the attention sum reader network

Rudolf Kadlec; Martin Schmid; Ondrej Bajgar; Jan Kleindienst

doi:10.18653/v1/p16-1086

ACL 2016

Conference paper

07 Aug 2016

Abstract

Several large cloze-style context-question-answer datasets have been introduced recently: the CNN and Daily Mail news data and the Children's Book Test. Thanks to the size of these datasets, the associated text comprehension task is well suited for deep-learning techniques, which currently seem to outperform all alternative approaches. We present a new, simple model that uses attention to pick the answer directly from the context, as opposed to computing the answer from a blended representation of words in the document, as is usual in similar models. This makes the model particularly suitable for question-answering problems where the answer is a single word from the document. An ensemble of our models sets a new state of the art on all evaluated datasets.
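The answer-selection step described in the abstract can be illustrated with a minimal sketch. It assumes that each document position already has a scalar relevance score for the question (in the model, this would come from learned document and question encoders, which are omitted here), and that the attention mass is summed over every position where a candidate word occurs, as the model's name suggests. The function name `attention_sum`, the toy tokens, and the scores are hypothetical and not taken from the paper.

```python
import math
from collections import defaultdict


def attention_sum(tokens, scores, candidates):
    """Pick the candidate with the largest summed attention.

    tokens:     document words, one per position
    scores:     hypothetical per-position relevance scores (same length as tokens)
    candidates: list of candidate answer words
    """
    if len(tokens) != len(scores):
        raise ValueError("tokens and scores must have the same length")

    # Softmax over document positions (shifted by the max for numerical stability).
    max_score = max(scores)
    exps = [math.exp(s - max_score) for s in scores]
    total = sum(exps)
    attention = [e / total for e in exps]

    # Sum the attention over every position where each candidate occurs.
    mass = defaultdict(float)
    for token, weight in zip(tokens, attention):
        if token in candidates:
            mass[token] += weight

    # The answer is a word from the document itself, not a blended vector.
    best = max(candidates, key=lambda c: mass[c])
    return best, dict(mass)


# Toy example: "john" has the highest single score, but "mary" occurs twice.
tokens = ["mary", "met", "john", "then", "mary", "left"]
scores = [1.0, 0.0, 1.5, 0.0, 1.0, 0.0]
candidates = ["mary", "john"]

best, probs = attention_sum(tokens, scores, candidates)
print(best, probs)
```

In this toy input, "john" receives the highest attention at any single position, yet "mary" is selected because its two occurrences together accumulate more attention. This is the sense in which the answer is picked directly from the context rather than decoded from a weighted average of word representations.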
