Publication
INTERSPEECH 2017
Conference paper

Fast neural network language model lookups at N-Gram speeds

Abstract

Feed-forward Neural Network Language Models (NNLMs) have shown consistent gains over backoff word n-gram models in a variety of tasks. However, backoff n-gram models remain dominant in applications with real-time decoding requirements, as their word probabilities can be computed orders of magnitude faster than those of an NNLM. In this paper, we present a combination of techniques that speeds up probability computation from a neural network language model to make it comparable to the word n-gram model, without any approximations. We present results on state-of-the-art systems for broadcast news transcription and conversational speech that demonstrate the speed improvements in real-time factor and probability computation while retaining the WER gains from the NNLM.
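To make the speed gap concrete, the sketch below (illustrative only, not the method proposed in the paper) contrasts a backoff n-gram lookup, which is essentially a constant-time table probe, with a naive feed-forward NNLM query, whose cost is dominated by a softmax over the full output vocabulary, and shows the kind of history-level caching that decoders commonly use to amortize that cost. All names, sizes, and the caching scheme are hypothetical assumptions for illustration.

# Illustrative sketch only -- not the technique from this paper.
import numpy as np

VOCAB = 50_000      # hypothetical output vocabulary size
EMBED = 128         # hypothetical embedding size
HIDDEN = 512        # hypothetical hidden-layer size
ORDER = 4           # 4-gram-style history of 3 preceding words

rng = np.random.default_rng(0)
embeddings = rng.standard_normal((VOCAB, EMBED)).astype(np.float32)
W_hidden = rng.standard_normal((EMBED * (ORDER - 1), HIDDEN)).astype(np.float32)
W_output = rng.standard_normal((HIDDEN, VOCAB)).astype(np.float32)

# Backoff n-gram "model": probability lookup is a hash-table probe.
ngram_table = {("the", "cat", "sat"): {"on": 0.42}}

def ngram_prob(history, word):
    return ngram_table.get(history, {}).get(word, 1e-6)  # crude backoff floor

def nnlm_distribution(history_ids):
    """Full NNLM forward pass: the softmax over VOCAB dominates the cost."""
    x = embeddings[history_ids].reshape(-1)   # concatenate history embeddings
    h = np.tanh(x @ W_hidden)                 # hidden layer
    logits = h @ W_output                     # VOCAB-sized output layer
    logits -= logits.max()                    # numerical stability
    p = np.exp(logits)
    return p / p.sum()

# History-level cache: many decoding hypotheses share the same truncated
# history, so the expensive forward pass can be reused across word queries.
_cache = {}

def nnlm_prob(history_ids, word_id):
    key = tuple(history_ids)
    if key not in _cache:
        _cache[key] = nnlm_distribution(history_ids)
    return _cache[key][word_id]

if __name__ == "__main__":
    hist = [11, 42, 7]
    print("n-gram lookup:", ngram_prob(("the", "cat", "sat"), "on"))
    print("NNLM lookup  :", nnlm_prob(hist, 123))  # first call pays the full softmax
    print("cached lookup:", nnlm_prob(hist, 456))  # reuses the same distribution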

Date

20 Aug 2017
