A fast, accurate approximation to log likelihood of Gaussian mixture models

Pierre L. Dognin; Vaibhava Goel; John R. Hershey; Peder A. Olsen

doi:10.1109/ICASSP.2009.4960459

ICASSP 2009

Conference paper

23 Sep 2009

A fast, accurate approximation to log likelihood of Gaussian mixture models

View publication

Abstract

It has been a common practice in speech recognition and elsewhere to approximate the log likelihood of a Gaussian mixture model (GMM) with the maximum component log likelihood. While often a computational necessity, the max approximation comes at a price of inferior modeling when the Gaussian components significantly overlap. This paper shows how the approximation error can be reduced by changing component priors. In our experiments the loss in word error rate due to max approximation, albeit small, is reduced by 50-100% at no cost in computational efficiency. Furthermore, we expect acoustic models will become larger with time and increase component overlap and word error rate loss. This makes reducing the approximation error more relevant. The techniques considered do not use the original data and can easily be applied as a post-processing step to any GMM. ©2009 IEEE.

Conference paper