A study of bootstrapping with multiple acoustic features for improved automatic speech recognition

Xiaodong Cui; Jian Xue; Bing Xiang; Bowen Zhou

INTERSPEECH 2009

Conference paper

26 Nov 2009

A study of bootstrapping with multiple acoustic features for improved automatic speech recognition

Abstract

This paper investigates a scheme of bootstrapping with multiple acoustic features (MFCC, PLP and LPCC) to improve the overall performance of automatic speech recognition. In this scheme, a Gaussian mixture distribution is estimated for each type of feature resampled in each HMM state by single-pass retraining on a shared decision tree. Thus obtained acoustic models based on the multiple features are combined by likelihood averaging during decoding. Experiments on large vocabulary spontaneous speech recognition show its superior overall performance than the best of acoustic models from individual features. It also achieves comparable performance to Recognizer Output Voting Error Reduction (ROVER) with computational advantages. Copyright © 2009 ISCA.

Paper