About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
ICASSP 1993
Conference paper
Supervised approach to the construction of context-sensitive acoustic prototypes
Abstract
The performance of a large vocabulary speech recognition system is critically tied to the quality of the acoustic prototypes that are established in the relevant feature space(s). This is especially true when only a limited amount of training data is available to extract information about pronunciation variability. To better account for co-articulation effects, we describe a supervised strategy for the construction of context-sensitive acoustic prototypes. The idea is to incorporate contextual supervision to relate the allophonic models to their acoustic manifestations. This makes for a better utilization of the available training data, while at the same time allowing for a short design time turn around. The performance of this method is illustrated on an isolated utterance speech recognition task with a vocabulary of 20,000 words.