About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
INTERSPEECH - Eurospeech 2003
Conference paper
Automatic baseform generation from acoustic data
Abstract
We describe two algorithms for generating pronunciation networks from acoustic data. One is based on raw phonetic recognition and the other uses the spelling of the words and the identification of their language of origin as guides. In both cases, a pruning and voting procedure distills the noisy phonetic sequences into pronunciation networks. Recognition experiments on two large, grammar-based, test sets show a reduction of sentence error rates between 2% and 14%, and of word error rate between 3% to 23% when the learned baseforms are added to our baseline lexicons.