Acoustics-only based automatic phonetic baseform generation

B. Ramabhadran; L.R. Bahl; P.V. Desouza; M. Padmanabhan

doi:10.1109/ICASSP.1998.674429

ICASSP 1998

Conference paper

01 Dec 1998

Acoustics-only based automatic phonetic baseform generation

View publication

Abstract

Phonetic baseforms are the basic recognition units in most speech recognition systems. These baseforms are usually determined by linguists once a vocabulary is chosen and not modified thereafter. However, several applications, such as name dialing, require the user be able to add new words to the vocabulary. These new words are often names, or task-specific jargon, that have user-specific pronunciations. This paper describes a novel method for generating phonetic transcriptions (baseforms) of words based on acoustic evidence alone. It does not require either the spelling or any prior acoustic representation of the new word, is vocabulary independent, and does not have any linguistic constraints (pronunciation rules). Our experiments demonstrate the high decoding accuracies obtained when baseforms deduced using this approach are incorporated into our speech recognizer. Also, the error rates on the added words were found to be comparable to or better than when the baseforms were derived by hand. © 1998 IEEE.

Conference paper