About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
Qinghua Daxue Xuebao/Journal of Tsinghua University
Paper
IBM GALE Mandarin transcription system
Abstract
An automatic transcription of Mandarin broadcast speech system was developed at IBM under the DARPA GALE program. In particular, this system applies a discriminative acoustic model training method and a new topic-adaptive language modeling technique to achieve the best recognition performance using multiple pass decoding. Results are given for three Gale test sets designed to cover both the broadcast news and the broadcast conversation domains. The transcription system achieves satisfactory performance on these datasets. The recognition errors are highly dependent on the speaking style, speech overlap and accent, which helps steer future research.