Syllable HMM based Mandarin TTS and comparison with concatenative TTS

Zhiwei Shuang; Shiyin Kang; Qin Shi; Yong Qin; Lianhong Cai

INTERSPEECH 2009

Conference paper

26 Nov 2009

Abstract

This paper introduces a syllable HMM based Mandarin TTS system. Each syllable is modeled by a 10-state left-to-right HMM. We leverage the corpus and the front end of a concatenative TTS system to build the syllable HMM based TTS system. Furthermore, we exploit the distinctive consonant/vowel structure of Mandarin syllables to improve the voiced/unvoiced decision for HMM states. Evaluation results show that the syllable HMM based Mandarin TTS system, with a model size of 5.3 MB, achieves an overall quality close to that of a concatenative TTS system with a data size of 1 GB. Copyright © 2009 ISCA.
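
As an illustration of the 10-state left-to-right topology mentioned in the abstract, the following is a minimal sketch of a transition matrix in which each state may only stay in place or advance to the next state. The no-skip structure, the function name `left_to_right_transitions`, and the self-loop probability of 0.6 are assumptions made for this example; the abstract does not specify them, and in a trained system the probabilities would be estimated from data.

```python
def left_to_right_transitions(num_states=10, self_loop=0.6):
    """Build a left-to-right, no-skip HMM transition matrix as nested lists.

    Hypothetical helper: the topology details and the self_loop value are
    illustrative assumptions, not parameters taken from the paper.
    """
    if num_states < 1:
        raise ValueError("num_states must be at least 1")
    if not 0.0 <= self_loop < 1.0:
        raise ValueError("self_loop must be in [0, 1)")

    matrix = [[0.0] * num_states for _ in range(num_states)]
    for i in range(num_states):
        if i == num_states - 1:
            # Last state: kept as a self-loop here; leaving the syllable
            # model is assumed to be handled outside this sketch.
            matrix[i][i] = 1.0
        else:
            matrix[i][i] = self_loop            # stay in the current state
            matrix[i][i + 1] = 1.0 - self_loop  # advance to the next state
    return matrix


transitions = left_to_right_transitions()
```

Every entry below the diagonal is zero, so a state sequence can never move backwards within a syllable, which is what "left-to-right" means here.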
