About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
ICSLP 2002
Conference paper
Generating script using statistical information of the context variation unit vector
Abstract
A statistical selection method is proposed for generating an optimized recording script for Concatenative Speech Synthesizer. This method starts with traveling a large text corpus to collect the statistical information of the Context Variation Unit Vectors (CVUV), which represent the multi-dimension phonetic contexts and properties of the synthesis unit. Each CVUV descriptor is organized as a node in a sorted tree of the CVUV forest to record the dimension values and the index to its position in the corpus. Then it selects sentences according to the pre-defined criteria relating to the CVUV distribution in the corpus. This selection algorithm has been implemented to generate syllable-based Chinese script and yielded satisfactory results. The context dimension definition concept is described in this paper, and the coverage analysis and computing time estimation are reported also.