Publication
ICSLP 2004
Conference paper
Discriminative training of compound-word based multinomial classifiers for speech routing
Abstract
We describe a method for utterance classification for speech routing based on a discriminatively trained multinomial topic classifier. We propose using the n-norm a posteriori topic probability of the speech hypothesis as the objective function of a discriminative training step, and explore various ways to compute and maximize this function with respect to the model parameters. To avoid obtaining negative probability estimates, we propose an alternative representation of the model parameterization. We combine our approach with a simple non-discriminative word detection algorithm based on mutual information and with a technique for identifying the most salient phrases of the domain, in order to alleviate the feature sparsity problem. We also explore post-processing of the classification results with a decision tree model and a neural network to further improve classification performance. Overall, our discriminatively trained multinomial system reduces the classification error rate by up to 45% on an NLU task of financial transactions compared to our baseline non-discriminative multinomial system.
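As a rough illustration of the ingredients named in the abstract, the sketch below computes topic posteriors for a multinomial classifier and evaluates a norm-style objective over the posteriors of the correct topics. The abstract gives no formulas, so everything here is an assumption: the softmax reparameterization is only one common way to keep probability estimates positive and is not necessarily the representation proposed in the paper, the power-mean form of `n_norm_objective` is one possible reading of "n-norm a posteriori topic probability", and all function and parameter names are hypothetical.

```python
import math


def softmax(scores):
    """Map unconstrained scores to positive values that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def topic_posteriors(counts, log_priors, theta):
    """Posterior P(topic | utterance) under a multinomial model.

    counts     : word (or compound-word) counts for one utterance
    log_priors : log P(topic) for each topic
    theta      : theta[t][w] are unconstrained parameters; the word
                 probabilities are softmax(theta[t]), so they can never
                 become negative (assumed reparameterization, not
                 necessarily the one used in the paper)
    """
    log_scores = []
    for t, row in enumerate(theta):
        word_probs = softmax(row)
        log_lik = sum(c * math.log(p) for c, p in zip(counts, word_probs))
        log_scores.append(log_priors[t] + log_lik)
    # Normalizing the joint scores over topics gives the posterior.
    return softmax(log_scores)


def n_norm_objective(correct_posteriors, n):
    """Power mean of the correct-topic posteriors (assumed objective form)."""
    mean_pow = sum(p ** n for p in correct_posteriors) / len(correct_posteriors)
    return mean_pow ** (1.0 / n)


if __name__ == "__main__":
    theta = [[2.0, 0.0], [0.0, 2.0]]        # two topics, two vocabulary items
    log_priors = [math.log(0.5), math.log(0.5)]
    post = topic_posteriors([3, 0], log_priors, theta)
    print(post)
    print(n_norm_objective([post[0]], n=2))
```

In a discriminative training step, an objective of this kind would be maximized with respect to `theta`, for example by gradient ascent; because the parameters are unconstrained, the updates cannot push any word probability below zero.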