About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
ICSLP 2004
Conference paper
Task adaptation of acoustic and language models based on large quantities of data
Abstract
We investigate use of large amounts, over 1500 hours, of untran-scribed data recorded from a deployed conversational system to improve the acoustic and language models. The system that we considered allows users to perform transactions on their retirement accounts. Using all the untranscribed data we get over 19% relative improvement in word error rate over a baseline system. In contrast, a system built using 70 hours of transcribed data results in over 31% relative improvement.