About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
Computer Journal
Paper
Speech recognition in the office: how the technology supports dictation
Abstract
Automatic speech recognition by computer, in real-time and using only an input audio signal, has been developed over the last two decades to the point where it is useful for a number of practical applications, such as telephone enquiry services and office dictation. The purpose of talking to a computer is first examined and the inherent reasons for the difficulty of understanding speech are suggested. Three specific problems in the recognition process are highlighted: data rate reduction, speaker variability and application domain dependence. The currently successful methodology used in speech recognition systems is described in non-technical terms. The principal processes required for real-time automatic speech recognition are introduced, and the need for adaption and customization discussed. Some suggestions for the directions of future research and applications are given.