Using audio time scale modification for video browsing
Abstract
In the IBM CueVideo project we study various aspects of fully automated video indexing, browsing and retrieval. The engineering aspects include audio processing, speech recognition, image processing and information retrieval. Equally important, however, is exploring user expectations and conducting user studies. We focus on the field of video for Training and Education, including Distributed Learning, Remote Education, and Just-in-Time Learning. This paper describes the use of audio processing technology, namely audio Time Scale Modification (TSM), for the novel application of fast video browsing and efficient Video-based learning. The paper provides a brief overview of the CueVideo system, technical background of TSM technology, and the way it is being used in our system. We have conducted a usability study on the effect of TSM on speech comprehension. The results will be included in the final version.