Retrieval from spoken documents using content and speaker information

Mahesh Viswanathan; Homayoon S.M. Beigi; Satya Dharanipragada; Alain Tritschler

doi:10.1109/ICDAR.1999.791851

ICDAR 1999

Conference paper

20 Sep 1999

Retrieval from spoken documents using content and speaker information

View publication

Abstract

There has been a recent upsurge in the deployment of emerging technologies such as speech and speaker recognition which are reaching maturity. We discuss the details of the components required to build a system for audio indexing and retrieval for spoken documents using content and speaker based information facilitated by speech and speaker recognition. The real power of spoken document analysis is in using both content and speaker information together in retrieval by combining the results. The experiments described here are in the broadcast news domain, but the underlying techniques can easily be extended to other speech-centric applications and transactions.

Conference paper