About cookies on this site Our websites require some cookies to function properly (required). In addition, other cookies may be used with your consent to analyze site usage, improve the user experience and for advertising. For more information, please review your options. By visiting our website, you agree to our processing of information as described in IBM’sprivacy statement. To provide a smooth navigation, your cookie preferences will be shared across the IBM web domains listed here.
Publication
ICME 2002
Conference paper
Mixtures of probability experts for audio retrieval and indexing
Abstract
This paper describes a system for connecting nonspeech sounds and words using linked multidimensional vector spaces. An approach based on a mixture of experts learns the mapping between one space and the other. This paper describes the conversion of audio and semantic data into their respective vector spaces. Two different mixture-of-probability-expert models are trained to learn the association between acoustic queries and the corresponding semantic explanation, and vice versa. Test results are presented based on commercial sound effects CDs.