Semantic-audio retrieval

Malcolm Slaney

doi:10.1109/icassp.2002.5745561

ICASSP 2002

Conference paper

13 May 2002

Semantic-audio retrieval

View publication

Abstract

This paper describes a system for connecting sounds and words in linked multi-dimensional vector spaces. The acoustic space is represented using anchor models and partitioned using agglomerative clustering. The semantic space is modeled by a hierarchical multinomial clustering model. Nodes in one space are linked by probabilistic models to the other space. With these linked models, users retrieve sounds with natural language, and the system describes new sounds with words.

Conference paper