Publication
ICASSP 2002
Conference paper

Semantic-audio retrieval

View publication

Abstract

This paper describes a system for connecting sounds and words in linked multi-dimensional vector spaces. The acoustic space is represented using anchor models and partitioned using agglomerative clustering. The semantic space is modeled by a hierarchical multinomial clustering model. Nodes in one space are linked by probabilistic models to the other space. With these linked models, users retrieve sounds with natural language, and the system describes new sounds with words.

Date

Publication

ICASSP 2002

Authors

Share