By Topic

Audio Clips Retrieval Using Anchor Reference Space and Latent Semantic Analysis

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

1 Author(s)
Konstantin Biatov ; Fraunhofer IAIS, St. Augustin, Germany

This paper describes a technique for audio clips retrieval. The audio clips are modeled using a common universal codebook. The codebook is based on a bag-of-features (BOF). The features extracted from all clips are grouped into clusters using the k-means algorithm. The individual audio clips are modeled by the normalized distribution of the numbers of cluster bins. The latent semantic indexing (LSI) is applied to the feature-audio clip matrix to represent the data in latent semantic space. Then the primary audio clip description is converted to the vector in anchor reference space. Each component of the anchor vector is a probabilistic similarity between this clip and the clip corresponding to the considered component. Then LSI is applied to new feature-audio clip matrix, mapping the data to the latent semantic space based on anchor representation. For audio retrieval the nearest-neighbor (NN) algorithm is exploited. The described algorithm demonstrates high retrieval performance.

Published in:

Multimedia, 2009. ISM '09. 11th IEEE International Symposium on

Date of Conference:

14-16 Dec. 2009