Loading [a11y]/accessibility-menu.js
Large-scale audio feature extraction and SVM for acoustic scene classification | IEEE Conference Publication | IEEE Xplore

Large-scale audio feature extraction and SVM for acoustic scene classification


Abstract:

This work describes a system for acoustic scene classification using large-scale audio feature extraction. It is our contribution to the Scene Classification track of the...Show More

Abstract:

This work describes a system for acoustic scene classification using large-scale audio feature extraction. It is our contribution to the Scene Classification track of the IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events (D-CASE). The system classifies 30 second long recordings of 10 different acoustic scenes. From the highly variable recordings, a large number of spectral, cepstral, energy and voicing-related audio features are extracted. Using a sliding window approach, classification is performed on short windows. SVM are used to classify these short segments, and a majority voting scheme is employed to get a decision for longer recordings. On the official development set of the challenge, an accuracy of 73 % is achieved. SVM are compared with a nearest neighbour classifier and an approach called Latent Perceptual Indexing, whereby SVM achieve the best results. A feature analysis using the t-statistic shows that mainly Mel spectra are the most relevant features.
Date of Conference: 20-23 October 2013
Date Added to IEEE Xplore: 09 January 2014
Electronic ISBN:978-1-4799-0972-8

ISSN Information:

Conference Location: New Paltz, NY, USA

Contact IEEE to Subscribe

References

References is not available for this document.