By Topic

A Combination of Data Mining Method with Context-Based State Transfer for Speech/Music Discrimination

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Qin Yan ; Sch. of Inf. & Eng., Hohai Univ., Nanjing, China ; Qiong Wu ; Haojiang Deng ; Jinlin Wang

In our previous work, a speech/music classifier is proposed on the basis of the feature subset selection (FSS) tool and oblique decision tree induced by the algorithm OC1. In this paper, we endeavor to improve it by state transfer (ST) strategy whose aim is to refine the classification results, according to the fact that adjacent segments in one audio file have strong relevance to each other. The proposed algorithm is evaluated by a set of 5-to-11-minute 504 audio files of different types of speech and music in three signal-to-noise ratio (SNR) levels: 30 dB, 20 dB and 10 dB. The results show that ST strategy averagely improves the accuracy for music by 3.3% at 10 dB and 2.3% at 20 dB while keeping accuracy rate of speech almost unchanged. The speech classification rate is also lifted by 5.7% at 10 dB on average.

Published in:

2009 5th International Conference on Wireless Communications, Networking and Mobile Computing

Date of Conference:

24-26 Sept. 2009