By Topic

Speech/music discrimination by detection: Assessment of time series events using ROC graphs

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Alnadabi, M. ; Sultan Qaboos Univ. ; Johnstone, S.

This paper suggests the application of the Receiver Operating Characteristics (ROC) graph to assess the performance of any speech/music discrimination method. ROC graphs are applied in the field of speech/music discrimination to assess the Time Series Events (TSE) method. The discrimination problem is viewed as two detection problems: detection of speech and detection of music. It was found that the optimal feature for detecting speech was silence with a true positive rate of 0.9 and false positive rate of 0.14, whilst the optimal feature for music was non-zero crossing rate NZCR with a true positive rate of 0.71 and false positive rate of 0.08.

Published in:

Systems, Signals and Devices, 2009. SSD '09. 6th International Multi-Conference on

Date of Conference:

23-26 March 2009