A Two-level Method for Unsupervised Speaker-based Audio Segmentation

3 Author(s)
Shilei Zhang ; Chinese Academy of Sciences, Beijing, China ; Shuwu Zhang ; Bo Xu

In this paper, we propose a two-level segmentation method that effectively detects speaker changes in a continuous audio stream. Our approach divides change detection into two levels: a region level, which detects potential change regions containing candidate speaker change points, and a boundary level, which searches for and refines the true change points. At the region level, we employ the modified generalized likelihood ratio (MGLR) metric to search for potential change regions in continuous local windows. At the boundary level, we apply the T² statistic and the Bayesian information criterion (BIC) to detect segment boundaries within the potential windows. Experimental results on the 1997 Broadcast News Hub4-NE Mandarin corpus show the efficiency of the proposed scheme.
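The boundary-level BIC test mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it shows only the commonly used ΔBIC criterion with one full-covariance Gaussian per segment, and it omits the MGLR region search and the T² pre-selection. The function names, the `margin` and `penalty_weight` parameters, and the small covariance regularizer are assumptions made for illustration.

```python
import numpy as np


def log_det_cov(frames):
    """Log-determinant of the ML covariance of a (n_frames, n_dims) array."""
    cov = np.atleast_2d(np.cov(frames, rowvar=False, bias=True))
    # Small ridge term (an assumption) keeps the matrix well conditioned.
    cov = cov + 1e-6 * np.eye(frames.shape[1])
    _, logdet = np.linalg.slogdet(cov)
    return logdet


def delta_bic(frames, i, penalty_weight=1.0):
    """Delta-BIC for splitting `frames` at index i; positive favors a change."""
    n, d = frames.shape
    gain = 0.5 * (
        n * log_det_cov(frames)
        - i * log_det_cov(frames[:i])
        - (n - i) * log_det_cov(frames[i:])
    )
    # Penalty for the extra mean vector and covariance matrix of a second model.
    penalty = 0.5 * penalty_weight * (d + 0.5 * d * (d + 1)) * np.log(n)
    return gain - penalty


def find_change_point(frames, margin=20, penalty_weight=1.0):
    """Return (index, score) of the best split, or (None, score) if no change."""
    n = frames.shape[0]
    candidates = list(range(margin, n - margin))
    scores = [delta_bic(frames, i, penalty_weight) for i in candidates]
    best = int(np.argmax(scores))
    if scores[best] > 0:
        return candidates[best], scores[best]
    return None, scores[best]
```

In a two-level scheme like the one described, a search of this kind would run only inside the windows flagged at the region level, which avoids evaluating ΔBIC at every frame of the stream. Here `frames` stands for a matrix of per-frame acoustic features (for example, cepstral coefficients); the feature type is not specified in the abstract.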

Published in:

18th International Conference on Pattern Recognition (ICPR'06) (Volume: 4)

Date of Conference:

0-0 0