By Topic

Automatic thresholding for voicing detection algorithms

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

1 Author(s)
Neuburg, E. ; Department of Defense, Meade, Md

Automatic voicing-decision algorithms depend on thresholds which are dependent on speaker, channel, S/N ratio, etc. Low-frequency energy (LFE) is one of the best voicing statistics when properly thresholded; it is even better if two thresholds are set, one for onset of voicing and one for offset. Two schemes are proposed for adaptive, estimation of thresholds. The first is finding stretches that are "surely" voiced or unvoiced, finding boundaries by heuristic algorithms, and setting thresholds consistent with these boundaries, in the second, one finds segments that are "surely" voiced or unvoiced according to voicing statistics other than LFE, using these to form estimates of the distribution of LFE in voiced and unvoiced cases. Both schemes successfully determine speaker-dependent thresholds in about 15 seconds, during which "standard" thresholds can be used. Overall voicing error rate using LFE with adaptive thresholds is about 1%.

Published in:

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '79.  (Volume:4 )

Date of Conference:

Apr 1979