By Topic

Speech segmentation using multifractal measures and amplification of signal features

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Kinsner, Witold ; Dept. of Electr. & Comput. Eng., Univ. of Manitoba, Winnipeg, MB ; Grieder, W.

This paper describes a fast multiscale time-domain technique for the analysis of natural speech waveforms in the presence of noise. The technique is based on the variance fractal dimension trajectory algorithm that is used not only to detect the external boundaries of an utterance, but also its internal pauses representing the unvoiced speech. The algorithm can also identify internal features of phonemes. The features can be amplified so that the speech utterances can be segmented into sentences, words and phonemes. This approach is superior to other energy-based boundary-detection techniques. These observations are based on extensive experimental results on speech utterances digitized at 44.1 kilosamples per second, with 16 bits in each sample.

Published in:

Cognitive Informatics, 2008. ICCI 2008. 7th IEEE International Conference on

Date of Conference:

14-16 Aug. 2008