By Topic

Speaker identification with whispered speech using unvoiced-consonant phonemes

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Juan Xu ; School of Electronics and Information Engineering, Soochow University, Suzhou, China ; Heming Zhao

A whisper is a speech production mode used by us to protect our privacy. Due to the differences between whispered and neutral speech, in both excitation and vocal tract function, the performance of speaker identification systems trained with neutral speech degrades significantly. This paper describes a neutral/whisper mismatched closed-set speaker identification system. The acoustic characteristics of vowels and voiced consonants are different between whispered and neutral speech. The acoustic characteristics of unvoiced consonants are relatively similar between whispered and neutral speech. In order to improve system performance, a feature extraction algorithm based on linear frequency scale is applied in this paper. The static linear frequency cepstral coefficient vectors are extracted as features from neutral and whispered unvoiced consonants. The closed-set speaker ID system using unvoiced consonants based on linear frequency cepstral coefficients achieves an absolute improvement for speaker recognition.

Published in:

2012 International Conference on Image Analysis and Signal Processing

Date of Conference:

9-11 Nov. 2012