By Topic

Using neural network and principal component analysis to study vowel recognition with temporal envelope cues

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Kaibao Nie ; Dept. of Biomed. Eng., California Univ., Irvine, CA, USA ; Fan-Gang Zeng

Previous studies in normal-hearing and cochlear-implant subjects have shown high levels of speech recognition with primarily temporal envelope cues. The present study used principal component analysis (PCA) to extract important features in temporal envelopes and then constructed a 3-layer feedforward artificial neural network to study their role in vowel recognition. Twelve vowels by 30 speakers in a /hVd/ context served as the test material. Temporal envelopes from 1 to 8 spectral bands were extracted and subjected to PCA with 15 principle components. Similar to previous perceptual data, the present study showed that 63% correct vowel recognition was achieved with only 4-band envelope cues. The principle components responsible for this high level of vowel recognition included phonemic transition cues and steady-state amplitude cues. The present result can be applied to the development of novel algorithms to improve performance for automatic speech recognition and auditory prosthetic devices.

Published in:

Engineering in Medicine and Biology Society, 2004. IEMBS '04. 26th Annual International Conference of the IEEE  (Volume:2 )

Date of Conference:

1-5 Sept. 2004