Cart (Loading....) | Create Account
Close category search window

A parametric representation and a clustering method for phoneme recognition--Application to stops in a CV environment

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

1 Author(s)
Tanaka, K. ; Ministry of International Trade and Industry, Ibaraki, Japan.

A new method of representing phonemic categories and determining their standard values from a training sample distribution is presented. It is an essential part of a phoneme recognition system aiming at speaker-independent speech recognition. The phonemic value of a short-duration speech signal of up to 50 ms is represented by a matrix composed of acoustic parameters. Standard phonemic categories (SPC's) are defined by a combination of several simple potential functions in this matrix space. The potential function set, as well as its number, is determined automatically by the proposed method. Processing is primarily by algebraic operation and is formulated according to an analogy to particle dynamics. The method is applied to voiceless and voiced stop consonant sets spoken by twelve speakers. The relationship between the classification rate and the number of SPC's is investigated under several initial conditions. Stop consonant recognition tests in CV-syllables are made using derived SPC sets irrespective of following vowels. Recognition rates for the utterances of four speakers not included among the twelve speakers used for training were 84 percent for voiceless and 81 percent for voiced stops.

Published in:

Acoustics, Speech and Signal Processing, IEEE Transactions on  (Volume:29 ,  Issue: 6 )

Date of Publication:

Dec 1981

Need Help?

IEEE Advancing Technology for Humanity About IEEE Xplore | Contact | Help | Terms of Use | Nondiscrimination Policy | Site Map | Privacy & Opting Out of Cookies

A not-for-profit organization, IEEE is the world's largest professional association for the advancement of technology.
© Copyright 2014 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.