By Topic

Isolated word recognition using high-order statistics and time-delay neural networks

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

1 Author(s)
Ashouri, M.R. ; Dept. of Electr. & Comput. Eng., Isfahan Univ. of Technol., Iran

In this paper, two isolated word recognition methods based on high-order statistics and a time-delay neural network (TDNN) for recognition of Farsi spoken digits have been studied. The adopted speech recognition system consists of four modules, namely, a preprocessor, endpoints' detector, feature extractor and classifier. The first method estimates the AR parameters of speech based on the third- and fourth-order cumulants using high-order Yule-Walker, W-slice and 1-D slice approaches. In the second, method, statistical features are extracted from the estimated high-order probability density function (pdf) of thresholded amplitude features. For each pdf estimate, the values of mean, variance, third order moment and entropy are computed. The total number of features for each frame of approximate length of 15 ms is 16. The adopted TDNN has 16 nodes in its input layer, 10 nodes in its output layer and two hidden layers. The learning rule of the adopted TDNN that is based on the backpropagation rule has been modified to decrease the training time. Computer simulation results obtained from recognizing 10 Farsi digits spoken by different speakers shows that the first method has a better recognition rate while the second method necessitates less computation

Published in:

Higher-Order Statistics, 1997., Proceedings of the IEEE Signal Processing Workshop on

Date of Conference:

21-23 Jul 1997