Thai speech recognition using Double filter banks for basic voice commanding

3 Author(s)
Phokharatkul, P. ; Dept. of Comput. Eng., Mahidol Univ., Nakhon Pathom, Thailand ; Nantanitikorn, K. ; Phaiboon, S.

This paper describes a methodology for recognizing Thai spoken words by integrating two approaches: double filter banks for feature extraction and Euclidean distance for recognition. First, the speech signal is transformed into a three-dimensional representation, the spectrogram, which displays energy information along both the time and frequency axes. Second, the frequencies that fall within the spread of each bin are correlated with the corresponding triangular filter, so that each bin holds a weighted sum representing the spectral magnitude in that filter bank channel. Finally, the filter bank outputs are normalized so that the input word can be compared with the words in the dictionary, with Euclidean distance used to measure the similarity between them. The system was evaluated for accuracy and stability under various conditions. Accuracy was tested with 9,000 utterances from several volunteers, and the average accuracy rate was about 96.3%. The results show that the system performed satisfactorily in every aspect evaluated.
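
The pipeline in the abstract (triangular filter weighting, normalization, nearest match by Euclidean distance) can be illustrated with a minimal sketch. This is not the paper's double filter bank design, which the abstract does not specify. It assumes a single bank of linearly spaced triangular filters applied to one magnitude-spectrum frame, unit-length normalization, and one stored template per word. All function names and the toy spectra are hypothetical.

```python
import math

def triangular_filters(num_filters, num_bins):
    """Build overlapping triangular weights over the spectrum bins.

    Assumption: linear spacing. The abstract does not state the spacing used.
    """
    step = (num_bins - 1) / (num_filters + 1)
    filters = []
    for m in range(1, num_filters + 1):
        left, center, right = (m - 1) * step, m * step, (m + 1) * step
        weights = []
        for k in range(num_bins):
            if left < k <= center:
                weights.append((k - left) / (center - left))    # rising edge
            elif center < k < right:
                weights.append((right - k) / (right - center))  # falling edge
            else:
                weights.append(0.0)
        filters.append(weights)
    return filters

def filter_bank_features(magnitudes, filters):
    """Each channel holds the weighted sum of magnitudes under one triangle."""
    return [sum(w * x for w, x in zip(weights, magnitudes)) for weights in filters]

def normalize(features):
    """Scale to unit Euclidean length (an assumed normalization scheme)."""
    norm = math.sqrt(sum(f * f for f in features))
    return [f / norm for f in features] if norm > 0 else list(features)

def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def recognize(features, dictionary):
    """Return the dictionary word whose template is nearest to the input."""
    return min(dictionary, key=lambda word: euclidean(features, dictionary[word]))

# Toy data: two made-up 17-bin spectra standing in for dictionary words.
filters = triangular_filters(num_filters=4, num_bins=17)
low_spectrum = [1.0] * 8 + [0.1] * 9    # energy concentrated in low bins
high_spectrum = [0.1] * 9 + [1.0] * 8   # energy concentrated in high bins
dictionary = {
    "low": normalize(filter_bank_features(low_spectrum, filters)),
    "high": normalize(filter_bank_features(high_spectrum, filters)),
}

# A louder copy of the low spectrum should still match "low" after normalization.
query = normalize(filter_bank_features([2.0 * x for x in low_spectrum], filters))
result = recognize(query, dictionary)
print(result)
```

Normalizing before the distance computation is what makes the match insensitive to overall signal level in this sketch. A real recognizer would compare sequences of frames rather than a single frame.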

Published in:

Computer, Mechatronics, Control and Electronic Engineering (CMCE), 2010 International Conference on (Volume: 6)

Date of Conference:

24-26 Aug. 2010