By Topic

N-best decision for Thai stressed speech recognition with parallel hidden Makov model

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
P. Amomkul ; Fac. of Eng., King Mongkut's Univ. of Technol., Bangkok, Thailand ; P. Kumhom ; K. Chanmongthai

In integrating multi-isolated-word recognizers into a speech recognition for various stressed speeches, the best likelihood scopes as outputs of each recognizer are not guaranteed a correct recognition result. Since training sometimes does not cover all speakers, likelihood score of the correct recognition result is not the best and causes misrecognition. Moreover, the difference among recognizers also leads to mis-understanding. This paper proposes a decision-making method for Thai stressed speech recognition with parallel hidden Markov model. In this method, a voting scheme is applied on the words with the N-best likelihood score. Firstly, if the score margin between the first and the second-best is greater than a threshold, the voting is applied on the words with highest scores from each recognizer. If there is no clear winner, decided by considering the voting score, the next best score are included into the voting scheme. The process goes on until a winner is found or there is a tied score, its which case the average of the likelihood score of each tied word is used to decide the winner. The experiments were conducted with 4-stress speeches including angry, toward, loud, and neutral. It showed that the proposed method helped increase the recognition rate to 96.545% comparing with previous decision making techniques.

Published in:

2005 International Symposium on Intelligent Signal Processing and Communication Systems

Date of Conference:

13-16 Dec. 2005