By Topic

Burst Onset Landmark Detection and Its Application to Speech Recognition

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Chi-Yueh Lin ; Department of Electrical Engineering, National Tsing Hua University, Hsinchu, Taiwan ; Hsiao-Chuan Wang

The reliable detection of salient acoustic-phonetic cues in speech signal plays an important role in speech recognition based on speech landmarks. Once speech landmarks are located, not only can phone recognition be performed, but other useful information can also be derived. This paper focuses on the detection of burst onset landmarks, which are crucial to the recognition of stop and affricate consonants. The proposed detector is purely based on a random forest technique, which belongs to an ensemble of tree-structured classifiers. By adopting a special asymmetric bootstrapping method, a series of experiments conducted on the TIMIT database demonstrate that the proposed detector is an efficient and accurate method for detecting burst onsets. When the detection results are appended to mel frequency cepstral coefficient vectors, the augmented feature vectors enhance the recognition correctness of hidden Markov models in recognizing stop and affricate consonants in continuous speech.

Published in:

IEEE Transactions on Audio, Speech, and Language Processing  (Volume:19 ,  Issue: 5 )