By Topic

Discriminative training of high performance speech recognizer using N best candidates

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Chen, J.-K. ; Telecommun. Lab., Minist. of Commun., Chung-Li, Taiwan ; Soong, F.K.

Proposes an N-best candidates based, discriminative training procedure for constructing high performance HMM speech recognizers. The algorithm has two features: (1) a new frame-level loss function; (2) N best candidates are used for training. The new frame-level loss function, defined as a rectified log likelihood difference between the correct and other competing hypotheses, is minimized over all training utterances. Two speech recognition applications have been tested: speaker independent, small vocabulary (10 Mandarin Chinese digits), continuous speech recognition; and a speaker-trained, large vocabulary (5,000 commonly used Chinese words), isolated word recognition. Significant performance improvement over the traditional maximum likelihood trained HMMs has been obtained. In the connected Chinese digit recognition experiment, the string error rate is reduced from 17% to 10.8% for unknown length decoding and from 8.2% to 5.2% for known length decoding. In the large vocabulary, isolated word recognition experiment, the recognition error rate is improved from 6.8% to 3.8%

Published in:

Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on  (Volume:i )

Date of Conference:

19-22 Apr 1994