By Topic

Vocabulary independent discriminative utterance verification for nonkeyword rejection in subword based speech recognition

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
R. A. Sukkar ; Bell Labs., Lucent Technol., Naperville, IL, USA ; Chin-Hui Lee

An integral part of any deployable speech recognition system is the capability to detect if the input speech does not contain any of the words in the recognizer vocabulary set. This capability, which is called utterance verification (or keyword recognition and nonkeyword rejection), is therefore becoming increasingly important as speech recognition systems continue to migrate from the laboratory to actual applications. We present a framework and a method for vocabulary independent utterance verification in subword-based speech recognition. The verification process is cast as a statistical hypothesis test, where vocabulary independence is accomplished through a two-stage verification process: subword-level verification followed by string-level verification. A verification function is defined and discriminatively trained to perform subword-level verification. String-level verification is accomplished by defining and evaluating an overall string-level log likelihood ratio that is a function of the subword-level verification scores. Experimental results show that this vocabulary-independent discriminative utterance verification method significantly outperforms a baseline method commonly used in wordspotting tasks

Published in:

IEEE Transactions on Speech and Audio Processing  (Volume:4 ,  Issue: 6 )