Tandem connectionist feature extraction for conventional HMM systems | IEEE Conference Publication | IEEE Xplore

Tandem connectionist feature extraction for conventional HMM systems


Abstract:

Hidden Markov model speech recognition systems typically use Gaussian mixture models to estimate the distributions of decorrelated acoustic feature vectors that correspon...Show More

Abstract:

Hidden Markov model speech recognition systems typically use Gaussian mixture models to estimate the distributions of decorrelated acoustic feature vectors that correspond to individual subword units. By contrast, hybrid connectionist-HMM systems use discriminatively-trained neural networks to estimate the probability distribution among subword units given the acoustic observations. In this work we show a large improvement in word recognition performance by combining neural-net discriminative feature processing with Gaussian-mixture distribution modeling. By training the network to generate the subword probability posteriors, then using transformations of these estimates as the base features for a conventionally-trained Gaussian-mixture based system, we achieve relative error rate reductions of 35% or more on the multicondition Aurora noisy continuous digits task.
Date of Conference: 05-09 June 2000
Date Added to IEEE Xplore: 06 August 2002
Print ISBN:0-7803-6293-4
Print ISSN: 1520-6149
Conference Location: Istanbul, Turkey

Contact IEEE to Subscribe

References

References is not available for this document.