Loading [MathJax]/extensions/MathMenu.js
PLSA enhanced with a long-distance bigram language model for speech recognition | IEEE Conference Publication | IEEE Xplore

PLSA enhanced with a long-distance bigram language model for speech recognition


Abstract:

We propose a language modeling (LM) approach using background n-grams and interpolated distanced n-grams for speech recognition using an enhanced probabilistic latent sem...Show More

Abstract:

We propose a language modeling (LM) approach using background n-grams and interpolated distanced n-grams for speech recognition using an enhanced probabilistic latent semantic analysis (EPLSA) derivation. PLSA is a bag-of-words model that exploits the topic information at the document level, which is inconsistent for the language modeling in speech recognition. In this paper, we consider the word sequence in modeling the EPLSA model. Here, the predicted word of an n-gram event is drawn from a topic that is chosen from the topic distribution of the (n-1) history words. The EPLSA model cannot capture the long-range topic information from outside of the n-gram event. The distanced n-grams are incorporated into interpolated form (IEPLSA) to cover the long-range information. A cache-based LM that models the re-occurring words is also incorporated through unigram scaling to the EPLSA and IEPLSA models, which models the topical words. We have seen that our proposed approaches yield significant reductions in perplexity and word error rate (WER) over a PLSA based LM approach using the Wall Street Journal (WSJ) corpus.
Date of Conference: 09-13 September 2013
Date Added to IEEE Xplore: 08 May 2014
Electronic ISBN:978-0-9928626-0-2

ISSN Information:

Conference Location: Marrakech, Morocco

Contact IEEE to Subscribe

References

References is not available for this document.