
Transformation-based Bayesian predictive classification using online prior evolution


Authors: Jen-Tzung Chien (Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan); Guo-Hong Liao

The mismatch between training and testing environments makes it necessary for speech recognizers to be adaptive in both acoustic modeling and decision making. Accordingly, speech hidden Markov models (HMMs) should be able to incrementally capture the evolving statistics of environments using data available online. Speech recognizers also need a robust decision strategy that takes parameter uncertainty into account. This paper presents a transformation-based Bayesian predictive classification (TBPC) in which the uncertainty of the transformation parameters of the HMM mean vector and precision matrix is represented by a joint multivariate normal-Wishart prior density, which belongs to the conjugate family. The TBPC decision rule is formulated accordingly. Because the prior is conjugate, the prior and posterior form a reproducible pair, so the hyperparameters of the prior density can evolve successively to new environments using online test/adaptation data. The evolved hyperparameters then describe the parameter uncertainty for the TBPC decision. The result is a novel TBPC framework with online prior evolution (OPE) capability for robust speech recognition. The framework is shown to be both effective and efficient on a connected Chinese digit recognition task in hands-free car environments.
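The reproducible prior/posterior pair mentioned in the abstract can be illustrated with the textbook normal-Wishart update for the mean and precision of a single Gaussian. This is only a minimal sketch under stated assumptions: it is not the paper's update for HMM transformation parameters, and the names `NormalWishart` and `evolve`, the hyperparameter values, and the synthetic data are all hypothetical. It shows that feeding data block by block, with each posterior reused as the next prior, gives the same hyperparameters as a single batch update.

```python
from dataclasses import dataclass

import numpy as np


@dataclass
class NormalWishart:
    """Hyperparameters of a normal-Wishart density (illustrative names)."""
    mu: np.ndarray     # location of the Gaussian mean
    kappa: float       # strength of belief in mu (pseudo-count)
    nu: float          # Wishart degrees of freedom
    W_inv: np.ndarray  # inverse of the Wishart scale matrix


def evolve(prior: NormalWishart, X) -> NormalWishart:
    """Standard conjugate update: the posterior is again normal-Wishart."""
    X = np.atleast_2d(np.asarray(X, dtype=float))
    n = X.shape[0]
    xbar = X.mean(axis=0)
    centered = X - xbar
    scatter = centered.T @ centered   # sum of outer products around xbar
    diff = xbar - prior.mu
    kappa_n = prior.kappa + n
    return NormalWishart(
        mu=(prior.kappa * prior.mu + n * xbar) / kappa_n,
        kappa=kappa_n,
        nu=prior.nu + n,
        W_inv=prior.W_inv + scatter
        + (prior.kappa * n / kappa_n) * np.outer(diff, diff),
    )


# Synthetic 2-D observations standing in for adaptation data (assumption).
rng = np.random.default_rng(0)
data = rng.normal(size=(40, 2))
prior = NormalWishart(mu=np.zeros(2), kappa=1.0, nu=4.0, W_inv=np.eye(2))

# Online evolution: each posterior becomes the prior for the next block.
online = prior
for block in np.array_split(data, 4):
    online = evolve(online, block)

# Batch update on all data at once, for comparison.
batch = evolve(prior, data)
print(np.allclose(online.mu, batch.mu), np.allclose(online.W_inv, batch.W_inv))
```

Storing the inverse scale matrix keeps the update purely additive, so no matrix inversion is needed while the hyperparameters evolve.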

Published in: IEEE Transactions on Speech and Audio Processing (Volume 9, Issue 4)