By Topic

Automatic determination of acoustic model topology using variational Bayesian estimation and clustering

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Watanabe, S. ; NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan ; Sako, A. ; Nakamura, A.

We describe the automatic determination of an acoustic model for speech recognition, which is very complicated and includes latent variables, using VBEC: variational Bayesian estimation and clustering for speech recognition. We propose an efficient Gaussian mixture model (GMM) based phonetic decision tree construction within the VBEC framework. The proposed method features a novel approach to reduce the unrealistically large number of computations needed for iterative calculations in the GMM-based decision tree method to a practical level by assuming that each Gaussian per state has the same occupancy and is represented by the same posterior distribution for the covariance parameter. The experimental results confirmed that VBEC automatically provided an optimum model topology with the highest performance level.

Published in:

Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on  (Volume:1 )

Date of Conference:

17-21 May 2004