By Topic

Speech modelling using cepstral-time feature matrices in hidden Markov models

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $31
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Vaseghi, S.V. ; Sch. of Inf. Syst., East Anglia Univ., Norwich, UK ; Conner, P.N. ; Milner, B.P.

The paper explores the use of 2-dimensional cepstral-time features for the utilisation of correlation among successive speech spectral vectors, within a hidden-Markov-model (HMM) framework. A cepstral-time-feature matrix is obtained from a 2-dimensional discrete cosine transform of a spectral-time matrix. Advantages of cepstral-time features are that cepstral-time-feature matrices are a simple and robust method of representing short-time variation of speech spectral parameters; a cepstral-time matrix contains information on the transitional dynamics of feature vectors within the matrix; speech recognition based on cepstral time matrices is more robust in noisy environments; and use of a matrix of M cepstral vectors implies a minimum HMM-state duration constraint of M vector units. A simple framework investigated in the paper for applications of cepstral-time features is a finite-state-matrix quantiser (FSMQ), a special case of the HMM. It is used for initialisation of the training phase of HMMs.<>

Published in:

Communications, Speech and Vision, IEE Proceedings I  (Volume:140 ,  Issue: 5 )