By Topic

Karhunen-Loeve method for data compression and speech synthesis

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $31
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Chen, C.S. ; Dept. of Electr. Eng., Akron Univ., OH, USA ; Huo, K.-S.

The use of the Karhunen-Loeve (KL) method in speech data compression and synthesis using the Fourier-Bessel (FB) expansion coefficients of speech signals is described. Bessel functions seem to make a natural basis for speech signal decomposition. Sinusoidal functions are the eigenfunctions of vibrating strings. Bessel functions are the eigenfunctions of vibrating pipes. The vocal tract resembles an excited pipe rather than a vibrating string. Good quality intelligible speech signals can be reconstructed using only a small portion of the FB expansion coefficient. Further data compression is possible through KL transformation of the speech signal FB expansion coefficient for efficient speech coding and synthesis. The transformation is implemented by first forming a covariance matrix of the FB coefficients. Eigenvalues and eigenvectors of the covariance matrix are computed and ranked according to the eigenvalue magnitude. Speech signals are then reconstructed using only the feature corresponding to the larger magnitude eigenvalues of the covariance matrix.<>

Published in:

Communications, Speech and Vision, IEE Proceedings I  (Volume:138 ,  Issue: 5 )