A mixed Fourier/Walsh transform scheme for speech coding is proposed. A set of harmonically structured frequency components is used to represent narrowband components of speech. The broadband residual is characterised by a small number of sequency components. The frequencies and sequences of the transform components are determined by sampling the short-time Fourier and Walsh transforms, respectively. The magnitudes and phases of the Fourier components and the amplitudes of the Walsh components are determined using an iterative algorithm based on the Gauss-Seidel method. A vector quantisation (VQ) scheme is developed to encode the frequency and the sequency components. Results and subjective evaluations are given for speech coding at 4.0 kbit/s.<
Published in:
Communications, Speech and Vision, IEE Proceedings I
(Volume:139
,
Issue:
5
)
Date of Publication: Oct. 1992