
Fast Recovery for a CELP-Like Speech Codec After a Frame Erasure

Authors: Chibani, M. (Dept. de Genie Electr. et de Genie Inf., Univ. de Sherbrooke, Sherbrooke, QC); Lefebvre, R.; Gournay, P.

The adaptive codebook used in code-excited linear prediction (CELP)-like speech codecs is very effective for modeling the quasi-periodic component of the excitation signal but, unfortunately, introduces a strong interframe dependency that renders the decoder vulnerable to frame erasures. For voiced speech, the error affects not only the erased frame but also all the subsequent frames. In this paper, a technique to improve the recovery after a frame erasure is proposed. The technique consists of a constrained excitation search at the encoder and a resynchronization procedure at the decoder. The constraint aims at reducing the contribution of the adaptive codebook by making the innovation codebook partially model the pitch excitation. Further, for highly voiced frames, the pitch-related information contained in the innovation excitation is exploited at the decoder to speed up the resynchronization of the adaptive codebook after a frame erasure. When applied to the adaptive multirate wideband (AMR-WB) codec, the method brings a significant improvement in the case of frame erasures, at the cost of a minor quality loss compared to the standard codec at the same bit rate. The method introduces no additional delay and has the advantage of maintaining full interoperability between the standard codec and its modified version.
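
The interframe dependency described in the abstract can be illustrated with a toy model. The sketch below is not the paper's algorithm and not AMR-WB: it assumes a single fixed pitch lag and pitch gain, an excitation of the form e[n] = g * e[n - T] + c[n], and a decoder whose adaptive codebook memory has been zeroed by an erasure. All function names and parameters are hypothetical. Under these assumptions the mismatch between encoder and decoder excitation shrinks by a factor g per pitch period, so a smaller adaptive codebook contribution leads to a faster recovery.

```python
import random


def synthesize(innovation, pitch_lag, pitch_gain, history):
    """Toy long-term predictor: e[n] = pitch_gain * e[n - pitch_lag] + innovation[n].

    `history` plays the role of the adaptive codebook memory and must hold
    at least `pitch_lag` past excitation samples.
    """
    buf = list(history)
    start = len(buf)
    for c in innovation:
        buf.append(pitch_gain * buf[-pitch_lag] + c)
    return buf[start:]


def mismatch_energy_per_frame(pitch_gain, pitch_lag=40, frame_len=160,
                              frames_after=4, seed=0):
    """Energy of (encoder - decoder) excitation for each frame after an erasure.

    Assumption: the erasure leaves the decoder memory at zero, while the
    encoder memory holds the true past excitation. Both sides then receive
    the same innovation, so only the memory mismatch propagates.
    """
    rng = random.Random(seed)
    enc_hist = [rng.uniform(-1.0, 1.0) for _ in range(pitch_lag)]
    dec_hist = [0.0] * pitch_lag  # hypothetical post-erasure state
    energies = []
    for _ in range(frames_after):
        innovation = [rng.uniform(-0.1, 0.1) for _ in range(frame_len)]
        enc = synthesize(innovation, pitch_lag, pitch_gain, enc_hist)
        dec = synthesize(innovation, pitch_lag, pitch_gain, dec_hist)
        energies.append(sum((a - b) ** 2 for a, b in zip(enc, dec)))
        enc_hist, dec_hist = enc[-pitch_lag:], dec[-pitch_lag:]
    return energies


if __name__ == "__main__":
    for gain in (0.95, 0.5):
        print(gain, mismatch_energy_per_frame(gain))
```

With a gain near 1, as is typical of strongly voiced speech, the mismatch energy in this toy model is still substantial several frames after the erasure. With a lower gain it becomes negligible within a frame or two. This is only the intuition behind constraining the adaptive codebook contribution; the decoder-side resynchronization procedure is not modeled here.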

Published in: IEEE Transactions on Audio, Speech, and Language Processing (Volume: 15, Issue: 8)