By Topic

A new quantization optimization algorithm for the MPEG advanced audio coder using a statistical subband model of the quantization noise

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Derrien, O. ; Signal & Image Process. Dept., ENST/TSI ; Duhamel, P. ; Charbit, M. ; Richard, G.

In this paper, an improvement of the quantization optimization algorithm for the MPEG Advanced Audio Coder (AAC) is presented. This algorithm, given a bit-rate constraint, minimizes the perceived distortion generated by the signal compression. The distortion can be related to the quantization error level over frequency subbands through an auditory model. Thus, optimizing the quantization requires knowledge of the rate-distortion function for each subband. When this function can be modeled in a simple way, the algorithm can take a one-loop recursive structure. However, in the MPEG AAC, the rate-distortion function is hard to characterize, since AAC makes use of nonlinear quantizers and variable length entropy coders. As a result, the standard algorithm makes use of two nested loops with a local decoder, in order to measure the error level rather than predicting its value. We first describe a partial subband modeling of the rate-distortion function of interest in the MPEG AAC. Then, using a statistical approach, we find a relationship between the error level and the so-called quantization "scale-factor" and propose a new algorithm that is basically similar to a classical one loop "bit allocation" process. Finally, we describe the complete algorithm and show that it is more efficient than the standard one

Published in:

Audio, Speech, and Language Processing, IEEE Transactions on  (Volume:14 ,  Issue: 4 )