Loading [a11y]/accessibility-menu.js
Refining Side Information for Improved Transform Domain Wyner-Ziv Video Coding | IEEE Journals & Magazine | IEEE Xplore

Refining Side Information for Improved Transform Domain Wyner-Ziv Video Coding


Abstract:

Wyner-Ziv (WZ) video coding is a particular case of distributed video coding, which is a recent video coding paradigm based on the Slepian-Wolf and WZ theorems. Contrary ...Show More

Abstract:

Wyner-Ziv (WZ) video coding is a particular case of distributed video coding, which is a recent video coding paradigm based on the Slepian-Wolf and WZ theorems. Contrary to available prediction-based standard video codecs, WZ video coding exploits the source statistics at the decoder, allowing the development of simpler encoders. Until now, WZ video coding did not reach the compression efficiency performance of conventional video coding solutions, mainly due to the poor quality of the side information, which is an estimate of the original frame created at the decoder in the most popular WZ video codecs. In this context, this paper proposes a novel side information refinement (SIR) algorithm for a transform domain WZ video codec based on a learning approach where the side information is successively improved as the decoding proceeds. The results show significant and consistent performance improvements regarding state-of-the-art WZ and standard video codecs, especially under critical conditions such as high motion content and long group of pictures sizes.
Published in: IEEE Transactions on Circuits and Systems for Video Technology ( Volume: 19, Issue: 9, September 2009)
Page(s): 1327 - 1341
Date of Publication: 12 May 2009

ISSN Information:


I. Introduction

Available video coding algorithms, notably those adopted by the ITU-T and ISO/IEC MPEG standards, exploit the source signal statistics at the encoder using a block-based prediction and transform coding paradigm, known as hybrid or predictive video coding. The key coding tools for hybrid coding are: 1) temporal prediction to exploit the temporal redundancy between the video frames; 2) transform coding, e.g., discrete cosine transform (DCT) to exploit the spatial redundancy; 3) quantization of the transform coefficients to exploit the visual irrelevancy (related to the human visual system limitations); and 4) entropy coding to exploit the statistical redundancy of the created coding symbols. Since a hybrid video coding solution exploits the correlation between and within the video frames at the encoder, it typically leads to rather complex encoders and much simpler decoders, without much flexibility in terms of complexity budget allocation besides making the encoder less complex and thus less efficient. This conventional video coding approach is especially appropriate for services and systems such as broadcasting and video-on-demand, where the video data is encoded by one encoder and decoded by many decoders, ideally as simple as possible.

Contact IEEE to Subscribe

References

References is not available for this document.