By Topic

2009 IEEE International Conference on Multimedia and Expo

June 28 2009-July 3 2009

Filter Results

Displaying Results 1 - 25 of 473
  • [Front cover]

    Publication Year: 2009, Page(s): c1
    Request permission for commercial reuse | PDF file iconPDF (519 KB)
    Freely Available from IEEE
  • [Title page]

    Publication Year: 2009, Page(s): c2
    Request permission for commercial reuse | PDF file iconPDF (437 KB)
    Freely Available from IEEE
  • [Copyright notice]

    Publication Year: 2009, Page(s): ii
    Request permission for commercial reuse | PDF file iconPDF (497 KB)
    Freely Available from IEEE
  • Organizing Committee

    Publication Year: 2009, Page(s):iii - xii
    Request permission for commercial reuse | PDF file iconPDF (444 KB)
    Freely Available from IEEE
  • Table of contents

    Publication Year: 2009, Page(s):xiii - lvi
    Request permission for commercial reuse | PDF file iconPDF (549 KB)
    Freely Available from IEEE
  • Directional filtering transform

    Publication Year: 2009, Page(s):1 - 4
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (256 KB) | HTML iconHTML

    This paper proposes the directional filtering transform (dFT, in order to distinguish from the common usage on DFT) to better exploit intra-frame correlation in H.264 intra-frame coding. It consists of a directional filtering and an optional DCT transform. In the proposed directional filtering, there are two different approaches. One is the uni-directional filtering (UDF) that is similar to H.264 ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Multiview video coding using projective rectification-based view extrapolation and synthesis bias correction

    Publication Year: 2009, Page(s):5 - 8
    Cited by:  Papers (6)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (446 KB) | HTML iconHTML

    Current view synthesis prediction (VSP) techniques for multiview video coding (MVC) rely on disparity-based view interpolation or depth-based 3D warping. The former cannot be applied to every camera view, whereas the latter may require coding of the depth information of a scene. To avoid these constraints, we propose an improved VSP-based MVC scheme based on the following three techniques: 1) view... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Single-iteration full-search fractional motion estimation for quad full HD H.264/AVC encoding

    Publication Year: 2009, Page(s):9 - 12
    Cited by:  Papers (12)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (508 KB) | HTML iconHTML

    Fractional motion estimation (FME) is widely used in video compression standards. In H.264/AVC, the precision of motion vector is down to quarter pixels to improve the coding efficiency. However, FME occupies over 45% of the computation complexity in an H.264 encoder and this high complexity limits the processing capability. In this paper, a single-iteration full search FME is proposed. By the alg... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Rate-distortion analysis of rectification-based view interpolation for multiview video coding

    Publication Year: 2009, Page(s):13 - 16
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (276 KB) | HTML iconHTML

    View interpolation has been applied in multiview view coding. However, existing schemes assume all cameras are aligned. These methods may not perform well when neighboring cameras point to different directions. In this paper, we apply the rectification based view interpolation to MVC. We first derive the theoretical performance gain of the rectification based view interpolation over existing inter... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Fast mode selection scheme for H.264/AVC inter prediction based on statistical learning method

    Publication Year: 2009, Page(s):17 - 20
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (242 KB) | HTML iconHTML

    H.264 adopts variable block size motion estimation and rate-distortion-optimization based mode decision to improve video quality and compression ratio. These techniques have made H.264 better than other existing video coding standards. However, they are computationally intensive and time-consuming. In this paper, a fast mode selection scheme is proposed for H.264 inter prediction. Firstly, the fir... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A new strategy to predict the search range in H.264/AVC

    Publication Year: 2009, Page(s):21 - 24
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (171 KB) | HTML iconHTML

    Motion estimation is a very time consuming part in H.264 codec. In order to reduce motion estimation time, many strategies have been used during this process. Dynamic search range is one of them. In this paper, based on the analysis of the problems existed in current algorithm, we propose a new strategy to predict motion estimation search range by using the information of image size, block mode an... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Content-based hierarchical motion description for multiple video adaptation

    Publication Year: 2009, Page(s):25 - 28
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (310 KB) | HTML iconHTML

    Video adaptation has been considered as a promising technique to tackle challenging problems in pervasive multimedia applications. However, the styles of video representation and description in existing framework are not flexible enough to adapt diversified application environment. In this paper, we propose a novel solution based on intermediate description, which can support fast multiple video a... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Spatial transcoding from Scalable Video Coding to H.264/AVC

    Publication Year: 2009, Page(s):29 - 32
    Cited by:  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (238 KB) | HTML iconHTML

    Scalable Video Coding (SVC) is backwards compatible to H.264/AVC in the sense that the base layer sub-bitstream is decodable by an H.264/AVC decoder. However, there are applications wherein it is desirable for an H.264/AVC decoder to obtain a higher resolution video representation than the base layer within SVC. In order to fulfill the needs of such application scenarios, transcoding of SVC enhanc... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Block-Matching Translation and Zoom Motion-Compensated Prediction

    Publication Year: 2009, Page(s):33 - 36
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (223 KB) | HTML iconHTML

    In modern video coding standards, motion compensated prediction (MCP) plays a key role to achieve video compression efficiency. Most of them make use of block matching techniques and assume the motions are pure translational. Attempts toward a more general motion model are usually too complex to be practical in near future. In this paper, a new Block-Matching Translation and Zoom Motion-Compensate... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Content aware configurable architecture for H.264/AVC integer motion estimation engine

    Publication Year: 2009, Page(s):37 - 40
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (154 KB) | HTML iconHTML

    In this paper, we contribute a configurable SAD tree architecture based on adaptive subsampling scheme. Firstly, by further exploiting the spatial feature, the integer motion estimation process is greatly sped up. Secondly, the conventional partial sum of absolute difference (SAD) based pipeline structure is optimized into configurable SAD oriented way, which enhances the performance and solve the... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Accurate bit prediction for intra-only rate control

    Publication Year: 2009, Page(s):41 - 44
    Cited by:  Papers (2)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (394 KB) | HTML iconHTML

    Rate control plays a crucial role for video communication applications. It ensures that the generated compressed bit streams satisfy bandwidth and buffer constraints. Rate control algorithms recommended by H.264/AVC adopt rate-distortion (R-D) models for inter-frames to determine quantization parameters (QPs) but not for intra-frames. Instead, they directly compute QPs without any considerations o... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Frame complexity prediction for H.264/AVC rate control

    Publication Year: 2009, Page(s):45 - 48
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (462 KB) | HTML iconHTML

    Rate control regulates the output bit rate of a video encoder in order to obtain optimum visual quality within the available network bandwidth and to maintain buffer fullness within a specified tolerance range. In this paper, we propose a novel rate control scheme for H.264/AVC video compression with a number of new features. We first introduce a calculation approach of frame complexity based on t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Fractional compensation for spatial scalable video coding

    Publication Year: 2009, Page(s):49 - 52
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (415 KB) | HTML iconHTML

    This paper proposes a novel fractional compensation approach for spatial scalable video coding. It simultaneously exploits inter layer correlation and intra layer correlation by learning-based mapping. Instead of using an enhancement layer reconstruction as an entire reference, a set of reference pairs are generated from high-frequency components of both base layer and enhancement layer reconstruc... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Estimating spatial cues for audio coding in MDCT domain

    Publication Year: 2009, Page(s):53 - 56
    Cited by:  Papers (3)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (220 KB) | HTML iconHTML

    Although widely used otherwise, MDCT is excluded in the current scheme for spatial cues representation, due to its lacking of phase information and energy conservation. But combining MDCT with MDST overcomes the difficulties. Moreover, MDST spectra can be built perfectly from neighboring MDCT spectra. The MDCT-MDST conversion, in matrix form, is approximating to a banded sparse matrix. When applie... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Video coding based on audio-visual attention

    Publication Year: 2009, Page(s):57 - 60
    Cited by:  Papers (8)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (201 KB) | HTML iconHTML

    This paper proposes an efficient video coding method based on audio-visual attention, which is motivated by the fact that cross-modal interaction significantly affects humans' perception of multimedia content. First, we propose an audio-visual source localization method to locate the sound source in a video sequence. Then, its result is used for applying spatial blurring to video frames in order t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Fast multi-reference motion estimation via statistical learning for H.264/AVC

    Publication Year: 2009, Page(s):61 - 64
    Cited by:  Papers (1)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (205 KB) | HTML iconHTML

    In the H.264/AVC coding standard, motion estimation (ME) is allowed to use multiple reference frames to make full use of reducing temporal redundancy in a video sequence. Although it can further reduce the motion compensation errors, it introduces tremendous computational complexity as well. In this paper, we propose a statistical learning approach to reduce the computation involved in the multire... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Block-based color correction algorithm for multi-view video coding

    Publication Year: 2009, Page(s):65 - 68
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (441 KB) | HTML iconHTML

    The color variations among different viewpoints in multiview video sequences may deteriorate the visual quality and coding efficiency. Various color correction methods have been proposed, however, the color appearance and histogram of corrected target frames are not similar enough to the reference frames in details. Focusing on restoring more similar color, a block-based color correction algorithm... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A Multi-layer motion estimation scheme for spatial scalability in H.264/AVC scalable extension

    Publication Year: 2009, Page(s):69 - 72
    Cited by:  Papers (6)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (398 KB) | HTML iconHTML

    In this paper, we propose a fast multi-layer motion estimation algorithm for spatial scalability provided in H.264/AVC scalable extension, based on the reuse of the motion vectors from multiple spatial layers. The reused motion vector is used to set a search center and refined within a small search area. However, the reused motion vector often produces significant prediction error at object bounda... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Kurtosis-based super-resolution algorithm

    Publication Year: 2009, Page(s):73 - 76
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1095 KB) | HTML iconHTML

    A kurtosis-based super-resolution image reconstruction algorithm is proposed in this paper. Firstly, we give the definition of the kurtosis image and analyze its two properties: (i) the kurtosis image is Gaussian noise invariant, and (ii) the absolute value of a kurtosis image becomes smaller as the the image gets smoother. Then we build a constrained absolute local kurtosis maximization function ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A robust spatial-temporal line-warping based deinterlacing method

    Publication Year: 2009, Page(s):77 - 80
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (129 KB) | HTML iconHTML

    In this paper, a line-warping based deinterlacing method will be introduced. The missing pixels in interlaced videos can be derived from the warping of pixels in horizontal line pairs. In order to increase the accuracy of temporal prediction, multiple temporal-line pairs, selected according to constant velocity model, are used for warping. The stationary pixels can be well-preserved by accuracy st... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.