IEEE Transactions on Multimedia

Issue 3 • June 2006

  • Table of contents

    Publication Year: 2006, Page(s): c1 - c4
    Freely Available from IEEE
  • IEEE Transactions on Multimedia publication information

    Publication Year: 2006, Page(s): c2
    Freely Available from IEEE
  • Semantic adaptation of sport videos with user-centred performance analysis

    Publication Year: 2006, Page(s): 433 - 443
    Cited by: Papers (14)

    In semantic video adaptation, measures of performance must consider the impact of errors in the automatic annotation on the adaptation, in relation to the preferences and expectations of the user. In this paper, we define two new performance measures, Viewing Quality Loss and Bit-rate Cost Increase, that are obtained from classical peak signal-to-noise ratio (PSNR) and bitrate, and relate...
  • Deep compression of remotely rendered views

    Publication Year: 2006, Page(s): 444 - 456
    Cited by: Papers (1)

    Three-dimensional (3-D) models are information-rich and provide compelling visualization effects. However, downloading and viewing 3-D scenes over the network may be excessive. In addition, low-end devices typically have insufficient power and/or memory to render the scene interactively in real time. Alternatively, 3-D image warping, an image-based-rendering technique that renders a two-dimensional (2...
  • An efficient scheme for motion estimation using multireference frames in H.264/AVC

    Publication Year: 2006, Page(s): 457 - 466
    Cited by: Papers (26) | Patents (3)

    The multiple reference frame motion compensation (MRMC) supported by H.264 exploits the redundancy between multiple frames to enhance coding efficiency over single reference frame motion compensation (SRMC), in which motion vectors are searched over a single reference frame. In addition, the technique using multiple reference frames can combat the channel errors efficien...
  • Low-delay rate control for real-time H.264/AVC video coding

    Publication Year: 2006, Page(s): 467 - 477
    Cited by: Papers (44) | Patents (4)

    This paper presents an efficient rate control scheme for H.264/AVC video coding in low-delay environments. In our scheme, we propose an enhancement to the buffer-status based H.264/AVC bit allocation method. The enhancement uses a PSNR-based frame complexity estimation to improve the existing mean absolute difference based (MAD-based) complexity measure. Bit allocation to each frame is ...
  • Fast multiframe motion estimation algorithms by motion vector composition for the MPEG-4/AVC/H.264 standard

    Publication Year: 2006, Page(s): 478 - 487
    Cited by: Papers (38) | Patents (3)

    The MPEG-4/AVC/H.264 video coding standard adopts various coding schemes such as multiple reference frames and variable block sizes for motion estimation. Hence, MPEG-4/AVC/H.264 provides gains in compression efficiency of up to 50% over a wide range of bit rates and video resolutions compared to previous standards. However, these features result in a considerable increase in encoder complexity, m...
  • A novel fractal image watermarking

    Publication Year: 2006, Page(s): 488 - 499
    Cited by: Papers (24)

    A novel watermarking method is proposed to hide a binary watermark into image files compressed by fractal block coding. This watermarking method utilizes a special type of orthogonalization fractal coding method where the fractal affine transform is determined by the range block mean and contrast scaling. Such orthogonalization fractal decoding is a mean-invariant iteration. In contrast, the fract...
  • Recognition of facial expressions and measurement of levels of interest from video

    Publication Year: 2006, Page(s): 500 - 508
    Cited by: Papers (92) | Patents (3)

    This paper presents a spatio-temporal approach to recognizing six universal facial expressions from visual data and using them to compute levels of interest. The classification approach relies on a two-step strategy on top of projected facial motion vectors obtained from video sequences of facial expressions. First, a linear classification bank was applied on projected optical flow vectors and ...
  • Modeling individual and group actions in meetings with layered HMMs

    Publication Year: 2006, Page(s): 509 - 520
    Cited by: Papers (75)

    We address the problem of recognizing sequences of human interaction patterns in meetings, with the goal of structuring them in semantic terms. The investigated patterns are inherently group-based (defined by the individual activities of meeting participants, and their interplay), and multimodal (as captured by cameras and microphones). By defining a proper set of individual actions, group actions...
  • Merging artificial objects with marker-less video sequences based on the interacting multiple model method

    Publication Year: 2006, Page(s): 521 - 528
    Cited by: Papers (9)

    Inserting synthetic objects into video sequences has gained much interest in recent years. Fast and robust vision-based algorithms are necessary to make such an application possible. Traditional pose tracking schemes using recursive structure from motion techniques adopt one Kalman filter and thus only favor a certain type of camera motion. We propose a robust simultaneous pose tracking and struct...
  • Interactive dialogue model: a design technique for multichannel applications

    Publication Year: 2006, Page(s): 529 - 541
    Cited by: Papers (16) | Patents (3)

    Multichannel applications deliver the same content and a "similar interactive experience" using different devices and different technologies (e.g., web sites, palm-held devices, car navigators, or interactive TVs). Various channels imply a number of differences, including screen (size), keyboard (size), pointing devices, output devices, performances, and the context of use (standing, sitting, walkin...
  • Learning dynamic audio-visual mapping with input-output Hidden Markov models

    Publication Year: 2006, Page(s): 542 - 549
    Cited by: Papers (18)

    In this paper, we formulate the problem of synthesizing facial animation from an input audio sequence as a dynamic audio-visual mapping. We propose that audio-visual mapping should be modeled with an input-output hidden Markov model, or IOHMM. An IOHMM is an HMM for which the output and transition probabilities are conditional on the input sequence. We train IOHMMs using the expectation-maximizati...
  • Adaptive online transmission of 3-D TexMesh using scale-space and visual perception analysis

    Publication Year: 2006, Page(s): 550 - 563
    Cited by: Papers (3)

    Efficient online visualization of three-dimensional (3-D) mesh, mapped with photorealistic texture, is essential for a variety of applications such as museum exhibits and medical images. In these applications, synthetic texture or color per vertex loses authenticity and resolution. An image-based view dependent approach requires too much overhead to generate a 360° display for online applicati...
  • Toward intelligent music information retrieval

    Publication Year: 2006, Page(s): 564 - 574
    Cited by: Papers (55)

    Efficient and intelligent music information retrieval is a very important topic of the 21st century. With the ultimate goal of building personal music information retrieval systems, this paper studies the problem of intelligent music information retrieval. Huron points out that since the preeminent functions of music are social and psychological, the most useful characterization would be based on ...
  • Precise pitch profile feature extraction from musical audio for key detection

    Publication Year: 2006, Page(s): 575 - 584
    Cited by: Papers (17)

    The majority of pieces of music, including classical and popular music, are composed using music scales, such as keys. The key or the scale information of a piece provides important clues on its high level musical content, like harmonic and melodic context. Automatic key detection from music data can be useful for music classification, retrieval or further content analysis. Many researchers have ad...
  • Fusion of audio and motion information on HMM-based highlight extraction for baseball games

    Publication Year: 2006, Page(s): 585 - 599
    Cited by: Papers (40)

    This paper aims to extract baseball game highlights based on audio-motion integrated cues. In order to better describe different audio and motion characteristics in baseball game highlights, we propose a novel representation method based on likelihood models. The proposed likelihood models measure the "likeliness" of low-level audio features and motion features to a set of predefined audio types a...
  • TCP smoothness and window adjustment strategy

    Publication Year: 2006, Page(s): 600 - 609
    Cited by: Papers (4)

    We observe that even when the system throughput is relatively stable, end users of media-streaming applications do not necessarily experience smooth throughput, due to the unsynchronized window adjustments triggered by random congestion indications. We analyze and evaluate the negative impact of random window adjustments on smoothness, short-term fairness, and long-term fairness. We further propose...
  • Service differentiated peer selection: an incentive mechanism for peer-to-peer media streaming

    Publication Year: 2006, Page(s): 610 - 621
    Cited by: Papers (67) | Patents (3)

    We propose a service differentiated peer selection mechanism for peer-to-peer media streaming systems. The mechanism provides flexibility and choice in peer selection to the contributors of the system, resulting in high quality streaming sessions. Free-riders are given limited options in peer selection, if any, and hence receive low quality streaming. The proposed incentive mechanism follows the ch...
  • Adding lossless video compression to MPEGs

    Publication Year: 2006, Page(s): 622 - 625

    In this correspondence, we propose to add a lossless compression functionality into existing MPEGs by developing a new context tree to drive arithmetic coding for lossless video compression. In comparison with the existing work on context tree design, the proposed algorithm features 1) prefix sequence matching to locate the statistics model at the internal node nearest to the stopping point, wh...
  • Nonlinear collusion attack on a watermarking scheme for buyer authentication

    Publication Year: 2006, Page(s): 626 - 629
    Cited by: Papers (5)

    This paper presents an adaptive collusion attack on a buyer authentication watermarking scheme. To accomplish this attack, the traitors (i.e., dishonest buyers) select the pixels of their watermarked images generated from the same original image and average the selected pixels so as to remove the watermark information. Additionally, the forged image is of higher quality than any watermarked image....
  • Cryptanalysis of Chu's DCT based watermarking scheme

    Publication Year: 2006, Page(s): 629 - 632
    Cited by: Papers (3)

    In 2003, Chu proposed an oblivious watermarking algorithm by modifying the scheme proposed by Cox, Kilian, Leighton, and Shamoon in 1997, known as the CKLS scheme. In this correspondence, we report that the modification presented by Chu is susceptible to a suitably modified attack devised by Das and Maitra in 2004. In fact, the experimental results show that Chu's scheme is even weaker than t...
  • An effective music information retrieval method using three-dimensional continuous DP

    Publication Year: 2006, Page(s): 633 - 639
    Cited by: Papers (8)

    This paper describes a music information retrieval system that uses humming as the key for retrieval. Humming is an easy way for a user to input a melody. However, there are several problems with humming that degrade the retrieval of information. One problem is the human factor. Sometimes, people do not sing accurately, especially if they are inexperienced or unaccompanied. Another problem arises ...
  • IEEE Transactions on Multimedia Edics

    Publication Year: 2006, Page(s): 640
    Freely Available from IEEE
  • IEEE Transactions on Multimedia information for authors

    Publication Year: 2006, Page(s): 641 - 642
    Freely Available from IEEE

Aims & Scope

The periodical covers the various aspects of research in multimedia technology and applications of multimedia.

Meet Our Editors

Editor-in-Chief
Wenwu Zhu
Department of Computer Science
Tsinghua University
Beijing, China
Tel: (+86 10) 6279 0967
wwzhu@tsinghua.edu.cn