By Topic

IEEE Transactions on Multimedia

Issue 4 • June 2008

Filter Results

Displaying Results 1 - 16 of 16
  • Table of contents

    Publication Year: 2008, Page(s):C1 - C4
    Request permission for commercial reuse | PDF file iconPDF (44 KB)
    Freely Available from IEEE
  • IEEE Transactions on Multimedia publication information

    Publication Year: 2008, Page(s): C2
    Request permission for commercial reuse | PDF file iconPDF (36 KB)
    Freely Available from IEEE
  • Digital-Invisible-Ink Data Hiding Based on Spread-Spectrum and Quantization Techniques

    Publication Year: 2008, Page(s):557 - 569
    Cited by:  Papers (11)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1476 KB) | HTML iconHTML

    A novel data-hiding methodology, denoted as digital invisible ink (DII), is proposed to implement secure steganography systems. Like the real-world invisible ink, secret messages will be correctly revealed only after the marked works undergo certain prenegotiated manipulations, such as lossy compression and processing. Different from conventional data-hiding schemes where content processing or com... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Audio–Visual Affective Expression Recognition Through Multistream Fused HMM

    Publication Year: 2008, Page(s):570 - 577
    Cited by:  Papers (58)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (518 KB) | HTML iconHTML

    Advances in computer processing power and emerging algorithms are allowing new ways of envisioning human-computer interaction. Although the benefit of audio-visual fusion is expected for affect recognition from the psychological and engineering perspectives, most of existing approaches to automatic human affect analysis are unimodal: information processed by computer system is limited to either fa... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Human Age Estimation With Regression on Discriminative Aging Manifold

    Publication Year: 2008, Page(s):578 - 584
    Cited by:  Papers (167)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1035 KB) | HTML iconHTML

    Recently, extensive studies on human faces in the human-computer interaction (HCI) field reveal significant potentials for designing automatic age estimation systems via face image analysis. The success of such research may bring in many innovative HCI tools used for the applications of human-centered multimedia communication. Due to the temporal property of age progression, face images with aging... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A Vision-Based Augmented-Reality System For Multiuser Collaborative Environments

    Publication Year: 2008, Page(s):585 - 595
    Cited by:  Papers (4)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1692 KB) | HTML iconHTML

    This work presented a novel vision-based augmented-reality system for applications in multiuser collaborative environments. The kernel technology of this vision-based system locates the cameras that are utilized to point and simulate the positions of multiple viewers. Camera calibration based on computer vision is employed during the camera's locating process. The applications in multiuser collabo... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Co-clustering for Auditory Scene Categorization

    Publication Year: 2008, Page(s):596 - 606
    Cited by:  Papers (11)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1151 KB) | HTML iconHTML

    Auditory scenes are temporal audio segments with coherent semantic content. Automatically classifying and grouping auditory scenes with similar semantics into categories is beneficial for many multimedia applications, such as semantic event detection and indexing. For such semantic categorization, auditory scenes are first characterized with either low-level acoustic features or some mid-level rep... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A Multimodal and Multilevel Ranking Scheme for Large-Scale Video Retrieval

    Publication Year: 2008, Page(s):607 - 619
    Cited by:  Papers (17)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1056 KB) | HTML iconHTML

    A critical issue of large-scale multimedia retrieval is how to develop an effective framework for ranking the search results. This problem is particularly challenging for content-based video retrieval due to some issues such as short text queries, insufficient sample learning, fusion of multimodal contents, and large-scale learning with huge media data. In this paper, we propose a novel multimodal... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Video Annotation Based on Kernel Linear Neighborhood Propagation

    Publication Year: 2008, Page(s):620 - 628
    Cited by:  Papers (27)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1048 KB) | HTML iconHTML

    The insufficiency of labeled training data for representing the distribution of the entire dataset is a major obstacle in automatic semantic annotation of large-scale video database. Semi-supervised learning algorithms, which attempt to learn from both labeled and unlabeled data, are promising to solve this problem. In this paper, a novel graph-based semi-supervised learning method named kernel... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Interactive Transmission of JPEG2000 Images Using Web Proxy Caching

    Publication Year: 2008, Page(s):629 - 636
    Cited by:  Papers (7)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1030 KB) | HTML iconHTML

    This paper describes and analyzes JPIP-W, an innovative proposal for the interactive transmission of JPEG2000 images on the Internet. JPIP-W is an extension of JPIP, the current JPEG protocol proposed for interactive JPEG2000 image browsing. One of the JPIP characteristics of greatest interest is its ability to use the Web for retrieving images. However, JPIP is unable to exploit the large infrast... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • No-Reference PSNR Estimation for Quality Monitoring of Motion JPEG2000 Video Over Lossy Packet Networks

    Publication Year: 2008, Page(s):637 - 645
    Cited by:  Papers (8)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (2006 KB) | HTML iconHTML

    In this paper, we propose a no-reference (NR) method for estimating the degradation of Motion JPEG2000 (MJP2) video due to packet loss. The proposed method estimates the peak signal-to-noise ratio (PSNR) between received MJP2 frames affected by packet loss and the originally encoded frames without having information about the latter. The PSNR of a frame could be estimated in the wavelet domain, na... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Versatile Transcoding Proxy for Internet Content Adaptation

    Publication Year: 2008, Page(s):646 - 658
    Cited by:  Papers (17)  |  Patents (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1231 KB) | HTML iconHTML

    Recent technology advances in multimedia communication have ushered in a new era of personal communication. Users can ubiquitously access the Internet via various mobile devices. For the mobile devices featured with lower bandwidth network connectivity, transcoding can be used to reduce the object size by lowering the quality of a multimedia object. In view of the monolithic transcoders which only... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Recognizing Human Emotional State From Audiovisual Signals

    Publication Year: 2008, Page(s):659 - 668
    Cited by:  Papers (24)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1569 KB) | HTML iconHTML

    Machine recognition of human emotional state is an important component for efficient human-computer interaction. The majority of existing works address this problem by utilizing audio signals alone, or visual information only. In this paper, we explore a systematic approach for recognition of human emotional state from audiovisual signals. The audio characteristics of emotional speech are represen... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • IEEE Transactions on Multimedia information for authors

    Publication Year: 2008, Page(s):669 - 670
    Request permission for commercial reuse | PDF file iconPDF (47 KB)
    Freely Available from IEEE
  • Special Issue on Quality-Driven Cross-Layer Design for Multimedia Communications

    Publication Year: 2008, Page(s):671 - 672
    Request permission for commercial reuse | PDF file iconPDF (283 KB)
    Freely Available from IEEE
  • IEEE Transactions on Multimedia society information

    Publication Year: 2008, Page(s): C3
    Request permission for commercial reuse | PDF file iconPDF (27 KB)
    Freely Available from IEEE

Aims & Scope

The scope of the Periodical is the various aspects of research in multimedia technology and applications of multimedia.

Full Aims & Scope

Meet Our Editors

Editor-in-Chief
Wenwu Zhu
Department of Computer Science
Tsinghua University
Beijing, China
Tel: (+86 10) 6279 0967
wwzhu@tsinghua.edu.cn