2006 IEEE International Conference on Multimedia and Expo

Date 9-12 July 2006

  • [Front cover]

    Publication Year: 2006, Page(s): C1
  • 2006 IEEE International Conference on Multimedia and Expo

    Publication Year: 2006, Page(s): i
  • Copyright page

    Publication Year: 2006, Page(s): ii
  • Welcome

    Publication Year: 2006, Page(s): iii
  • Technical program overview

    Publication Year: 2006
  • Organizing Committee

    Publication Year: 2006, Page(s): vi
  • Technical Program Committee

    Publication Year: 2006, Page(s): vii
  • Special Sessions

    Publication Year: 2006, Page(s): viii
  • Review Committee

    Publication Year: 2006, Page(s):viii - xii
  • Table of contents

    Publication Year: 2006
  • Key Frame Extraction in 3D Video by Rate-Distortion Optimization

    Publication Year: 2006, Page(s):1 - 4
    Cited by:  Papers (5)

    3D video, which consists of a sequence of 3D mesh models, can provide detailed 3D information in both the spatial and temporal domains. In this paper, a key frame extraction method has been developed to summarize 3D video by rate-distortion optimization. For this purpose, we introduce an effective feature vector extraction algorithm from 3D video. Prior to key frame extraction, shot detection is perfor...
  • Evolutionary Feature Generation in Speech Emotion Recognition

    Publication Year: 2006, Page(s):5 - 8
    Cited by:  Papers (11)

    Feature sets are broadly discussed within speech emotion recognition by acoustic analysis. While popular filter and wrapper based search help to retrieve relevant ones, we feel that automatic generation of such allows for more flexibility throughout search. The basis is formed by dynamic low-level descriptors considering intonation, intensity, formants, spectral information and others. Next, syste...
  • Identification and Detection of the Same Scene Based on Flash Light Patterns

    Publication Year: 2006, Page(s):9 - 12
    Cited by:  Papers (1)

    A method has been developed to identify video shots of the same scene where camera flash lights are observed, and the method has been tested by using it to detect such shots from a large TV video archive. Camera flashes are often used in impressive scenes, such as interviews of important persons. Because such scenes are broadcast repeatedly on various TV programs, a method for detecting them is ...
  • Detecting Musical Sounds in Broadcast Audio Based on Pitch Tuning Analysis

    Publication Year: 2006, Page(s):13 - 16
    Cited by:  Papers (1)

    Detecting the presence of musical sounds in broadcast audio is important for content-based indexing and retrieval of auditory and visual information in radio and TV programs. In this paper, we propose a novel approach for musical sounds detection in broadcast audio based on the analysis of the characteristic feature of musical tones, pitch tuning. A spectral analysis method is presented for detect...
  • Scalability of Multimedia Applications on Next-Generation Processors

    Publication Year: 2006, Page(s):17 - 20
    Cited by:  Papers (2)  |  Patents (2)

    In the near future, the majority of personal computers are expected to have several processing units. This is referred to as chip multiprocessing (CMP). Furthermore, each of the computation units will be capable of running multiple hardware threads. To benefit from the additional processing power, application developers should multithread their software. This paper studies the scalability (expecte...
  • On Parallelization of a Video Mining System

    Publication Year: 2006, Page(s):21 - 24
    Cited by:  Papers (2)

    As digital video data becomes more pervasive, mining information from multimedia data becomes increasingly important. Although research in the multimedia mining area has shown great potential in daily life, the huge computational requirement prohibits its wide use in practice. Since personal computers are shifting from uniprocessors to multicore processors, exploiting thread level parallelism in ...
  • Collaborative Multithreading: An Open Scalable Processor Architecture for Embedded Multimedia Applications

    Publication Year: 2006, Page(s):25 - 28
    Cited by:  Patents (4)

    Numerous approaches can be employed in exploiting computation power in processors such as superscalar, VLIW, SMT and multi-core on chip. In this paper, a UniCore VisoMT processor is proposed, which unifies VLIW and multithreading by providing an efficient control and data communication model, while offering explicit parallelisms for embedded applications. The architecture concurrently executes a m...
  • Video Analysis and Compression on the STI Cell Broadband Engine Processor

    Publication Year: 2006, Page(s):29 - 32
    Cited by:  Papers (4)

    With increased concern for physical security, video surveillance is becoming an important business area. Similar camera-based systems can also be used in such diverse applications as retail-store shopper motion analysis and casino behavioral policy monitoring. There are two aspects of video surveillance that require significant computing power: image analysis for detecting objects, and video compre...
  • Computational Intelligence Techniques and their Applications in Content-Based Image Retrieval

    Publication Year: 2006, Page(s):33 - 36
    Cited by:  Papers (1)

    The main focus of this paper is to present a methodology for optimizing relevance identification in content-based image retrieval (CBIR) systems through the principle of feature weight detection. The purpose of relevance identification is to find a collection of images that are statistically similar to, or match with, an original query image within a large visual database. The novelty of this sche...
  • Video News Shot Labeling Refinement via Shot Rhythm Models

    Publication Year: 2006, Page(s):37 - 40
    Cited by:  Papers (1)

    We present a three-step post-processing method for increasing the precision of video shot labels in the domain of television news. First, we demonstrate that news shot sequences can be characterized by rhythms of alternation (due to dialogue), repetition (due to persistent background settings), or both. Thus a temporal model is necessarily third-order Markov. Second, we demonstrate that the output...
  • Which Thousand Words are Worth a Picture? Experiments on Video Retrieval using a Thousand Concepts

    Publication Year: 2006, Page(s):41 - 44
    Cited by:  Papers (6)

    In contrast to traditional video retrieval that represents visual content with low-level features (e.g. color and texture), emerging concept-based video retrieval allows users to search video archives by specifying a limited number of high-level concepts (e.g. outdoors and car). Recent studies have demonstrated the feasibility of concept-based retrieval, but a fundamental question remains: what ki...
  • Clustering-Based Analysis of Semantic Concept Models for Video Shots

    Publication Year: 2006, Page(s):45 - 48
    Cited by:  Papers (1)

    In this paper we present a clustering-based method for representing semantic concepts on multimodal low-level feature spaces and study the evaluation of the goodness of such models with entropy-based methods. As different semantic concepts in video are most accurately represented with different features and modalities, we utilize the relative model-wise confidence values of the feature extraction ...
  • Overview of Low-Complexity Video Transcoding from H.263 to H.264

    Publication Year: 2006, Page(s):49 - 52
    Cited by:  Papers (3)  |  Patents (8)

    With the standardization of H.264/AVC by ITU-T and ISO/IEC and the adaptation into new hardware, the necessity of transcoding between existing standards and H.264 will arise to achieve interoperability between hardware devices. Because of the many new prediction parameters as well as the pixel-based deblocking filter and the new transform of H.264, this is a difficult task to perform. In our work...
  • An Efficient Algorithm for VC-1 to H.264 Video Transcoding in Progressive Compression

    Publication Year: 2006, Page(s):53 - 56
    Cited by:  Papers (6)

    High-definition video adoption has been growing rapidly for the last two years. The two high-definition DVD formats, HD-DVD and Blu-ray, have mandated MPEG-2, H.264 and VC-1 as video compression formats. The coexistence of these different video coding standards creates a need for transcoding. In this paper, an efficient transcoding algorithm from VC-1 video to H.264 video is discussed. While the...
  • MPEG-4 to H.264 Transcoding using Macroblock Statistics

    Publication Year: 2006, Page(s):57 - 60
    Cited by:  Papers (7)  |  Patents (4)

    In this paper, a temporal resolution reduction transcoding method that transforms an MPEG-4 video bitstream into an H.264 video bitstream is proposed. The block mode statistics and motion vectors in the MPEG-4 bitstream are utilized in the H.264 encoder for block mode conversion and motion vector interpolation methods. The proposed motion vector interpolation methods are developed not to perform b...