2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763)

27-30 June 2004

Displaying Results 1 - 25 of 195
  • 2004 IEEE International Conference on Multimedia and Expo (ICME) - Cover

    Publication Year: 2004, Page(s): 0_1
    Freely Available from IEEE
  • [Blank page]

    Publication Year: 2004, Page(s): 0_2
    Freely Available from IEEE
  • 2004 IEEE International Conference on Multimedia and Expo (ICME) - Title

    Publication Year: 2004, Page(s): i
    Freely Available from IEEE
  • Copyright

    Publication Year: 2004, Page(s): ii
    Freely Available from IEEE
  • Message from the General Chair

    Publication Year: 2004, Page(s): iii
    Freely Available from IEEE
  • Message From the Technical Program Chair

    Publication Year: 2004, Page(s): iv
    Freely Available from IEEE
  • Table of contents

    Publication Year: 2004, Page(s): v - xxviii
    Freely Available from IEEE
  • A high-performance area-aware DSP processor architecture for video codecs

    Publication Year: 2004, Page(s):1499 - 1502 Vol.3

    In this paper, we propose a high-performance and area-aware very long instruction word (VLIW) DSP architecture using a flexible single instruction multiple data (SIMD) approach and a grouped permutation (GP) structure register file, respectively. Via the proposed data path architecture, the execution cycles for digital filter and RGB2YUV benchmarks can be reduced by up to 50% compar...
  • Integrating virtual camera controls into digital video

    Publication Year: 2004, Page(s):1503 - 1506 Vol.3
    Cited by:  Papers (2)

    Virtual camera controls (VCC) for digital video enable the viewers to visualize interesting objects from their perspective. VCC also allow us to play the video from different angles. In this sense, VCC do not only support visualization and browsing capabilities but also support playback from different angles. VCC management requires accurate global motion estimation and accurate sprite generation....
  • Distributed construction of resource-efficient overlay tree by approximating MST

    Publication Year: 2004, Page(s):1507 - 1510 Vol.3

    This paper presents a distributed protocol called RESMO for constructing an overlay tree to support video streaming applications. RESMO reduces network resource usage by approximating minimum spanning tree and achieves low end-to-end latency between the sender and each receiver at the same time. The resulting overlay is a compromise between overlay minimum spanning tree and shortest path tree. We ...
  • Change aware distributed file system for a distributed search engine

    Publication Year: 2004, Page(s):1511 - 1514 Vol.3
    Cited by:  Patents (10)

    Intranet information retrieval is very important for discovering useful knowledge. In this process, the search engine is useful. However, conventional search engines, which are based on a centralized architecture, are not suited for intranet information retrieval because intranet information is frequently updated. Centralized search engines take a long time to collect Web pages by robots. So, we h...
  • Low-complexity linear array multiplier for normal basis of type-II

    Publication Year: 2004, Page(s):1515 - 1518 Vol.3
    Cited by:  Papers (3)

    In many cryptographic applications, the finite field is usually defined using the normal basis representation, and accordingly, much research on fast implementation of such a basis has been reported. By restricting the characteristics of an optimal normal basis (ONB) of type-II, we used the palindromic representation of type-II ONB to derive a linear array multiplier with a low-complexity architecture. Th...
  • Experimental study of dual microphone systems

    Publication Year: 2004, Page(s):1519 - 1522 Vol.3

    In this paper, we compare and evaluate the various noise cancellation schemes available in what we term a dual microphone system. A dual microphone system (DMS) is a composite directional audio-capturing device which consists of two microphones, each microphone having possibly different directional characteristics, e.g., omnidirectional, bidirectional or cardioid. By recasting the various combinat...
  • A VLSI prototype for Hadamard transform with application to MPEG-4 part 10

    Publication Year: 2004, Page(s):1523 - 1526 Vol.3
    Cited by:  Papers (7)

    This paper presents a VLSI prototype for the 2×2 Hadamard transform that is applied to the DC coefficients of the four 4×4 blocks of each chroma component as described in the MPEG-4 part 10 advanced video coding (AVC) standard. A VLSI prototype for the quantization process that accompanies the transform operation is given as well. The implemented transform represents a level in the...
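    As a rough illustration of the 2×2 Hadamard transform named in the abstract above, the following Python sketch applies the transform to the four chroma DC coefficients arranged as a 2×2 matrix. It only shows the arithmetic Y = H·X·H with H = [[1, 1], [1, -1]]; it is not the paper's VLSI design and it omits the quantization stage. The function name `hadamard_2x2` is hypothetical.

```python
def hadamard_2x2(dc):
    """Return H * dc * H for H = [[1, 1], [1, -1]].

    `dc` is a 2x2 list of integers, assumed here to hold the DC
    coefficients of the four 4x4 blocks of one chroma component.
    """
    (a, b), (c, d) = dc
    # Written out as additions and subtractions only (no multiplications).
    return [
        [a + b + c + d, a - b + c - d],
        [a + b - c - d, a - b - c + d],
    ]


if __name__ == "__main__":
    coeffs = [[1, 2], [3, 4]]
    print(hadamard_2x2(coeffs))
```

    Applying the function twice returns the input scaled by 4, so the same butterfly can serve as the inverse up to a constant factor.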
  • A fast algorithm and hardware implementation for rate-distortion optimization in JPEG2000

    Publication Year: 2004, Page(s):1527 - 1530 Vol.3

    A fast algorithm for the T2 encoder in JPEG2000 suitable for hardware implementation is presented in this paper, based on the elaborate analysis of a rate-distortion optimization algorithm. By reducing computational complexity, the difficulty of hardware implementation for the T2 encoder is reduced, and the parallelizability of the JPEG2000 hardware system i...
  • News Tuner: a simple interface for searching and browsing radio archives

    Publication Year: 2004, Page(s):1531 - 1534 Vol.3
    Cited by:  Papers (3)

    We present in this paper a new Web-based application, called the News Tuner, for searching and browsing large radio archives. While popular search engines provide means for finding text and images, our approach combines semantic and acoustic search for efficient retrieval of audio documents. Semantic search allows the user to retrieve stories for a given concept, while acoustic search allows rando...
  • Merging rank lists from multiple sources in video classification

    Publication Year: 2004, Page(s):1535 - 1538 Vol.3
    Cited by:  Papers (1)  |  Patents (1)

    Multimedia corpora increasingly consist of data from multiple sources, with different characteristics that can be exploited by specialized applications. This paper focuses on video classification over multiple-source collections, and addresses the question whether classifiers should train from individual sources or from a full data set across all sources. If training separately, how can rank lists...
  • Small and fast moving object detection and tracking in sports video sequences

    Publication Year: 2004, Page(s):1539 - 1542 Vol.3
    Cited by:  Papers (5)

    We propose an algorithm for detection and tracking of small and fast moving objects, like a ping pong ball or a cricket ball, in sports video sequences. For detection, the proposed method uses only motion as a cue; moreover it does not use any texture information. Our method is able to detect the object with very low contrast and negligible texture content. Along with detection, we also propose a ...
  • Co-histogram and its application in video analysis

    Publication Year: 2004, Page(s):1543 - 1546 Vol.3

    The tool for video analysis addressed in this paper is called the co-histogram, a statistical graph generated by counting the corresponding pixel pairs of two images. A co-histogram shows how the pixels are distributed among combinations of two image pixel values. By means of the co-histogram, we can have a visual perception of a widely used metric, peak signal-to-noise ratio (PSNR), and the sy...
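    The counting step described in the abstract above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: images are plain nested lists of integer gray levels, and the names `co_histogram` and `psnr_from_co_histogram` are hypothetical. The second function relies only on the standard PSNR definition, using the fact that the mean squared error can be summed over (a, b) value pairs weighted by their counts.

```python
import math


def co_histogram(img_a, img_b, levels=256):
    """Count co-located pixel pairs: hist[a][b] is the number of positions
    where img_a has value a and img_b has value b."""
    if len(img_a) != len(img_b) or any(
        len(ra) != len(rb) for ra, rb in zip(img_a, img_b)
    ):
        raise ValueError("images must have the same shape")
    hist = [[0] * levels for _ in range(levels)]
    for row_a, row_b in zip(img_a, img_b):
        for a, b in zip(row_a, row_b):
            hist[a][b] += 1
    return hist


def psnr_from_co_histogram(hist, peak=255):
    """Compute PSNR in dB from a co-histogram (standard definition)."""
    total = sum(sum(row) for row in hist)
    squared_error = sum(
        count * (a - b) ** 2
        for a, row in enumerate(hist)
        for b, count in enumerate(row)
    )
    if squared_error == 0:
        return math.inf  # identical images
    mse = squared_error / total
    return 10 * math.log10(peak ** 2 / mse)


if __name__ == "__main__":
    original = [[0, 1], [2, 3]]
    degraded = [[0, 1], [2, 2]]
    h = co_histogram(original, degraded, levels=4)
    print(h)
    print(psnr_from_co_histogram(h, peak=3))
```

    Counts on the diagonal of the co-histogram correspond to unchanged pixels; mass far from the diagonal contributes most to the squared error and therefore lowers the PSNR.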
  • A dynamic model integrating colour and shape information for objects tracking in conditions of occlusion

    Publication Year: 2004, Page(s):1547 - 1550 Vol.3
    Cited by:  Papers (1)

    In this paper, an algorithm for tracking multiple rigid and non-rigid objects in conditions of occlusion is presented. The proposed method relies on a scalable and adaptive model built on joint color and shape information. Through a GHT (generalized Hough transform) based voting method the center of mass of each object can be determined in real time with a good degree of precision. Quantitati...
  • A three-layer event detection framework and its application in soccer video

    Publication Year: 2004, Page(s):1551 - 1554 Vol.3
    Cited by:  Papers (2)

    In this paper, a three-layer event detection scheme is proposed and applied to shoot and card event detection in soccer videos. At the lowest layer, low-level features including color, texture, edge and motion are considered. At the middle layer, a concept of semantic unit is presented to bridge the semantic gap. The semantic unit is a sequence of consecutive frames tagged with a special semantic cue. At ...
  • A robust Hough-based algorithm for partial ellipse detection in broadcast soccer video

    Publication Year: 2004, Page(s):1555 - 1558 Vol.3
    Cited by:  Papers (4)

    This paper presents a robust Hough-based algorithm for partial slightly oblique ellipse detection in broadcast soccer video. The successful identification of the ellipses significantly facilitates soccer video analysis. The existing standard and various modified ellipse Hough transforms measure a cell in the Hough space as though the ellipse defined by the cell were a complete ellipse. Hence, they ...
  • Patch-based natural object detection using CF*IRF

    Publication Year: 2004, Page(s):1559 - 1562 Vol.3

    In this paper, we propose a patch-based approach for detecting natural objects on keyframes of video shots. We apply it to the extraction of the semantic features "vegetation" and "animal", and to some search tasks in TRECVID2003. Our detection method is based on color and texture features, and considers the spatial information as well. TRECVID evaluation shows that our approach works effectively and c...
  • Applications and specificities of synthetic/synthetic projective registration

    Publication Year: 2004, Page(s):1563 - 1566 Vol.3

    A recent application related to 3D watermarking has led to a specific registration problem: the registration of a 3D computer object with a computer-generated 2D view of it. So far, projective registration algorithms have focused on images of real objects because there was no interest in registering synthetic images with computer models. While those algorithms could also be directly applied to the...
  • Motion-based event detection and semantic classification for baseball sport videos

    Publication Year: 2004, Page(s):1567 - 1570 Vol.3
    Cited by:  Papers (3)

    In this paper, the techniques of event detection and semantic classification of baseball sport videos are investigated. Due to abundant motion information in sport videos, motion vectors are estimated, validated, and used to compute both the motion activity and camera motion parameters of a frame. Considering the domain-specific knowledge of baseball sport, behaviors of motion features in the spat...