Ninth IEEE International Symposium on Multimedia (ISM 2007)

10-12 Dec. 2007

Filter Results

Displaying Results 1 - 25 of 51
  • Ninth IEEE International Symposium on Multimedia - Cover

    Publication Year: 2007, Page(s): c1
    Request permission for commercial reuse | PDF file iconPDF (161 KB)
    Freely Available from IEEE
  • Ninth IEEE International Symposium on Multimedia - Title page

    Publication Year: 2007, Page(s):i - iii
    Request permission for commercial reuse | PDF file iconPDF (107 KB)
    Freely Available from IEEE
  • Ninth IEEE International Symposium on Multimedia - Copyright

    Publication Year: 2007, Page(s): iv
    Request permission for commercial reuse | PDF file iconPDF (72 KB)
    Freely Available from IEEE
  • Ninth IEEE International Symposium on Multimedia - TOC

    Publication Year: 2007, Page(s):v - viii
    Request permission for commercial reuse | PDF file iconPDF (137 KB)
    Freely Available from IEEE
  • General Co-chairs' Foreword

    Publication Year: 2007, Page(s):ix - x
    Request permission for commercial reuse | PDF file iconPDF (104 KB) | HTML iconHTML
    Freely Available from IEEE
  • Message from the Program Chairs

    Publication Year: 2007
    Request permission for commercial reuse | PDF file iconPDF (88 KB) | HTML iconHTML
    Freely Available from IEEE
  • Conference Organizing Committee

    Publication Year: 2007, Page(s):xii - xiv
    Request permission for commercial reuse | PDF file iconPDF (104 KB)
    Freely Available from IEEE
  • Technical Program Committee

    Publication Year: 2007, Page(s):xv - xviii
    Request permission for commercial reuse | PDF file iconPDF (113 KB)
    Freely Available from IEEE
  • The Design of a Multi-party VoIP Conferencing System over the Internet

    Publication Year: 2007, Page(s):3 - 10
    Cited by:  Papers (9)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (248 KB) | HTML iconHTML

    In this paper, we present the design of a VoIP conferencing system that enables the voice communication of multiple users in the Internet. After studying the conversational dynamics in multi-party conferencing, we identify user-observable metrics that affect the perception of conversational quality and their trade-offs. Based on the dynamics and the behavior on delays, jitters, and losses of Inter... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • The Role of QoE on IPTV Services style

    Publication Year: 2007, Page(s):11 - 13
    Cited by:  Papers (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (422 KB) | HTML iconHTML

    The IPTV, Internet Protocol TV, is one of the hottest topics as an emerging service. This new media service has a significant potential where a various kind of content can be enjoyed in a variety of way. We are living in the content-centric world. This flood of data thanks to the evolution of the hardware since 60 year- old transistor technology becomes the potential problem these days. The user e... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A New Image Compression Scheme Based on Locally Adaptive Coding

    Publication Year: 2007, Page(s):14 - 21
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1246 KB) | HTML iconHTML

    Vector quantization (VQ) is a simple and widely used compression technology in many applications. For image compression, VQ provides both a fixed compression ratio and maintains acceptable distortion. However, the performance of VQ still can be improved in terms of the image quality of compressed images and codebook size used for encoding and decoding. In this paper, a new VQ-like image compressio... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Making Sense of Ubiquitous Media style

    Publication Year: 2007, Page(s):22 - 26
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (571 KB) | HTML iconHTML

    In the emerging Post-PC era, more and more computers 'in the net' can see, hear, or feel. Since these computers are networked, they can cooperate in the interpretation of their 'sensation'. Cameras, camcorders, etc. will soon be wirelessly connected, doubling as mobile phones. In other words: multimedia goes ubiquitous. On the other hand, users leverage off the wealth of text-based information pre... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Detection of Questions in Arabic Audio Monologues Using Prosodic Features

    Publication Year: 2007, Page(s):29 - 36
    Cited by:  Papers (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (203 KB) | HTML iconHTML

    Prosody has been widely used in many speech-related applications including speaker and word recognition, emotion and accent identification, topic and sentence segmentation, and text-to-speech applications. An important application we investigate is that of identifying question sentences in Arabic monologue lectures. Languages other than Arabic have received a lot of attention in this regard. We ap... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Multi-stream Asynchrony Modeling for Audio-Visual Speech Recognition

    Publication Year: 2007, Page(s):37 - 44
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (405 KB) | HTML iconHTML

    In this paper, two multi-stream asynchrony Dynamic Bayesian Network models (MS-ADBN model and MM-ADBN model) are proposed for audio-visual speech recognition (AVSR). The proposed models, with different topology structures, loose the asynchrony of audio and visual streams to word level. For MS-ADBN model, both in audio stream and in visual stream, each word is composed of its corresponding phones, ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • An Adaptive Audio Quantizer for Voip Systems

    Publication Year: 2007, Page(s):45 - 55
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (199 KB) | HTML iconHTML

    The Internet evolution has been requiring the development of new technology to support multimedia transmission such as images, database access, audio and video in realtime. Such development needs new services and supports like the voice over IP (VoIP) which has a main motivation in the low cost communication and management. VoIP systems have motivated this work which proposes an adaptive audio qua... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Complexity Reduction and Fast Algorithm for 2-D Integer Discrete Wavelet Transform Using Symmetric Mask-Based Scheme

    Publication Year: 2007, Page(s):57 - 64
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (619 KB) | HTML iconHTML

    Wavelet coding has been shown to be better than discrete cosine transform (DCT) in image/video processing. Moreover, it has the feature of scalability, which is involved in modern video standards. This work presents novel algorithms, namely 2-D symmetric mask-based discrete wavelet transform (SMDWT), to improve the critical issue of the 2-D lifting-based discrete wavelet transform (LDWT), and then... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Local Binary Patterns for Human Detection on Hexagonal Structure

    Publication Year: 2007, Page(s):65 - 71
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (324 KB) | HTML iconHTML

    Local binary pattern (LBP) was designed and has been widely used for efficient texture classification. LBP provides a simple and effective way to represent texture patterns. Uniform LBPs play an important role for LBP-based pattern/object recognition as they include majority of LBPs. On the other hand, Human detection based on Mahalanobis distance map (MDM) recognizes appearance of human based on ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Adaptive Early Termination for Fast H.264 Video Coding

    Publication Year: 2007, Page(s):72 - 77
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (358 KB) | HTML iconHTML

    The H.264 standard applies several powerful coding methods to obtain high compression efficiency. However, it requires a lot of computation especially in variable block-size motion estimation. To reduce the motion estimation redundancy more effectively, an adaptive early termination algorithm is proposed in this paper. The proposed algorithm dynamically changes the thresholds for different coding ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Spatial-Temporal Error Detection Scheme for Video Transmission over Noisy Channels

    Publication Year: 2007, Page(s):78 - 85
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (279 KB) | HTML iconHTML

    Error detection plays an important role in an error- robust video decoder. In this paper, a spatial-temporal error detection scheme for a video decoder is proposed. By considering inherently spatial and temporal similarities in video sequences, the visually corrected macroblocks in the decoded frames are detected by employing a set of error detection procedures, where one cross-boundary similarity... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Moving Region Detection by Transportation Problem Solving

    Publication Year: 2007, Page(s):86 - 91
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (2020 KB) | HTML iconHTML

    In this paper, we propose a novel moving region detection method from the viewpoint of solving the transportation problem. This method extracts the relations between regions as a solution to the transformation problem between pixels belonging to adjacent frames. Moving regions are detected by utilizing the properties of these relations. This method does not require any models such as prior knowled... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Summarization of Wearable Videos Based on User Activity Analysis

    Publication Year: 2007, Page(s):92 - 99
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1610 KB) | HTML iconHTML

    This paper presents a model for automatic summarization of videos recorded by wearable cameras. The proposed model detects various user activities by computing the transform of matching image features among video frames. Four basic types of user activities are proposed, including "moving closer /farther", "panning", "making a turn", and "rotation". Different summarization techniques are provided f... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Feature-Based Full-Frame Image Stabilization

    Publication Year: 2007, Page(s):100 - 106
    Cited by:  Papers (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1630 KB) | HTML iconHTML

    Digital image stabilization usually discards boundary pixels and outputs a smaller video. In this paper, we present a new digital image stabilization algorithm that preserves the frame size of output video by pixel filling. The proposed algorithm eliminates the accumulation error by directly estimating the global motions in a transformation chain with reference to a fixed frame. A feature matching... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Event-Based Segmentation of Sports Video Using Motion Entropy

    Publication Year: 2007, Page(s):107 - 111
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (293 KB) | HTML iconHTML

    An event-based segmentation method for sports videos is presented. A motion entropy criterion is employed to characterize the level of intensity of relevant object motion in individual frames of a video sequence. The resulting motion entropy curve then is approximated with a piece-wise linear model using a homoscedastic error model based time series change point detection algorithm. It is observed... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • RISA: A Real-Time Interactive Shadow Avatar

    Publication Year: 2007, Page(s):112 - 122
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (441 KB) | HTML iconHTML

    As Webcams become an important factor in the PC environment, many camera-based communication techniques have been developed. Among them, gesture-based communication is attracting attention. In this paper, we propose a real-time interactive shadow avatar (RISA) which can express facial emotions by changing as response to the user's gestures. The avatar's shape is a virtual shadow constructed from a... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A General Scheme for Extracting QR Code from a Non-uniform Background in Camera Phones and Applications

    Publication Year: 2007, Page(s):123 - 130
    Cited by:  Papers (18)  |  Patents (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (956 KB) | HTML iconHTML

    With the rapid advances in mobile communication technologies, QR code in the embedded camera devices has been used as new input interfaces. However, the previous works for extracting QR code from an image do not consider a non-uniform background. In this paper, we implement the applications of QR code and propose an efficient algorithm to extract QR code from the non-uniform background. In contras... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.