1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175)

7-9 Dec. 1998

Filter Results

Displaying Results 1 - 25 of 102
  • 1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175)

    Publication Year: 1998
    Request permission for commercial reuse | |PDF file iconPDF (317 KB)
    Freely Available from IEEE
  • Robust H.263 video transmission over spectrally shaped channels using multicarrier modulation

    Publication Year: 1998, Page(s):585 - 590
    Cited by:  Papers (5)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (364 KB)

    This paper proposes an efficient and robust technique for transmission of entropy coded video over noisy channels, especially for spectrally shaped channels which are commonly used in ADSL environments. Combining multicarrier modulation and the error resilient entropy coding (EREC) technique together, we develop a new loading algorithm which achieves unequal error protection (UEP) by optimally all... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Author index

    Publication Year: 1998, Page(s):635 - 638
    Request permission for commercial reuse | |PDF file iconPDF (103 KB)
    Freely Available from IEEE
  • Capture and synthesis of human motion in video sequences

    Publication Year: 1998, Page(s):111 - 116
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (320 KB)

    We present a knowledge-based framework to capture and represent human walkers in video. The system models the human body as an articulated object of twelve rigid body-parts whose motions are almost periodic and subject to dynamic constraints. The resulting representation is compact and composed of the motion, shape, and texture for each of the body-parts. We apply the representation to regenerate ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Estimating camera parameters from motion vectors of digital video

    Publication Year: 1998, Page(s):105 - 110
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (416 KB)

    In the development of a method of estimating the global camera motion from encoded video, we propose a robust estimation based on an outlier rejection scheme. The extended Kalman filter and the Levenberg-Marquardt method are implemented for the estimation. Both implementations show promising results, but from a computational aspect, the LM method is more efficient than the extended Kalman filter i... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • The virtual museum: an integrated text and image database

    Publication Year: 1998, Page(s):253 - 258
    Cited by:  Patents (3)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (260 KB)

    We describe our “virtual museum” project: an union of image and text retrieval technologies that allows users to visit art collections on the Web. The virtual museum is composed of a series of rooms that the user can visit. The connections between the rooms are variable: passing from a room to another yields a result which is the outcome of a query, and depends on the query criterion t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A fast method of reconstructing high-resolution panoramic stills from MPEG-compressed video

    Publication Year: 1998, Page(s):99 - 104
    Cited by:  Papers (1)  |  Patents (3)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (384 KB)

    Creating high quality still pictures from video presents a challenging problem due to the low spatial resolution of most video signals. Many algorithms have been proposed in the literature that utilize multiple video frames to increase spatial resolution. These algorithms depend on two critical assumptions: first, that the scene does not change significantly in the temporal vicinity of the frame o... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Real-time spontaneous interaction system with narratives

    Publication Year: 1998, Page(s):247 - 252
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (404 KB)

    Real-time spontaneous interaction is the basis of our active behavior such as communication. On the other hand, narratives in movies or novels give us the opportunity to experience dramatic events which are not encountered in our daily lives. The integration of these factors is expected to provide us with a new type of experience. In this paper, by integrating movies and interaction technologies, ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Enhancements to the error resilient entropy code

    Publication Year: 1998, Page(s):456 - 461
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (492 KB)

    The error resilient entropy code (EREC) has been well known for some time. It is a method of encoding the overall prefix codes which result from many forms of image compression algorithm such that they are largely independent of each other. Any error introduced into the data, for example during transmission over a radio link, will thus only affect a small amount of data. This paper discusses metho... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Image coding ringing artifact reduction using morphological post-filtering

    Publication Year: 1998, Page(s):628 - 633
    Cited by:  Papers (27)  |  Patents (7)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (452 KB)

    Ringing is an annoying artifact frequently encountered in low bit-rate transform and subband decomposition based compression of different media such as image, intra frame video and graphics. A mathematical morphology based post-processing algorithm is presented in this paper for image ringing artifact suppression. First, we use binary morphological operators to isolate the regions of an image wher... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Facial expression recognition using HMM with observation dependent transition matrix

    Publication Year: 1998, Page(s):89 - 95
    Cited by:  Papers (4)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (336 KB)

    An expression recognition technique is proposed based on the hidden Markov models (HMM) ability to deal with time sequential data and to provide time scale invariability as well as a learning capability. A feature vector sequence is used for this purpose, which relies on optical flow extraction, as well as directional filtering of the motion field. Segmentation and identification of important faci... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Towards robust, content based techniques for image authentication

    Publication Year: 1998, Page(s):297 - 302
    Cited by:  Papers (26)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (380 KB)

    This paper is focused on image authentication, as the process of evaluating the integrity of image contents relative to the original picture and of being able to detect, in an automatic way, malevolent image modifications. The paper begins with a description of a general framework for content based authentication of images and video. Then, a specific method is proposed. It relies on image edges an... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Emu: an e-mail preprocessor for text-to-speech

    Publication Year: 1998, Page(s):239 - 244
    Cited by:  Papers (2)  |  Patents (31)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (336 KB)

    E-mail reading is one of the most important commercial applications of text-to-speech synthesis (TTS). Yet e-mail is one of the most difficult types of text to deal with, since it is both highly structured -frequently containing elements such as tables, signatures, “artwork” and quotations from previous messages; and at the same time often lacks any reliable unambiguous indicators for ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A framework for interactive courses and virtual laboratories

    Publication Year: 1998, Page(s):501 - 506
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (472 KB)

    Theory and experimentation are both complementary in sciences. Where the former builds up a formal framework, the latter helps humans to develop their intuition. Hard-printed books are great supports for theorems and formulas repositories, but stay desperately static when providing examples. This paper presents some directions towards a framework for digital publishing, distance learning and compu... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Linear discriminant analysis for speechreading

    Publication Year: 1998, Page(s):221 - 226
    Cited by:  Papers (9)  |  Patents (6)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (336 KB)

    This paper investigates the use of Fisher-Rao (1965) linear discriminant analysis (LDA) as a means of visual feature extraction for hidden Markov model based automatic speechreading. For every video frame, a three-dimensional region of interest containing the speaker's mouth over a sequence of adjacent frames is lexicographically arranged into a data vector. Such vectors are then projected onto th... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Head-related transfer function synthesis for immersive audio

    Publication Year: 1998, Page(s):155 - 160
    Cited by:  Papers (5)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (236 KB)

    Immersive audio systems are being envisioned for applications that include teleconferencing and telepresence; augmented and virtual reality for manufacturing and entertainment; air traffic control, pilot warning, and guidance systems; displays for the visually- or aurally-impaired; home entertainment; distance learning; and professional sound and picture editing for television and film. The princi... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Frame-rate up-conversion using transmitted true motion vectors

    Publication Year: 1998, Page(s):622 - 627
    Cited by:  Papers (18)  |  Patents (22)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (416 KB)

    In this paper, we present a video frame-rate up-conversion scheme that uses transmitted true motion vectors for motion-compensated interpolation. In a past work, we demonstrated that a neighborhood-relaxation motion tracker can provide more accurate true motion information than a conventional minimal-residue block-matching algorithm. Although the technique to estimate the true motion vectors is a ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Joint source channel coding with hybrid FEC/ARQ for buffer constrained video transmission

    Publication Year: 1998, Page(s):567 - 572
    Cited by:  Papers (9)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (268 KB)

    We propose an automatic repeat request (ARQ)/forward error correction (FEC) scheme for synchronous transmission of video over a binary symmetric constant rate channel. The approach consists of jointly allocating source and channel rates to video blocks from a given admissible set subject to the buffer or equivalently end-end delay constraints. The channel codes used are the popular class of powerf... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Emotion recognition from audiovisual information

    Publication Year: 1998, Page(s):83 - 88
    Cited by:  Papers (18)  |  Patents (26)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (300 KB)

    We report preliminary results on emotion recognition by machine from joint audiovisual input of facial video and speech. The results show potential advantages in using both modalities over either modality alone. The recognition rate for audio alone is about 75% and video alone about 70%. Using audiovisual data we achieved 97% without increasing the number of features. The improvement in performanc... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Combining vocal and visual cues in an identity verification system using K-NN based classifiers

    Publication Year: 1998, Page(s):59 - 64
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (296 KB)

    The contribution of this paper is twofold: (1) to formulate a fusion problem encountered in the design of a multi-modal identity verification system as a particular classification problem, (2) to propose a simple classifier to solve this problem. The multi-modal identity verification system under consideration is built of d modalities in parallel, each one delivering as output a scalar number, cal... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A buyer-seller watermarking protocol

    Publication Year: 1998, Page(s):291 - 296
    Cited by:  Papers (5)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (328 KB)

    Digital watermarks have previously been proposed for the purpose of copy protection and copy deterrence for multimedia content. Copy deterrence using digital watermarks is achieved by inserting a unique watermark into each copy of the watermark content sold which could be used to trace unauthorized copies to the erring buyer. One problem with such an approach is the fact that the buyer whose water... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • An audio-visual distance for audio-visual speech vector quantization

    Publication Year: 1998, Page(s):523 - 528
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (300 KB)

    Speech is both an acoustic and a visual signal, and there exists some complementarity and redundancy between the two modalities. In the speech coding domain, it is of great interest to use this redundancy to improve speech coder performance. In this paper, we consider some audio and video joint coding process based on an audio-visual vector quantization. The method is shown to exploit quite well t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • An algorithm for unsupervised color image segmentation

    Publication Year: 1998, Page(s):33 - 38
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (396 KB)

    In a great number of fields of computer vision image segmentation plays an essential role as a preliminary step towards further and higher levels of image processing. An original algorithm for unsupervised segmentation of color images is here presented; it resorts to a palletized representation of images, it uses the low spatial frequency content of color which is represented in the CIELUV space a... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Multimedia adaptation for dynamic environments

    Publication Year: 1998, Page(s):203 - 208
    Cited by:  Papers (4)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (384 KB)

    This paper presents a new approach for adapting a multimedia presentation into heterogeneous and dynamically changing environments. A novel media adaptation taxonomy is proposed that includes synchronized media optimization and scaling. The taxonomy incorporates multi-dimensional interaction between media, service requirements and presentation. Network properties, terminal types and integration ca... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Noise reduction algorithms employing an intelligent inference engine for multimedia applications

    Publication Year: 1998, Page(s):125 - 130
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (248 KB)

    Two approaches to noise reduction are presented, namely the spectral subtraction system and the perceptual coding algorithm allowing to diminish audible noise. Both systems are controlled by an intelligent inference engine based on fuzzy logic. An extension of perceptual coding applications was proposed and verified experimentally with regard to noise removal originally present in acoustic signals... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.