1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175)

7-9 Dec. 1998

Filter Results

Displaying Results 1 - 25 of 102
  • 1998 IEEE Second Workshop on Multimedia Signal Processing (Cat. No.98EX175)

    Publication Year: 1998
    Request permission for commercial reuse | PDF file iconPDF (317 KB)
    Freely Available from IEEE
  • Robust H.263 video transmission over spectrally shaped channels using multicarrier modulation

    Publication Year: 1998, Page(s):585 - 590
    Cited by:  Papers (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (364 KB)

    This paper proposes an efficient and robust technique for transmission of entropy coded video over noisy channels, especially for spectrally shaped channels which are commonly used in ADSL environments. Combining multicarrier modulation and the error resilient entropy coding (EREC) technique together, we develop a new loading algorithm which achieves unequal error protection (UEP) by optimally all... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Author index

    Publication Year: 1998, Page(s):635 - 638
    Request permission for commercial reuse | PDF file iconPDF (103 KB)
    Freely Available from IEEE
  • Real-time spontaneous interaction system with narratives

    Publication Year: 1998, Page(s):247 - 252
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (404 KB)

    Real-time spontaneous interaction is the basis of our active behavior such as communication. On the other hand, narratives in movies or novels give us the opportunity to experience dramatic events which are not encountered in our daily lives. The integration of these factors is expected to provide us with a new type of experience. In this paper, by integrating movies and interaction technologies, ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Rhythm complexity measures for music pattern recognition

    Publication Year: 1998, Page(s):167 - 172
    Cited by:  Papers (6)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (304 KB)

    Three measures of rhythm complexity are considered. It is suggested that these measures be used in a system for machine recognition of music patterns as determinants of relative weights assigned to pitch and rhythm errors. The three measures are characterized and a procedure for determining parameters of one of the measures is described View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Emu: an e-mail preprocessor for text-to-speech

    Publication Year: 1998, Page(s):239 - 244
    Cited by:  Papers (2)  |  Patents (31)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (336 KB)

    E-mail reading is one of the most important commercial applications of text-to-speech synthesis (TTS). Yet e-mail is one of the most difficult types of text to deal with, since it is both highly structured -frequently containing elements such as tables, signatures, “artwork” and quotations from previous messages; and at the same time often lacks any reliable unambiguous indicators for ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Continually traffic accommodating Internet streaming video

    Publication Year: 1998, Page(s):403 - 408
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (308 KB)

    A new way to implement the Internet modem video transmission is presented. This system is capable of continually accommodating its bitstream size in response to changing network conditions. The key idea is to adopt an adaptive least mean squares (LMS) controller to orchestrates an H.263+ encoder rate control at the server end as well as a fast frame interpolation at the client end. It is demonstra... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A real-time encoding/decoding system (REDS) for HDTV editing

    Publication Year: 1998, Page(s):215 - 220
    Cited by:  Papers (1)  |  Patents (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (260 KB)

    A programmable and scalable parallel architecture is proposed for the real-time encoding/decoding of HDTV images and for nonlinear editing of the compressed video data. It only uses the intra-mode compression/decompression so that nonlinear editing can be performed easily and high-quality images can be recovered. Spatially partitioned image data are concurrently processed by multiple parallel proc... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Enhancements to the error resilient entropy code

    Publication Year: 1998, Page(s):456 - 461
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (492 KB)

    The error resilient entropy code (EREC) has been well known for some time. It is a method of encoding the overall prefix codes which result from many forms of image compression algorithm such that they are largely independent of each other. Any error introduced into the data, for example during transmission over a radio link, will thus only affect a small amount of data. This paper discusses metho... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A framework for interactive courses and virtual laboratories

    Publication Year: 1998, Page(s):501 - 506
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (472 KB)

    Theory and experimentation are both complementary in sciences. Where the former builds up a formal framework, the latter helps humans to develop their intuition. Hard-printed books are great supports for theorems and formulas repositories, but stay desperately static when providing examples. This paper presents some directions towards a framework for digital publishing, distance learning and compu... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Determining acoustic round trip delay for VoIP conferences

    Publication Year: 1998, Page(s):161 - 166
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (280 KB)

    This paper proposes a theoretical model and an ITU-T H.323 compliant empirical measurement method for the acoustic round trip delay (ARTD) of a voice over Internet protocol (VoIP) conference. The empirical measurement method is able to compute the complete acoustic round trip delay that the user perceives. The acoustic round trip delay is experimentally measured by slicing the real-time signal pat... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Implementation of optimized cache replenishment algorithms in a soft caching system

    Publication Year: 1998, Page(s):233 - 238
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (280 KB)

    We address practical issues which arise in the implementation of optimized cache replenishment algorithms within a “soft” caching framework. We study the algorithms that have been proposed for optimized soft caching and simulate them using actual proxy traces. Our objective is to determine what compromises have to be made in order to approximate the desired optimal performance while ma... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A watermarking with two signatures

    Publication Year: 1998, Page(s):394 - 399
    Cited by:  Papers (1)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (468 KB)

    A watermarking scheme is presented which embeds two different watermarks to the same frequency and extracts the marks with two different methods in the frequency domain. One of the two extraction methods needs a source image, whereas the other does not. Each extraction method uses a unique operation in extracting the watermark. The use of two watermark schemes is more effective in claiming rightfu... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Progressive coding of medical volumetric data using three-dimensional integer wavelet packet transform

    Publication Year: 1998, Page(s):553 - 558
    Cited by:  Papers (9)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (280 KB)

    We examine progressive lossy to lossless compression of medical volumetric data using three-dimensional (3D) integer wavelet packet transforms and set partitioning in hierarchical trees (SPIHT). To achieve good lossy coding performance, we describe a 3D integer wavelet packet transform that allows implicit bit shifting of wavelet coefficients to approximate a 3D unitary transformation. We also add... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Remote control and visualization of scanning probe microscopes via Web

    Publication Year: 1998, Page(s):209 - 214
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (420 KB)

    Our efforts in bringing the scanning probe microscope (SPM) live data available on the Web have been successful. This work is being pursued under the aegis of the IN-VSEE project targeted to bring SPM data live to community colleges, high school and other research labs/classroom. We have further advanced this to provide the capability of remote operation of the SPM via the Web. The truly remote op... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Estimating camera parameters from motion vectors of digital video

    Publication Year: 1998, Page(s):105 - 110
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (416 KB)

    In the development of a method of estimating the global camera motion from encoded video, we propose a robust estimation based on an outlier rejection scheme. The extended Kalman filter and the Levenberg-Marquardt method are implemented for the estimation. Both implementations show promising results, but from a computational aspect, the LM method is more efficient than the extended Kalman filter i... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Real-time lip tracking and bimodal continuous speech recognition

    Publication Year: 1998, Page(s):65 - 70
    Cited by:  Papers (23)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (360 KB)

    We investigate using a bimodal approach to speech recognition by incorporating additional visual features derived from lip movement of the speaker. A reference contour model is used to track the lip outline of the speaker. By using color, constraining the deformation in an affine subspace, and by incorporating an outlier rejection mechanism, our system is robust and runs in real time. To address t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Comparison of two objective speech quality measures: MBSD and ITU-T Recommendation P.861

    Publication Year: 1998, Page(s):426 - 431
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (300 KB)

    The modified bark spectral distortion (MBSD), used for an objective speech quality measure, estimates speech distortion in the loudness domain taking into account the noise masking threshold in order to include only audible distortions in the calculation of the distortion measure. Preliminary simulation results have shown improvement of the MBSD over the conventional BSD. In this paper, the perfor... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Head-related transfer function synthesis for immersive audio

    Publication Year: 1998, Page(s):155 - 160
    Cited by:  Papers (5)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (236 KB)

    Immersive audio systems are being envisioned for applications that include teleconferencing and telepresence; augmented and virtual reality for manufacturing and entertainment; air traffic control, pilot warning, and guidance systems; displays for the visually- or aurally-impaired; home entertainment; distance learning; and professional sound and picture editing for television and film. The princi... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Image coding ringing artifact reduction using morphological post-filtering

    Publication Year: 1998, Page(s):628 - 633
    Cited by:  Papers (27)  |  Patents (7)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (452 KB)

    Ringing is an annoying artifact frequently encountered in low bit-rate transform and subband decomposition based compression of different media such as image, intra frame video and graphics. A mathematical morphology based post-processing algorithm is presented in this paper for image ringing artifact suppression. First, we use binary morphological operators to isolate the regions of an image wher... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Mouth motion learning and generating from observation

    Publication Year: 1998, Page(s):415 - 420
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (332 KB)

    This paper presents a system for analyzing and generating human mouth motion. We apply model-based tracking to a set of typical mouth image sequences and obtain model motion sequences, which are used to build the mouth motion space by applying principal component analysis (PCA). Given an abstract description of the mouth motion in the mouth motion space, our system can generate a new mouth motion ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Active multimedia documents for mobile services

    Publication Year: 1998, Page(s):227 - 232
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (344 KB)

    Multimedia content is designed for many purposes and the services that are associated with them rely heavily on technologies that address representation, processing, and transmission of that content. This paper presents a new model that refines the multimedia documents for mobile environments while integrating information technology components to address strict requirements. We present new ways to... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Beyond query by example

    Publication Year: 1998, Page(s):3 - 8
    Cited by:  Papers (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (284 KB)

    This paper considers some of the problems we found trying to extract meaning from images in database applications, and proposes some ways to solve them. We argue that the meaning of an image is an ill-defined entity, and it is not in general possible to derive from an image the meaning that the user of the database wants. Rather, we should be content with a correlation between the intended meaning... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Building human face models from two images

    Publication Year: 1998, Page(s):117 - 122
    Cited by:  Papers (4)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (376 KB)

    We present a practical technique for building 3-D human face models from two photographs. Rather than using expensive 3-D scanners, we show that frontal face models can be faithfully reconstructed with unsophisticated digital cameras in a totally non-invasive setup. We propose a rectification algorithm based on the fundamental matrix by computing the dual of the point transformation matrix. The im... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Audiovisual speech enhancement: new advances using multi-layer perceptrons

    Publication Year: 1998, Page(s):77 - 82
    Cited by:  Papers (1)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (304 KB)

    This paper deals with the improvement of a noisy speech enhancement system based on the fusion of auditory and visual information. The system was presented in previous papers and implemented with a simple stimuli corrupted with white noise. Its principle consists of an analysis-enhancement-synthesis process based on a linear prediction (LP) model of the signal: the LP filter is enhanced thanks to ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.