Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on

15-15 May 1998

Go

Filter Results

Displaying Results 1 - 25 of 166
  • Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing [front matter]

    Publication Year: 1998, Page(s):i - lxiii
    Request permission for commercial reuse | |PDF file iconPDF (3916 KB)
    Freely Available from IEEE
  • A new subband perceptual audio coder using CELP

    Publication Year: 1998, Page(s):3661 - 3664 vol.6
    Cited by:  Papers (1)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (307 KB)

    This paper presents an audio coding system which uses filter banks to decompose, in the frequency domain, the audio signal into constant width subbands. A specific compression is applied in each subband. This compression is achieved by means of CELP coders. In order to obtain a high audio quality, psychoacoustic models allocate dynamically the number of bits needed in each subband. A particular ca... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Extraction of detailed image regions for content-based image retrieval

    Publication Year: 1998, Page(s):3713 - 3716 vol.6
    Cited by:  Papers (2)  |  Patents (3)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (708 KB)

    We present a technique for coarsely extracting the regions of natural color images which contain directional detail, e.g., edges, texture, etc., which we then use for image database indexing. As a measure of color activity, we use a perceptually modified distance measure based on the sum-of-angles criterion. We then apply histogram thresholding techniques to separate the image into smooth color re... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Author index

    Publication Year: 1998, Page(s):A1 - A12
    Request permission for commercial reuse | |PDF file iconPDF (1083 KB)
    Freely Available from IEEE
  • Digital processing of affective signals

    Publication Year: 1998, Page(s):3749 - 3752 vol.6
    Cited by:  Papers (31)  |  Patents (7)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (336 KB)

    Affective signal processing algorithms were developed to allow a digital computer to recognize the affective state of a user who is intentionally expressing that state. This paper describes the method used for collecting the training data, the feature extraction algorithms used and the results of pattern recognition using a Fisher linear discriminant and the leave one out test method. Four physiol... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Text-to-visual speech synthesis based on parameter generation from HMM

    Publication Year: 1998, Page(s):3745 - 3748 vol.6
    Cited by:  Papers (18)  |  Patents (5)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (340 KB)

    This paper presents a new technique for synthesizing visual speech from arbitrarily given text. The technique is based on an algorithm for parameter generation from HMM with dynamic features, which has been successfully applied to text-to-speech synthesis. In the training phase, syllable HMMs are trained with visual speech parameter sequences that represent lip movements. In the synthesis phase, a... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • MEMIS-MHEG Environment for Multimedia Information and Simulation

    Publication Year: 1998, Page(s):3813 - 3816 vol.6
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (348 KB)

    MHEG represents a new multimedia and hypermedia standard proposed by ISO/IEC. This paper presents a new software authoring environment based around MHEG-5 that offers users a vehicle for creating multimedia applications that can interact with external programs which involve intense computational tasks. MEMIS provides a linkage between a multimedia front-end and externally available computational p... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Conditional maximum likelihood frequency estimation for staggered modulations

    Publication Year: 1998, Page(s):3425 - 3428 vol.6
    Cited by:  Papers (6)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (352 KB)

    The use of spectrally efficient continuous phase modulations for mobile communications may lead to a serious performance degradation of the classical frequency error detectors (FEDs) due to the presence of self-noise. This article presents a new statistically efficient frequency estimation algorithm for staggered modulations. The cancellation of the self-noise is accomplished by the use of the con... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Blind carrier synchronization and channel identification for OFDM communications

    Publication Year: 1998, Page(s):3509 - 3512 vol.6
    Cited by:  Papers (21)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (292 KB)

    In OFDM communications, the loss of orthogonality due to carrier offset must be compensated before DFT-based demodulation can be performed. We present a high accuracy blind carrier offset estimation algorithm and a blind channel equalizer which exploit the intrinsic structure information of OFDM signals. The latter method allows the receiver to perform coherent demodulation in changing environment... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A hidden Markov model framework for video segmentation using audio and image features

    Publication Year: 1998, Page(s):3741 - 3744 vol.6
    Cited by:  Papers (65)  |  Patents (50)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (356 KB)

    This paper describes a technique for segmenting video using hidden Markov models (HMM). Video is segmented into regions defined by shots, shot boundaries, and camera movement within shots. Features for segmentation include an image-based distance between adjacent video frames, an audio distance based on the acoustic difference in intervals just before and after the frames, and an estimate of motio... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Protocols for real-time multimedia data transmission over the Internet

    Publication Year: 1998, Page(s):3809 - 3812 vol.6
    Cited by:  Papers (2)  |  Patents (17)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (380 KB)

    The explosive growth of the Internet and the intranets have attracted a great deal of attention to the implementation and performance of networked multimedia services, which involve the transport of real-time multimedia data streams over non-guaranteed quality of service (QoS) networks based on the Internet Protocol (IP). In this paper, the author presents an overview of the existing architectural... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Optimal truncation time for matched filter array processing

    Publication Year: 1998, Page(s):3629 - 3632 vol.6
    Cited by:  Papers (8)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (356 KB)

    Matched filter array processing (MFA) has been shown to improve the signal-to-noise (SNR) quality for array speech capture in reverberant environments. However, under non-optimum conditions, MFA processing is computationally costly, and may produce little improvement or even subjective quality degradation as compared with simple time delay compensation (TDC). Appropriate truncation of the MFA filt... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Estimation of FM modulation of multi-component signals from the Fourier phase

    Publication Year: 1998, Page(s):3421 - 3424 vol.6
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (416 KB)

    The spectral phase is a quantity which is normally discarded in analyzing signals. The concept of a complex time-frequency representation is presented in which the rows are narrow bandpass filters and the columns are broadband Fourier spectra. Methods are developed which exploit the spectral phase of the surface to recover the FM modulating function of an FM modulated tone and an FM modulated mult... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Nonmaximally decimated filterbank based precoder/post-equalizer for blind channel identification and optimal MMSE equalization

    Publication Year: 1998, Page(s):3505 - 3508 vol.6
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (320 KB)

    A novel nonmaximally decimated multirate filterbank structure is proposed for blind identification of communication channels. This structure is shown to be very similar to a form proposed earlier in the literature. The proposed blind channel identification algorithm is not sensitive to the characteristics of the unknown channel, including mixed phase and zeros on the unit circle. An optimal minimu... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Optimum delay and mean square error using CMA

    Publication Year: 1998, Page(s):3361 - 3364 vol.6
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (340 KB)

    The performance of the constant modulus algorithm can suffer because of the existence of local minima with large mean squared error (MSE). This paper presents a new way of obtaining the optimum MSE over all delays using a second equalizer under a mixed constant modulus and cross correlation algorithm (CM-CCA). Proof of convergence is obtained for the noiseless case. Simulations demonstrate the pot... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A hybrid real-time face tracking system

    Publication Year: 1998, Page(s):3737 - 3740 vol.6
    Cited by:  Papers (35)  |  Patents (8)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (428 KB)

    A hybrid real-time face tracker based on both sound and visual cues is presented. Initial talker locations are estimated acoustically from microphone array data while precise localization and tracking are derived from image information. A computationally efficient algorithm for face detection via motion analysis is employed to track individual faces at rates up to 30 frames per second. The system ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Natural and synthetic video in MPEG-4

    Publication Year: 1998, Page(s):3805 - 3808 vol.6
    Cited by:  Papers (6)  |  Patents (8)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (376 KB)

    The ISO MPEG Committee, after successful completion of the MPEG-1 and the MPEG-2 standards, has recently completed the Committee Draft for MPEG-4, its third standard. MPEG-4 is designed to be an object-based standard for multimedia coding. The visual part of the standard specifies coding of both natural and synthetic video. The MPEG-4 visual standard supports coding of natural video not only in a ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Personal computer software vowel training aid for the hearing impaired

    Publication Year: 1998, Page(s):3625 - 3628 vol.6
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (304 KB)

    A vowel training aid system for hearing impaired persons which uses a Windows-based multimedia computer has been developed. The system provides two main displays which give visual feedback for vowels spoken in isolation and short word contexts. Feature extraction methods and neural network processing techniques provide a high degree of accuracy for speaker independent vowel training. The system ty... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • MPEG audio bit rate scaling on coded data domain

    Publication Year: 1998, Page(s):3669 - 3672 vol.6
    Cited by:  Papers (5)  |  Patents (33)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (348 KB)

    Formerly, once the audio data is compressed, transcoding is used to scale the bit rate, where decoding and re-encoding are taking place. Therefore, data manipulation of coded data has been very complex and time consuming work. We describe three algorithms for bit rate scaling in the coded MPEG data domain. One is a bandwidth limitation method cutting higher frequency components until the target da... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Applications of blind equalization in wireless ATM network

    Publication Year: 1998, Page(s):3277 - 3280 vol.6
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (344 KB)

    We investigated the feasibility of applying blind equalization to wireless ATM networks. Making use of the information exploited from the wireless ATM cell structure and medium access control (MAC), blind channel estimation together with a non-linear data directed estimator achieve good equalization performance without transmitting extra preamble. Simulation results are presented for ATM constant ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Constrained equalizers and precoding for magnetic storage channels

    Publication Year: 1998, Page(s):3481 - 3484 vol.6
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (260 KB)

    The nonlinear write process of magnetic recording allows to write the symbols ±1 only. The magnetic channel is a differentiating channel. The locations of the transitions from +1 to -1 and vice versa in the input signal to the magnetic channel are important for the received waveform. This paper defines a noise enhancement constrained, finite dimensional equalizer. This equalizer trades some... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Pilot symbol assisted diversity reception for a fading channel

    Publication Year: 1998, Page(s):3417 - 3420 vol.6
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (248 KB)

    Pilot symbol assisted modulation is a promising scheme to mitigate the effect of fading in a wireless channel. Analytical results for the performance of this scheme are available. Although the use of diversity is known to improve the performance of receivers used in fading channels, pilot symbol assisted diversity reception has not been studied. In this paper, we derive an exact probability of err... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Two-stage code reference beamformer in mobile communications

    Publication Year: 1998, Page(s):3309 - 3312 vol.6
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (352 KB)

    This paper addresses a new architecture for blind adaptive beamforming when dealing with frequency hopping (FH) modulation in cellular mobile communications systems. The proposed code reference beamformer (CRB) takes advantage of the inherent frequency diversity to estimate beforehand the noise plus interference correlation matrix, which is employed as the first part of the framework. Then, a seco... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • QoS considerations for DMT-based ADSL and VDSL systems

    Publication Year: 1998, Page(s):3437 - 3440 vol.6
    Cited by:  Papers (3)  |  Patents (11)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (340 KB)

    Thanks to their high bandwidth capability, asymmetric digital subscriber lines (ADSL) and very high speed DSL (VDSL) are access technologies that permit the transmission of several applications simultaneously on telephone subscriber lines. Considering that these applications may require a different quality of service (QoS), and particularly different bit error rates (BER), for transmission, this p... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • H equalization of communication channels

    Publication Year: 1998, Page(s):3489 - 3492 vol.6
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract |PDF file iconPDF (280 KB)

    As an alternative to existing techniques and algorithms, we investigate the merit of the H-infinity approach to the equalization of communication channels. We first look at causal H-infinity equalization problem and then look at the improvement due to finite delay. By introducing the risk sensitive property, we compare the average performance of the central H-infinity equalizer with the MMSE equal... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.