By Topic

Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on

15-15 May 1998

Go

Filter Results

Displaying Results 1 - 25 of 166
  • Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing [front matter]

    Publication Year: 1998, Page(s):i - lxiii
    Request permission for commercial reuse | PDF file iconPDF (3916 KB)
    Freely Available from IEEE
  • A new subband perceptual audio coder using CELP

    Publication Year: 1998, Page(s):3661 - 3664 vol.6
    Cited by:  Papers (1)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (307 KB)

    This paper presents an audio coding system which uses filter banks to decompose, in the frequency domain, the audio signal into constant width subbands. A specific compression is applied in each subband. This compression is achieved by means of CELP coders. In order to obtain a high audio quality, psychoacoustic models allocate dynamically the number of bits needed in each subband. A particular ca... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Extraction of detailed image regions for content-based image retrieval

    Publication Year: 1998, Page(s):3713 - 3716 vol.6
    Cited by:  Papers (2)  |  Patents (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (708 KB)

    We present a technique for coarsely extracting the regions of natural color images which contain directional detail, e.g., edges, texture, etc., which we then use for image database indexing. As a measure of color activity, we use a perceptually modified distance measure based on the sum-of-angles criterion. We then apply histogram thresholding techniques to separate the image into smooth color re... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Author index

    Publication Year: 1998, Page(s):A1 - A12
    Request permission for commercial reuse | PDF file iconPDF (1083 KB)
    Freely Available from IEEE
  • MEMIS-MHEG Environment for Multimedia Information and Simulation

    Publication Year: 1998, Page(s):3813 - 3816 vol.6
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (348 KB)

    MHEG represents a new multimedia and hypermedia standard proposed by ISO/IEC. This paper presents a new software authoring environment based around MHEG-5 that offers users a vehicle for creating multimedia applications that can interact with external programs which involve intense computational tasks. MEMIS provides a linkage between a multimedia front-end and externally available computational p... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Protocols for real-time multimedia data transmission over the Internet

    Publication Year: 1998, Page(s):3809 - 3812 vol.6
    Cited by:  Papers (2)  |  Patents (17)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (380 KB)

    The explosive growth of the Internet and the intranets have attracted a great deal of attention to the implementation and performance of networked multimedia services, which involve the transport of real-time multimedia data streams over non-guaranteed quality of service (QoS) networks based on the Internet Protocol (IP). In this paper, the author presents an overview of the existing architectural... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Blind linearization, and identification of nonlinear systems-a least squares, Pth order inverse approach

    Publication Year: 1998, Page(s):3413 - 3416 vol.6
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (308 KB)

    A deterministic approach to blind nonlinear channel equalization and identification is presented. This approach applies to nonlinear channels that can be approximately linearized by finite memory, finite order Volterra filters. Both the Volterra equalizers and the linearized channels are identified. This method also applies to blind identification of linear IIR channels. General conditions for exi... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Multiresolution sinusoidal modeling for wideband audio with modifications

    Publication Year: 1998, Page(s):3585 - 3588 vol.6
    Cited by:  Papers (12)  |  Patents (10)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (384 KB)

    We describe a computationally efficient method of generating more accurate sinusoidal parameters {amplitude, frequency, phase} from a wideband polyphonic audio source in a multiresolution, non-aliased fashion. This significantly improves upon previous work of sinusoidal modeling that assumes a single-pitched monophonic source, such as speech or an individual musical instrument, while using approxi... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Natural and synthetic video in MPEG-4

    Publication Year: 1998, Page(s):3805 - 3808 vol.6
    Cited by:  Papers (4)  |  Patents (8)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (376 KB)

    The ISO MPEG Committee, after successful completion of the MPEG-1 and the MPEG-2 standards, has recently completed the Committee Draft for MPEG-4, its third standard. MPEG-4 is designed to be an object-based standard for multimedia coding. The visual part of the standard specifies coding of both natural and synthetic video. The MPEG-4 visual standard supports coding of natural video not only in a ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Blind equalization of I.I.R. single-input multiple-output channels with common zeros using second-order statistics

    Publication Year: 1998, Page(s):3409 - 3412 vol.6
    Cited by:  Papers (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (324 KB)

    The problem of blind equalization of SIMO (single-input multiple-output) communications channels is considered using only the second-order statistics of the data. Such models arise when single receiver data is fractionally sampled (assuming that there is excess bandwidth), or when an antenna array is used with or without fractional sampling. We focus on the direct design of finite-length MMSE (min... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Computational reduction during idle transmission in DSL modems

    Publication Year: 1998, Page(s):3441 - 3444 vol.6
    Cited by:  Patents (7)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (316 KB)

    This document describes two methods for reducing computational requirements during idle transmission in remote access systems incorporating digital subscriber line (DSL) modems, including asymmetrical DSL (ADSL) systems. These methods save processing power during idle transmission by generating an idle signal using low-complexity techniques. The generated idle signal is made spectrally compatible ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Two-stage code reference beamformer in mobile communications

    Publication Year: 1998, Page(s):3309 - 3312 vol.6
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (352 KB)

    This paper addresses a new architecture for blind adaptive beamforming when dealing with frequency hopping (FH) modulation in cellular mobile communications systems. The proposed code reference beamformer (CRB) takes advantage of the inherent frequency diversity to estimate beforehand the noise plus interference correlation matrix, which is employed as the first part of the framework. Then, a seco... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A new blind zeroforcing equalizer for multichannel systems

    Publication Year: 1998, Page(s):3177 - 3180 vol.6
    Cited by:  Papers (7)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (312 KB)

    Blind channel equalization has been a very active research topic due to its potential application in mobile communications and digital TV systems. In this paper, we present a new blind zero-forcing equalizer that utilizes second order statistics from the multi-channel configuration. The algorithm is simple and relies only on nullspace decomposition. It can actively select the desired delay of the ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • GMDFα with adaptive reconstruction filters and zero throughput delay

    Publication Year: 1998, Page(s):3553 - 3556 vol.6
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (316 KB)

    With reduction of the block size (increasing the number of subfilters) regular generalized multi-delay frequency domain adaptive filter (GMDFα) can achieve low throughput delay at the expense of the system performance. In situations where zero delay is desirable, we propose a new method which is not dependent on the block size. In addition, by using an adaptive reconstruction filter, further... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Discriminative training of HMM stream exponents for audio-visual speech recognition

    Publication Year: 1998, Page(s):3733 - 3736 vol.6
    Cited by:  Papers (47)  |  Patents (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (372 KB)

    We propose the use of discriminative training by means of the generalized probabilistic descent (GPB) algorithm to estimate hidden Markov model (HMM) stream exponents for audio-visual speech recognition. Synchronized audio and visual features are used to respectively train audio-only and visual-only single-stream HMMs of identical topology by maximum likelihood. A two-stream HMM is then obtained b... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • The MPEG-4 Structured Audio standard

    Publication Year: 1998, Page(s):3801 - 3804 vol.6
    Cited by:  Papers (4)  |  Patents (14)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (392 KB)

    The MPEG-4 standard defines numerous tools that represent the state-of-the-art in representation, transmission, and decoding of multimedia data. Among these is a new type of audio standard, termed “Structured Audio”. The MPEG-4 standard for structured audio allows for the efficient, flexible description of synthetic music and sound effects, and the use of synthetic sound in synchroniza... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Fast blind identification of FIR communications channels

    Publication Year: 1998, Page(s):3405 - 3408 vol.6
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (356 KB)

    This contribution describes a fast frequency domain approach for blind channel identification which does not rely on the statistics of the symbols. The proposed approach is based on the so-called “intraspectral relations” of DFTs of PAM fractionally sampled signals. The use of DFTs is allowed under certain conditions commonly encountered in data communication systems. The intraspectral... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Multi-pitch estimation for polyphonic musical signals

    Publication Year: 1998, Page(s):3565 - 3568 vol.6
    Cited by:  Papers (7)  |  Patents (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (368 KB)

    Automatic score transcription goal is to achieve an score-like (notes pitches through time) representation from musical signals. Reliable pitch extraction methods for monophonic signals exist, but polyphonic signals are much more difficult, often ambiguous, to analyze. We propose a computationally efficient technique for automatic recognition of notes from a polyphonic signal. It looks for correct... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • QoS considerations for DMT-based ADSL and VDSL systems

    Publication Year: 1998, Page(s):3437 - 3440 vol.6
    Cited by:  Papers (3)  |  Patents (11)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (340 KB)

    Thanks to their high bandwidth capability, asymmetric digital subscriber lines (ADSL) and very high speed DSL (VDSL) are access technologies that permit the transmission of several applications simultaneously on telephone subscriber lines. Considering that these applications may require a different quality of service (QoS), and particularly different bit error rates (BER), for transmission, this p... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A fast blind source separation for digital wireless applications

    Publication Year: 1998, Page(s):3305 - 3308 vol.6
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (328 KB)

    The problem of blindly estimating multiple digital co-channel communication signals using an antenna array is studied in the presence of multipath fading. We develop a fast sequential-estimation algorithm for separating multi-user signals based on the geometric observation made by Kansen and Xu (see Proc. ICASSP'96, p.1085-8, Atlanta, GA, 1996). When the signals are constrained to a finite alphabe... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Blind 2-D RAKE receivers based on RLS-type space-time adaptive filtering for DS-CDMA system

    Publication Year: 1998, Page(s):3173 - 3176 vol.6
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (380 KB)

    We previously presented a blind 2-D RAKE receiver for CDMA that cancels strong multi-user access interference and optimally combines multipath. The weight vector yielding the optimum signal to interference plus noise ratio for bit decisions is the “largest” generalized eigenvector of the spatio-frequency (spatio-temporal) correlation matrix pencil. However, the eigen-analysis based alg... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Applications of blind equalization in wireless ATM network

    Publication Year: 1998, Page(s):3277 - 3280 vol.6
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (344 KB)

    We investigated the feasibility of applying blind equalization to wireless ATM networks. Making use of the information exploited from the wireless ATM cell structure and medium access control (MAC), blind channel estimation together with a non-linear data directed estimator achieve good equalization performance without transmitting extra preamble. Simulation results are presented for ATM constant ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Robust exponential modeling of audio signals

    Publication Year: 1998, Page(s):3581 - 3584 vol.6
    Cited by:  Papers (23)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (352 KB)

    We present a numerically robust method for modeling audio signals which is based on a exponential data model. This model is a generalization of the classical sinusoidal model in the sense that it allows the amplitude of the sinusoids to evolve exponentially. We show that, using this model, so called attacks can be represented very efficiently and we propose an algorithm for finding the exponential... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Design of channel optimized vector quantizers in the presence of channel mismatch

    Publication Year: 1998, Page(s):3465 - 3468 vol.6
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (328 KB)

    We propose algorithms to design channel-optimized vector quantizers in the presence of channel mismatch. We consider two cases: (i) no information about the statistics of the channel bit error rate is available and (ii) the probability density function of the channel bit error rate is known. We also consider the use of an estimate of the channel signal-to-noise ratio to improve the performance. Si... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • The effects of local scattering on direction of arrival estimation with MUSIC and ESPRIT

    Publication Year: 1998, Page(s):3333 - 3336 vol.6
    Cited by:  Papers (11)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (344 KB)

    In wireless communication scenarios, multipath propagation from local scatterers in the vicinity of mobile sources may cause angular spreading as seen from a base station antenna array. This paper studies the effects of such local scattering on direction of arrival (DOA) estimation with the MUSIC and ESPRIT algorithms. Previous work has considered rapidly time-varying scenarios, and concluded that... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.