Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on

15-15 May 1998

Go

Filter Results

Displaying Results 1 - 25 of 166
  • Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing [front matter]

    Publication Year: 1998, Page(s):i - lxiii
    Request permission for commercial reuse | PDF file iconPDF (3916 KB)
    Freely Available from IEEE
  • A new subband perceptual audio coder using CELP

    Publication Year: 1998, Page(s):3661 - 3664 vol.6
    Cited by:  Papers (1)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (307 KB)

    This paper presents an audio coding system which uses filter banks to decompose, in the frequency domain, the audio signal into constant width subbands. A specific compression is applied in each subband. This compression is achieved by means of CELP coders. In order to obtain a high audio quality, psychoacoustic models allocate dynamically the number of bits needed in each subband. A particular ca... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Extraction of detailed image regions for content-based image retrieval

    Publication Year: 1998, Page(s):3713 - 3716 vol.6
    Cited by:  Papers (2)  |  Patents (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (708 KB)

    We present a technique for coarsely extracting the regions of natural color images which contain directional detail, e.g., edges, texture, etc., which we then use for image database indexing. As a measure of color activity, we use a perceptually modified distance measure based on the sum-of-angles criterion. We then apply histogram thresholding techniques to separate the image into smooth color re... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Author index

    Publication Year: 1998, Page(s):A1 - A12
    Request permission for commercial reuse | PDF file iconPDF (1083 KB)
    Freely Available from IEEE
  • On the use of kernel structure for blind equalization

    Publication Year: 1998, Page(s):3393 - 3396 vol.6
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (352 KB)

    The mathematical theory of kernel (null space) structure of Hankel and Hankel-like matrices is applied to the problem of blind equalization of co-channel signals. This work builds on recently introduced ideas in blind equalization where the symbols are treated as deterministic parameters and estimated directly without estimating the channel first. The main contribution of the new approach is that ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Dichotic presentation of speech signal with critical band filtering for improving speech perception

    Publication Year: 1998, Page(s):3601 - 3604 vol.6
    Cited by:  Papers (8)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (276 KB)

    The reduction in the frequency resolving capacity of the auditory system due to the spread of masking of frequency components by neighboring components degrades speech perception in cases of sensorineural hearing impairment. We have carried out an experimental evaluation of splitting speech into two signals by using a bank of critical band filters, in order to reduce the effect of spectral masking... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A semi-blind channel identification method for GSM receivers

    Publication Year: 1998, Page(s):3389 - 3392 vol.6
    Cited by:  Papers (8)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (300 KB)

    In this paper, we present a semi-blind channel identification scheme for the GSM system. Even though the GMSK signal has almost zero excess bandwidth (oversampling will give no more information), two diversity channels for each GMSK signal can be generated using a de-rotation scheme without additional antennas. Based on this single input and two output system, the semi-blind algorithm is applied t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A semi-blind approach to structured channel equalization

    Publication Year: 1998, Page(s):3385 - 3388 vol.6
    Cited by:  Papers (8)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (360 KB)

    This paper describes a direct equalization approach for channels with some underlying structure. A semi-blind approach is taken here where a small amount of training symbols is available. A family of MMSE equalizers is obtained that includes some prior information about the channel structure. The channel structure assumed in this paper is that the channel vector lies approximately in the subspace ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • On combining watermarking with perceptual coding

    Publication Year: 1998, Page(s):3725 - 3728 vol.6
    Cited by:  Papers (16)  |  Patents (16)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (348 KB)

    A watermark is a data stream inserted into multimedia content. It contains information relevant to the ownership or authorized use of the content. A watermark which could be recovered without a priori knowledge of the identity of the content could be used by Web search mechanisms to flag unauthorized distribution of the content. Since media will be compressed on these sites, a mark detection algor... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A system for machine recognition of music patterns

    Publication Year: 1998, Page(s):3597 - 3600 vol.6
    Cited by:  Papers (11)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (388 KB)

    We introduce a system for machine recognition of music patterns. The problem is put into a pattern recognition framework in the sense that an error between a target pattern and scanned pattern is minimized. The error takes into account pitch and rhythm information. The pitch error measure consists of an absolute (objective) error and a perceptual error. The latter depends on an algorithm for estab... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A joint Viterbi algorithm to separate cochannel FM signals

    Publication Year: 1998, Page(s):3297 - 3300 vol.6
    Cited by:  Papers (4)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (340 KB)

    This paper presents a method for separating cochannel FM signals. We show that the Viterbi algorithm, traditionally limited to estimation of digital sequences, can jointly track analog FM signals by separately quantizing the derivatives of their instantaneous frequencies. We employ per-survivor processing in the trellis to estimate unknown channel effects. The approach works well when the signal t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Downlink beamforming avoiding DOA estimation for cellular mobile communications

    Publication Year: 1998, Page(s):3313 - 3316 vol.6
    Cited by:  Papers (31)  |  Patents (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (296 KB)

    A new technique to overcome the induced difficulties of FDD (frequency division duplex) for the design of a forward link beamformer in cellular mobile communications systems is presented. It takes advantage of the array topology at the basestation, used to transpose second order statistics of the propagation channel from the uplink frequency to the downlink frequency, thus enabling one to optimize... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Decoupled direction finding: detection

    Publication Year: 1998, Page(s):3329 - 3332 vol.6
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (304 KB)

    Antenna arrays are likely to be an important feature of future mobile communication systems. With an antenna array, mobile users can be separated by a spatial filtering procedure allowing several users on the same carrier frequency. The uplink part (mobile to base) not only can, but is better solved without using any spatial knowledge in terms of direction of arrival (DOA). However, DOA estimation... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Novel brick-wall filters based on the auditory system

    Publication Year: 1998, Page(s):3637 - 3640 vol.6
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (360 KB)

    A novel class of narrow band filters is presented that offers great rejection of out of band noise and a flat top at the peak. The filter is unconventional, as the output of such a filter is a series of spikes, like the action potential in the auditory nerve. The product of its time and frequency window is less than unity even using 100% output cutoff points. A bank of such filters has been used t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Subspace-based detection for CDMA communications

    Publication Year: 1998, Page(s):3201 - 3204 vol.6
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (316 KB)

    The oblique projection supports the framework to resolve a signal space into desired signal and interference subspaces. This paper presents subspace-based detection methods using the oblique projection for the CDMA channel. For the synchronous case, it is shown that this detector represents the geometrical form of the decorrelating detector, and performs a complete rejection of interfering signals... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A truly recursive blind equalization algorithm

    Publication Year: 1998, Page(s):3381 - 3384 vol.6
    Cited by:  Papers (4)  |  Patents (7)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (312 KB)

    This paper describes a new adaptive blind equalization algorithm based on a truly IIR structure that enables the correction of ISI over severely distorted channels. The recursive feedback filter is in lattice form to allow an easy monitoring of the filter stability. During blind training, the adaptation of the equalizer is carried out via the usual stochastic gradient algorithm by minimizing the S... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Personal computer software vowel training aid for the hearing impaired

    Publication Year: 1998, Page(s):3625 - 3628 vol.6
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (304 KB)

    A vowel training aid system for hearing impaired persons which uses a Windows-based multimedia computer has been developed. The system provides two main displays which give visual feedback for vowels spoken in isolation and short word contexts. Feature extraction methods and neural network processing techniques provide a high degree of accuracy for speaker independent vowel training. The system ty... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Blind and semi-blind maximum likelihood methods for FIR multichannel identification

    Publication Year: 1998, Page(s):3185 - 3188 vol.6
    Cited by:  Papers (27)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (368 KB)

    We investigate maximum likelihood (ML) methods for blind and semi-blind estimation of multiple FIR channels. Two blind deterministic ML (DML) strategies are presented. In the first one, we propose to modify the iterative quadratic ML (IQML) algorithm in order to “denoise” it and hence obtain consistent channel estimates. The second strategy, called pseudo-quadratic ML (PQML), is natura... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Digital image/video library and MPEG-7: standardization and research issues

    Publication Year: 1998, Page(s):3785 - 3788 vol.6
    Cited by:  Papers (7)  |  Patents (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (388 KB)

    Much research activity and interest has emerged in two closely related areas: the digital image/video library (DIVL) and MPEG-7. We review the critical research issues in DIVL from a signal processing viewpoint, the objectives and scope of MPEG-7, and the relationships between these two View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Multi-pitch estimation for polyphonic musical signals

    Publication Year: 1998, Page(s):3565 - 3568 vol.6
    Cited by:  Papers (7)  |  Patents (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (368 KB)

    Automatic score transcription goal is to achieve an score-like (notes pitches through time) representation from musical signals. Reliable pitch extraction methods for monophonic signals exist, but polyphonic signals are much more difficult, often ambiguous, to analyze. We propose a computationally efficient technique for automatic recognition of notes from a polyphonic signal. It looks for correct... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Text-to-visual speech synthesis based on parameter generation from HMM

    Publication Year: 1998, Page(s):3745 - 3748 vol.6
    Cited by:  Papers (18)  |  Patents (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (340 KB)

    This paper presents a new technique for synthesizing visual speech from arbitrarily given text. The technique is based on an algorithm for parameter generation from HMM with dynamic features, which has been successfully applied to text-to-speech synthesis. In the training phase, syllable HMMs are trained with visual speech parameter sequences that represent lip movements. In the synthesis phase, a... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • An integrated progressive image coding and watermark system

    Publication Year: 1998, Page(s):3721 - 3724 vol.6
    Cited by:  Papers (8)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (356 KB)

    The design of an integrated image coding and watermark system with the wavelet transform is examined in this work. First, the multi-threshold wavelet codec (MTWC) is used to achieve the image compression purpose. Unlike other embedded wavelet coders which use a single initial threshold in their successive approximate quantization (SAQ), MTWC adopts different initial thresholds in different subband... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Music recognition using note transition context

    Publication Year: 1998, Page(s):3593 - 3596 vol.6
    Cited by:  Papers (8)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (356 KB)

    As a typical example of sound-mixture recognition, the recognition of ensemble music is addressed. Here music recognition is defined as recognizing the pitch and the name of an instrument for each musical note in monaural or stereo recordings of real music performances. The first key part of the proposed method is adaptive template matching that can cope with variability in musical sounds. This is... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A method of optimizing source configuration in active control systems using Gram-Schmidt orthogonalization

    Publication Year: 1998, Page(s):3549 - 3552 vol.6
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (300 KB)

    A method for optimizing the number and the configuration of control sources in an active control system is proposed. In the optimization process, sources are selected one by one so that the corresponding transfer impedance vector is the most linearly independent. From the results of the simulation, it is shown that the optimized configuration yields not only a small average control error but also ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Co-channel FM voice separation via cross coupled phase locked loops

    Publication Year: 1998, Page(s):3293 - 3295 vol.6
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (184 KB)

    This paper presents the results of simulation experiments that successfully demonstrate FM co-channel voice separation via cross coupled phase locked loops (CCPLL). Unlike previous CCPLL studies which are typically restricted to the situation where the FM modulation waveforms are steady state sinusoidal, triangular, etc., we have empirically determined CCPLL loop parameters that provide for stable... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.