By Topic

IEEE Transactions on Speech and Audio Processing

Issue 4 • Date Jul 1995

Filter Results

Displaying Results 1 - 9 of 9
  • A mixed excitation LPC vocoder model for low bit rate speech coding

    Publication Year: 1995, Page(s):242 - 250
    Cited by:  Papers (152)  |  Patents (32)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (808 KB)

    Traditional pitch-excited linear predictive coding (LPC) vocoders use a fully parametric model to efficiently encode the important information in human speech. These vocoders can produce intelligible speech at low data rates (800-2400 b/s), but they often sound synthetic and generate annoying artifacts such as buzzes, thumps, and tonal noises. These problems increase dramatically if acoustic backg... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Evaluation of speech recognizers for speech training applications

    Publication Year: 1995, Page(s):229 - 241
    Cited by:  Papers (4)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1276 KB)

    The use of speech recognition technology for speech training represents an important and potentially very large application of speech technology. However, speech training places unique demands on recognizer performance that have not been well-characterized. In this research, a database and testing procedures were developed to evaluate two facets of recognizer performance integral to speech trainin... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A Bayesian approach to the restoration of degraded audio signals

    Publication Year: 1995, Page(s):267 - 278
    Cited by:  Papers (37)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (964 KB)

    In this paper we derive the a posteriori probability for the location of bursts of noise additively superimposed on a Gaussian AR process. The theory is developed to give a sequentially based restoration algorithm suitable for real-time applications. The algorithm is particularly appropriate for digital audio restoration, where clicks and scratches may be modelled as additive bursts of noise. Expe... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Reducing acoustic feedback in hearing aids

    Publication Year: 1995, Page(s):304 - 313
    Cited by:  Papers (83)  |  Patents (44)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (964 KB)

    Five techniques for reducing acoustic feedback in hearing aids were investigated: an adaptive notch filter, three previously described methods for adaptive feedback cancellation, and a novel method for feedback cancellation with adaptation during quiet intervals. Through real-time implementations, these techniques were assessed for added stable gain and sound quality. Test results showed the novel... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A signal subspace approach for speech enhancement

    Publication Year: 1995, Page(s):251 - 266
    Cited by:  Papers (413)  |  Patents (14)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1440 KB)

    A comprehensive approach for nonparametric speech enhancement is developed. The underlying principle is to decompose the vector space of the noisy signal into a signal-plus-noise subspace and a noise subspace. Enhancement is performed by removing the noise subspace and estimating the clean signal from the remaining signal subspace. The decomposition can theoretically be performed by applying the K... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • On estimating robust probability distribution in HMM-based speech recognition

    Publication Year: 1995, Page(s):279 - 285
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (644 KB)

    We present various methods for estimating a robust output probability distribution (PD) in speech recognition based on the discrete hidden Markov model (HMM). In speech recognition, we encounter the problem of an insufficient amount of training data, which may cause inaccurate modeling of the HMM parameters, especially the output PD's. In this paper, to enhance the robustness of the output PD's wi... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Efficient adaptive vector quantization of LPC parameters

    Publication Year: 1995, Page(s):314 - 317
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (324 KB)

    This correspondence presents a new two-stage adaptive vector quantizer of LSF parameters in LPC speech coding. The first codebook is adapted by a partition-delete operation, whereas the code-vectors of the second codebook remain unchanged. The objective and subjective evaluations show that the proposed scheme offers transparent quantization with 22 b/frame View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A comparison of signal processing front ends for automatic word recognition

    Publication Year: 1995, Page(s):286 - 293
    Cited by:  Papers (54)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (756 KB)

    This paper compares the word error rate of a speech recognizer using several signal processing front ends based on auditory properties. Front ends were compared with a control mel filter bank (MFB) based cepstral front end in clean speech and with speech degraded by noise and spectral variability, using the TI-105 isolated word database. MFB recognition error rates ranged from 0.5 to 26.9% in nois... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Criteria for positioning of sensors for a microphone array

    Publication Year: 1995, Page(s):294 - 303
    Cited by:  Papers (13)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (856 KB)

    Sensor positioning has an important influence on the performance of array processing. The main contributions of this paper are criteria to evaluate and to optimize the performance of an array of sensors through its geometric configuration-the number and the positions of sensors. The distance between the observed and the desired directivity patterns has been used as a criterion. We propose a genera... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.

Aims & Scope

Covers the sciences, technologies and applications relating to the analysis, coding, enhancement, recognition and synthesis of audio, music, speech and language.

 

This Transactions ceased publication in 2005. The current retitled publication is IEEE/ACM Transactions on Audio, Speech, and Language Processing.

Full Aims & Scope