IEEE Transactions on Speech and Audio Processing

Issue 4 • May 2002

Filter Results

Displaying Results 1 - 4 of 4
  • Speaker recognition with polynomial classifiers

    Publication Year: 2002, Page(s):205 - 212
    Cited by:  Papers (89)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (309 KB) | HTML iconHTML

    Modern speaker recognition applications require high accuracy at low complexity. We propose the use of a polynomial-based classifier to achieve these objectives. This approach has several advantages. First, polynomial classifier scoring yields a system which is highly computationally scalable with the number of speakers. Second, a new training algorithm is proposed which is discriminative, handles... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Convergence analysis of a twin-reference complex least-mean-squares algorithm

    Publication Year: 2002, Page(s):213 - 221
    Cited by:  Papers (17)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (375 KB) | HTML iconHTML

    In many noise control applications, the noise is dominated by low frequencies and generated by several independent periodic sources. In such situations the tonal noise may be suppressed by using a narrowband multiple-reference feedforward controller. The performance characteristics of the control system, e.g., the convergence behavior and noise reduction are directly related to the controller adap... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A joint source-channel speech coder using hybrid digital-analog (HDA) modulation

    Publication Year: 2002, Page(s):222 - 231
    Cited by:  Papers (8)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (339 KB) | HTML iconHTML

    A joint source-channel coding system for transmitting speech on a bandlimited additive white Gaussian noise (AWGN) channel is presented. The proposed method uses a hybrid of digital and analog modulation techniques. The digital part of the system consists of a Federal Standard 1016 code-excited linear predictive (FS 1016 CELP) speech coder followed by a rate-3/5 parallel concatenated (turbo) error... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Improved generalization of MCE parameter estimation with application to speech recognition

    Publication Year: 2002, Page(s):232 - 239
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (276 KB) | HTML iconHTML

    Discriminative training of hidden Markov models (HMMs) using minimum classification error training (MCE) has been shown to work well for certain speech recognition applications. MCE is, however, somewhat prone to overspecialization. This study investigates various techniques which improve performance and generalization of the MCE algorithm. Improvements of up to 10% in relative error rate on the t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.

Aims & Scope

Covers the sciences, technologies and applications relating to the analysis, coding, enhancement, recognition and synthesis of audio, music, speech and language.

This Transactions ceased publication in 2005. The current retitled publication is IEEE/ACM Transactions on Audio, Speech, and Language Processing.

Full Aims & Scope