2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421)

17-20 Sept. 2000

Filter Results

Displaying Results 1 - 25 of 56
  • 2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421)

    Publication Year: 2000
    Request permission for commercial reuse | PDF file iconPDF (367 KB)
    Freely Available from IEEE
  • An overview of text-to-speech synthesis

    Publication Year: 2000
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (34 KB)

    Summary form only given. The article gives an overview of text-to-speech (TTS) technology and a description of some issues of potential interest to speech coding experts. After motivation for the use of TTS technology, it describes the general architecture of a text-to-speech system with particular emphasis on the speech synthesis component. Both formant synthesis and concatenative synthesis are p... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Signal processing for cochlear implants and low-rate speech coding

    Publication Year: 2000
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (58 KB)

    Summary form only given. Cochlear implants are now established as a new option for individuals with profound (sensorineural) hearing impairment. Many of the cochlear implant patients are able to understand speech without lip-reading, and some can communicate over the phone. The success of cochlear implants can be attributed to the combined efforts of scientists from various disciplines including b... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Acoustic front-end processing for communication systems

    Publication Year: 2000
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (49 KB)

    Summary form only given. As communication systems have become more mobile and portable, we now have situations where audio communication in difficult acoustic environments is common. Speech coders at low bit rates tend to have problems with non-speech signals that are typically found in noisy acoustic environments. As a result, there can be degradation in the perceived audio quality for low-bit ra... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Trellis-based optimization of MPEG-4 advanced audio coding

    Publication Year: 2000, Page(s):142 - 144
    Cited by:  Papers (11)  |  Patents (7)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (339 KB)

    We outline a method to perform efficient low rate quantization for MPEG-4 advanced audio coding (AAC). The AAC bit stream consists of indices for quantized spectral coefficients as well as side information about quantizer step sizes and Huffman codebooks. The MPEG-4 Verification Model does not explicitly account for side information bits in its optimization and suffers from poor compression effici... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Author index

    Publication Year: 2000, Page(s): 157
    Request permission for commercial reuse | PDF file iconPDF (56 KB)
    Freely Available from IEEE
  • New objective measures for characterisation of noise suppression algorithms

    Publication Year: 2000, Page(s):23 - 25
    Cited by:  Papers (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (244 KB)

    We present two new objective quality measures for the assessment of the performance of noise suppression (NS) algorithms. The signal-to-noise ratio improvement (SNRI) measure attempts to characterise the capability of an NS method to enhance the speech component of a noisy speech signal from an additive background noise. The SNRI measure includes a segmentation of the input speech signal into thre... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Application of multidimensional scaling to subjective evaluation of coded speech

    Publication Year: 2000, Page(s):20 - 22
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (244 KB)

    We propose a new procedure for subjective evaluation of coded speech. This procedure has the potential of providing an anchorable measure of quality that contains more information than the single number provided by MOS testing. A stimulus space and the relationship between this space and speech quality are established with multidimensional scaling techniques in a large-scale listening test. In the... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • 4 kb/s improved multi-pulse based CELP speech coding with multiple location codebook and post-processing

    Publication Year: 2000, Page(s):17 - 19
    Cited by:  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (216 KB)

    This paper proposes an improved MP-CELP (Multi-Pulse-based CELP) speech coding at 4 kb/s. In MP-CELP, amplitudes or signs of multi-pulse excitation are simultaneonsly vector quantized (VQ). In order to improve speech quality for voiced speech, a multiple pulse location codebook is stored to enhance the coverage of the location. The optimum combination among the pulse location codebook, pulse ampli... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A novel approach to excitation coding in low-bit-rate high-quality CELP coders

    Publication Year: 2000, Page(s):14 - 16
    Cited by:  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (312 KB)

    A significant improvement in the efficiency of excitation coding with CELP at low bit rates is achieved by a new paradigm for encoding the fixed excitation. In the proposed scheme, the non-zero fixed-codebook excitation elements are substantially localized in a set of windows, with positions adaptive to the pitch peaks. Highly efficient coding is thus achieved by allocating most of the available e... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Exploiting simultaneously masked linear prediction in a WI speech coder

    Publication Year: 2000, Page(s):11 - 13
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (256 KB)

    This paper uses a method of incorporating simultaneous masking into the calculation of a linear predictive filter (SMLPC) as the front end to a 2 kbps waveform interpolation (WI) speech coder. A modification to the masking threshold calculation used in SMLPC is proposed. This modification improves the performance of SMLPC in noise like sections by placing greater emphasis on strongly voiced speech... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Design and performance of a 4.0 kbit/s speech coder based on frequency-domain interpolation

    Publication Year: 2000, Page(s):8 - 10
    Cited by:  Patents (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (248 KB)

    The 4.0 kbit/s speech codec described is based on a frequency domain interpolative (FDI) coding technique, which belongs to the class of prototype waveform interpolation (PWI) coding techniques. The codec also has an integrated voice activity detector (VAD) and a noise reduction capability. The input signal is subjected to LPC analysis and the prediction residual is separated into a slowly evolvin... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A novel algorithm for low bit rate speech compression using a hybrid LP-harmonics model

    Publication Year: 2000, Page(s):41 - 43
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (252 KB)

    We present a new LP-harmonic speech codec. At the coder speech signal is pre-processed, and an LP analysis is performed, together with pitch estimation and voicing decision. At the decoder and when the frame is voiced, the encoded parameters are used to estimate the spectrum envelope, extract and classify the harmonics as either strong or weak depending on their relative distance from multiples of... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • An adaptive multi rate wideband speech codec with adaptive gain re-quantization

    Publication Year: 2000, Page(s):145 - 147
    Cited by:  Papers (1)  |  Patents (16)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (328 KB)

    This paper describes an adaptive multi-rate wideband (AMR-WB) speech codec proposed for the GSM system and also for the evolving third generation (3G) mobile speech services. The coder is a multi rate SB-CELP (subband-code excited linear prediction) with five modes operating at bit rates from 24 kbit/s down to 9.1 kbit/s. Our basic approach consists of an unequal band-splitting of the input signal... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Coding of spectral magnitudes using optimized linear transformations

    Publication Year: 2000, Page(s):5 - 7
    Cited by:  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (224 KB)

    This paper introduces a novel vector quantization (VQ) technique, wherein the quantized vector is obtained by applying a linear transformation selected from a first codebook to a codevector selected from a second codebook. The transformation is selected from a family of linear transformations, represented by a matrix codebook. Vectors in the second codebook are called residual codevectors. In orde... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • PDF optimized parametric vector quantization of speech line spectral frequencies

    Publication Year: 2000, Page(s):87 - 89
    Cited by:  Papers (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (224 KB)

    A computationally efficient, high quality, vector quantization scheme based on a parametric probability density function (PDF) is developed for encoding speech line spectral frequencies (LSF). For this purpose, speech LSFs are modeled as i.i.d realizations of a multivariate normal mixture density. The mixture model parameters are efficiently estimated from the training data using the expectation m... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A pseudo-cepstrum based short-term postfilter

    Publication Year: 2000, Page(s):99 - 101
    Cited by:  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (268 KB)

    We propose an adaptive short-term postfilter for speech coders by incorporating the properties of the pseudo-cepstrum. Since the proposed postfilter implicitly has a characteristic of tilt compensation, it does not require an additional tilt compensation filter as conventional techniques. We derive a relationship between the parameters of the proposed postfilter based on a minimum phase distortion... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Performance comparison of intraframe and interframe LSF quantization in packet networks

    Publication Year: 2000, Page(s):126 - 128
    Cited by:  Papers (1)  |  Patents (11)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (224 KB)

    Line spectrum frequencies (LSF) have been the prevailing parameter set to represent LPC coefficients in speech coding. Extensive research has been performed to exploit their interframe and intraframe correlations and quantize them more efficiently. Interframe coding of the LSF can cause error propagation when frame erasures occur. Since most LSF quantizers were designed with the primary concerns o... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Subband synthesized LPC vector quantization (SBS-LPC-VQ)

    Publication Year: 2000, Page(s):114 - 116
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (220 KB)

    This paper presents a new LPC parameter quantization method-SubBand Synthesized LPC Vector Quantization (SBS-LPC-VQ). In the subband synthesis process, the relationships between subband spectra and whole band LPC spectrum are established and thus the vector quantized subband LPC parameters are mapped to the whole band LPC parameters. This new SBS-LPC-VQ method overcomes high complexity problem of ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Efficient parameter quantisation for 2.4/1.2 kb/s split-band LPC coding

    Publication Year: 2000, Page(s):32 - 34
    Cited by:  Papers (3)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (236 KB)

    Speech coding at very low bit rates has many applications such as answering machines, IP telephony, mobile communications, military communications etc. Most low bit rate coders operate at around 2.4 kb/s, as the speech quality degrades too much below this bit rate. We describe a frequency domain speech coder capable of operating at both 2.9 and 1.2 kb/s, and produces good quality synthesised speec... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Results on reverse water-filling, SNR, and log-spectral error in codebook-based coding

    Publication Year: 2000, Page(s):38 - 40
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (240 KB)

    This paper identifies optimum levels of reverse water-filling for codebook-based coding of noise and speech signals. We find that there is little to be gained from optimizing an effective rate parameter. We identify trade-offs between SNR and log-spectral error. We show that the use of a gain factor compares favorably with reverse water-filling in some situations View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Stereophonic CELP coding using cross channel prediction

    Publication Year: 2000, Page(s):136 - 138
    Cited by:  Papers (2)  |  Patents (29)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (288 KB)

    One step towards more realistic speech communication is the move from monophonic to stereophonic sound transmission. Stereophonic speech coding has been explored in the past with the use of cross-channel “cancellation” prediction combined with ADPCM. While this solution and other multi-channel audio techniques can be applied to speech coding there are some fundamental reasons that may ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A sinusoidal LPC vocoder

    Publication Year: 2000, Page(s):2 - 4
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (284 KB)

    Twenty years of work with sinusoidal modeling of speech has lead to very competitive principles of low rate coding. In this study, we discuss a few issues in the design of a sinusoidal coding system. We stress that by a careful design of all blocks of the encoder and decoder, allowing for some additional complexity, it is possible to build a low rate coder free of many of the artifacts associated ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Diversity control among multiple coders: a simple approach to multiple descriptions

    Publication Year: 2000, Page(s):69 - 71
    Cited by:  Papers (5)  |  Patents (8)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (244 KB)

    This paper presents a voice communication arrangement in which multiple coder-decoder (codec) pairs are coordinated to provide diversified descriptions of the source signal. This arrangement allows the system to robustly mitigate channel erasures that may occur while providing the best possible quality in the absence of channel impairments. Since single descriptive systems have been largely deploy... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • On the perceptual weighting function for phase quantization of speech

    Publication Year: 2000, Page(s):62 - 64
    Cited by:  Papers (1)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (256 KB)

    This paper addresses the issue on the utilization of the perceptual characteristics of the human auditory system for the phase quantization of speech signals. Taking into account the phase quantization noise, we propose the perceptual weighting function to make the quantization noise below the threshold of human perception. The weighting function is derived from psychoacoustic experiments in which... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.