
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on

Date: 16-19 Dec. 2008


  • ISCSLP 2008 [Cover]

    Publication Year: 2008, Page(s): 1
  • ISCSLP 2008 [Title page]

    Publication Year: 2008, Page(s): 1
  • ISCSLP 2008 [Copyright notice]

    Publication Year: 2008, Page(s): 1
  • ISCSLP 2008 Message from the General Chair

    Publication Year: 2008, Page(s): 1
  • ISCSLP 2008 Preface

    Publication Year: 2008, Page(s): 1
  • ISCSLP 2008 Organizing Committees

    Publication Year: 2008, Page(s): 1
  • ISCSLP 2008 Technical Program Committee

    Publication Year: 2008, Page(s):1 - 3
  • ISCSLP 2008 International Advisory Committee

    Publication Year: 2008, Page(s): 1
  • ISCSLP 2008 Sponsors

    Publication Year: 2008, Page(s): 1
  • ISCSLP 2008 Table of contents

    Publication Year: 2008, Page(s):1 - 6
  • ISCSLP 2008 author index

    Publication Year: 2008, Page(s):1 - 2
  • Simultaneous Acoustic, Prosodic, and Phrasing Model Training for TTS Conversion Systems

    Publication Year: 2008, Page(s):1 - 4
    Cited by:  Papers (3)

    A new integrated model for simultaneously modeling linguistic and acoustic models, together with a training algorithm, is proposed. Usually, text-to-speech (TTS) systems based on the hidden Markov model (HMM) consist of text analysis and speech synthesis modules. Linguistic and acoustic model training are performed independently using different training data sets. Integrated model parameters were simultaneo...

  • Cross-Stream Dependency Modeling for HMM-Based Speech Synthesis

    Publication Year: 2008, Page(s):1 - 4
    Cited by:  Papers (2)

    This paper presents a method for modeling the dependency between F0 and spectral features in the HMM-based parametric speech synthesis system. In conventional systems these two features are modeled as two independent streams, which is inconsistent with the fact that there always exists interaction between the extracted F0 and spectral parameters for model training. A piecewise linear transfor...

  • Cross-Lingual Speaker Adaptation for HMM-Based Speech Synthesis

    Publication Year: 2008, Page(s):1 - 4
    Cited by:  Papers (7)

    This paper explores a cross-lingual speaker adaptation technique for HMM-based speech synthesis, where a source voice model for English is transformed into a target speaker model using Mandarin Chinese speech data from the target speaker. A phone mapping-based method is adopted to map Chinese Initial/Finals into English phonemes and two types of mapping rules, including one-to-one and one-to-sequ...

  • HMM-Based Mixed-Language (Mandarin-English) Speech Synthesis

    Publication Year: 2008, Page(s):1 - 4
    Cited by:  Patents (3)

    English words or short phrases embedded in Mandarin utterances have become more common among bilingually educated people like college students in China. Similarly, it becomes highly desirable that TTS systems can synthesize mixed-language speech properly. Recently, we proposed an HMM-based bilingual TTS to synthesize a target language when only monolingual source language recording from a speaker...

  • Improving HMM Based Speech Synthesis by Reducing Over-Smoothing Problems

    Publication Year: 2008, Page(s):1 - 4
    Cited by:  Papers (1)

    Although hidden Markov model based speech synthesis has been proved to have good performance, there are still some factors which degrade the quality of synthesized speech: vocoder, model accuracy and over-smoothing. This paper analyzes these factors separately. Modifications for removing different factors are proposed. Experimental results show that over-smoothing in frequency domain mainly affect...

  • Pronunciation Space Models for Pronunciation Evaluation

    Publication Year: 2008, Page(s):1 - 4
    Cited by:  Papers (1)

    Posterior probability is mostly used for pronunciation evaluation. This paper introduces pronunciation space models to calculate posterior probability, replacing traditional phone-based acoustic models, which makes the calculated posterior probability more precise. Pronunciation space models are constructed using an unsupervised clustering method guided by human scores and phone-level posterior probab...

  • Decision Fusion for Improving Mispronunciation Detection Using Language Transfer Knowledge and Phoneme-Dependent Pronunciation Scoring

    Publication Year: 2008, Page(s):1 - 4
    Cited by:  Papers (1)

    Application of linguistic knowledge of language transfer to automatic speech recognition (ASR) technology can enhance mispronunciation detection performance in computer-aided pronunciation training (CAPT). This is achieved by pinpointing salient pronunciation errors made by second language learners. In this work, we propose to apply decision fusion for further improvement in mispronunciation detec...

  • Mandarin Learning Using Speech and Language Technologies: A Translation Game in the Travel Domain

    Publication Year: 2008, Page(s):1 - 4

    This paper describes a new Web-based translation game we have designed to help a student learn spoken Chinese. The student talks to the system in Chinese and the system compares the recognized sentence against a set of English prompts to judge whether it is a suitable translation of any one of them. The game can also provide translation assistance upon request. The game was developed using the IWS...

  • Word Order Correction for Language Transfer Using Relative Position Language Modeling

    Publication Year: 2008, Page(s):1 - 4
    Cited by:  Papers (1)

    Sentence correction has been an important and emerging issue in computer-assisted language learning. However, existing techniques based on grammar rules or statistical machine translation are still not robust enough to tackle the common incorrect word order errors in sentences produced by second language learners of Chinese. In this paper, a novel relative position language model is proposed to ad...

  • Improving Automatic Evaluation of Mandarin Pronunciation with Speaker Adaptive Training (SAT) and MLLR Speaker Adaption

    Publication Year: 2008, Page(s):1 - 4
    Cited by:  Papers (2)

    Automatic pronunciation evaluation (APE) can be implemented with a speech recognition model trained by standard, "golden" speakers. The pronunciation accuracy is then measured with the Goodness of Pronunciation (GOP) as reported in our earlier work [1]. In this paper, we investigate two main strategies for improving the evaluation: speaker adaptive training (SAT) for reducing the speaker-specific ...

  • Automatic Assessment of Language Proficiency through Shadowing

    Publication Year: 2008, Page(s):1 - 4

    Shadowing is a practice that requires learners to shadow a presented native utterance as closely and quickly as possible. Learners' pronunciation in shadowing, especially in the case of beginners, often becomes inarticulate and corrupt. These features of shadowing make it very difficult to assess shadowing productions. In this paper, we investigate the automatic pronunciation scoring methods for s...

  • What's in the F0 of Mandarin Speech: Tones, Intonation and Beyond

    Publication Year: 2008, Page(s):1 - 4
    Cited by:  Papers (1)

    We analyzed F0 contours of fluent Mandarin speech using a modified command-response model. Adopting the multiple-phrase speech paragraph as a discourse prosodic unit, we investigated the composition of F0 contours to see whether additional prosodic information beyond tones and intonation exists. Testing F0 contributions with a previously constructed prosody hierarchy, the HPG (hierarchy of prosodic...

  • A Perceptual Study of Approximated Cantonese Tone Contours

    Publication Year: 2008, Page(s):1 - 4
    Cited by:  Papers (1)

    This paper describes a perceptual study on approximated Cantonese tone contours. It is found that Cantonese tone contours and tone transitions can be approximated by a limited number of linear movements, without creating any noticeable perceptual difference. The slopes of these linear movements are analyzed. They are found to be related to two thresholds of pitch movement perception. The results...

  • A New Prosodic Strength Calculation Method for Prosody Reduction Modeling

    Publication Year: 2008, Page(s):1 - 4

    To improve the naturalness of synthetic speech, prosody models in text-to-speech (TTS) systems should be able to describe different prosody variations in natural speech. In this paper, prosody variation patterns behind the partial reduction phenomena are analyzed. In order to model the prosody reduction effect and incorporate it into the prosody model for speech synthesis, prosodic strength is intr...
