By Topic

Audio, Speech, and Language Processing, IEEE Transactions on

Issue 10 • Date Dec. 2012

Filter Results

Displaying Results 1 - 17 of 17
  • Table of Contents

    Publication Year: 2012 , Page(s): C1 - C4
    Save to Project icon | Request Permissions | PDF file iconPDF (167 KB)  
    Freely Available from IEEE
  • IEEE Transactions on Audio, Speech, and Language Processing publication information

    Publication Year: 2012 , Page(s): C2
    Save to Project icon | Request Permissions | PDF file iconPDF (137 KB)  
    Freely Available from IEEE
  • A Message from the Vice President of Publications on New Developments in Signal Processing Society Publications

    Publication Year: 2012 , Page(s): 2625
    Save to Project icon | Request Permissions | PDF file iconPDF (83 KB)  
    Freely Available from IEEE
  • A Mixture Model Approach for Formant Tracking and the Robustness of Student's-t Distribution

    Publication Year: 2012 , Page(s): 2626 - 2636
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (2289 KB) |  | HTML iconHTML  

    We address the problem of robust formant tracking in continuous speech in the presence of additive noise. We propose a new approach based on mixture modeling of the formant contours. Our approach consists of two main steps: (i) Computation of a pyknogram based on multiband amplitude-modulation/frequency-modulation (AM/FM) decomposition of the input speech; and (ii) Statistical modeling of the pykn... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Structural Segmentation of Multitrack Audio

    Publication Year: 2012 , Page(s): 2637 - 2647
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (1581 KB) |  | HTML iconHTML  

    Structural segmentation of musical audio signals is one of many active areas of Music Information Retrieval (MIR) research. One aspect of this important topic which has so far received little attention though is the potential advantage to be gained by utilizing multitrack audio. This paper gives an overview of current segmentation techniques, and demonstrates that by applying a particular segmenta... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Correctness-Adjusted Unsupervised Discriminative Acoustic Model Adaptation

    Publication Year: 2012 , Page(s): 2648 - 2656
    Cited by:  Papers (3)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (603 KB) |  | HTML iconHTML  

    Unsupervised acoustic model adaptation for large vocabulary speech recognition is typically accomplished by using an estimated transcription of the adaptation data. The effectiveness of the technique is limited by errors in the estimated transcription. Previous work has mitigated this negative effect by using only those sections of the adaptation data which are transcribed with relatively high con... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Real-Time Perception-Based Clipping of Audio Signals Using Convex Optimization

    Publication Year: 2012 , Page(s): 2657 - 2671
    Cited by:  Papers (5)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (1676 KB) |  | HTML iconHTML  

    Clipping is an essential signal processing operation in many real-time audio applications, yet the use of existing clipping techniques generally has a detrimental effect on the perceived audio signal quality. In this paper, we present a novel multidisciplinary approach to clipping which aims to explicitly minimize the perceptible clipping-induced distortion by embedding a convex optimization crite... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Exploring the Predictability of Non-Unique Acoustic-to-Articulatory Mappings

    Publication Year: 2012 , Page(s): 2672 - 2682
    Cited by:  Papers (3)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (1006 KB) |  | HTML iconHTML  

    This paper explores statistical tools that help analyze the predictability in the acoustic-to-articulatory inversion of speech, using an Electromagnetic Articulography database of simultaneously recorded acoustic and articulatory data. Since it has been shown that speech acoustics can be mapped to non-unique articulatory modes, the variance of the articulatory parameters is not sufficient to under... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Inference of Room Geometry From Acoustic Impulse Responses

    Publication Year: 2012 , Page(s): 2683 - 2695
    Cited by:  Papers (9)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (2444 KB) |  | HTML iconHTML  

    Acoustic scene reconstruction is a process that aims to infer characteristics of the environment from acoustic measurements. We investigate the problem of locating planar reflectors in rooms, such as walls and furniture, from signals obtained using distributed microphones. Specifically, localization of multiple two- dimensional (2-D) reflectors is achieved by estimation of the time of arrival (TOA... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Beat Tracking for Multiple Applications: A Multi-Agent System Architecture With State Recovery

    Publication Year: 2012 , Page(s): 2696 - 2706
    Cited by:  Papers (4)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (1716 KB) |  | HTML iconHTML  

    In this paper we propose an audio beat tracking system, IBT, for multiple applications. The proposed system integrates an automatic monitoring and state recovery mechanism, that applies (re-)inductions of tempo and beats, on a multi-agent-based beat tracking architecture. This system sequentially processes a continuous onset detection function while propagating parallel hypotheses of tempo and bea... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Generalization of Multi-Channel Linear Prediction Methods for Blind MIMO Impulse Response Shortening

    Publication Year: 2012 , Page(s): 2707 - 2720
    Cited by:  Papers (3)
    Save to Project icon | Request Permissions | Click to expandAbstract | PDF file iconPDF (734 KB) |  | HTML iconHTML  

    The performance of many microphone array processing techniques deteriorates in the presence of reverberation. To provide a widely applicable solution to this longstanding problem, this paper generalizes existing dereverberation methods using subband-domain multi-channel linear prediction filters so that the resultant generalized algorithm can blindly shorten a multiple-input multiple-output (MIMO)... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • List of reviewers

    Publication Year: 2012 , Page(s): 2721 - 2723
    Save to Project icon | Request Permissions | PDF file iconPDF (100 KB)  
    Freely Available from IEEE
  • IEEE Transactions on Audio, Speech, and Language Processing EDICS

    Publication Year: 2012 , Page(s): 2724 - 2725
    Save to Project icon | Request Permissions | PDF file iconPDF (31 KB)  
    Freely Available from IEEE
  • IEEE Transactions on Audio, Speech, and Language Processing Information for Authors

    Publication Year: 2012 , Page(s): 2726 - 2727
    Save to Project icon | Request Permissions | PDF file iconPDF (146 KB)  
    Freely Available from IEEE
  • Open Access [advertisement]

    Publication Year: 2012 , Page(s): 2728
    Save to Project icon | Request Permissions | PDF file iconPDF (1156 KB)  
    Freely Available from IEEE
  • 2012 Index IEEE Transactions on Audio, Speech, and Language Processing Vol. 20

    Publication Year: 2012 , Page(s): 2729 - 2756
    Save to Project icon | Request Permissions | PDF file iconPDF (531 KB)  
    Freely Available from IEEE
  • IEEE Signal Processing Society Information

    Publication Year: 2012 , Page(s): C3
    Save to Project icon | Request Permissions | PDF file iconPDF (33 KB)  
    Freely Available from IEEE

Aims & Scope

IEEE Transactions on Audio, Speech and Language Processing covers the sciences, technologies and applications relating to the analysis, coding, enhancement, recognition and synthesis of audio, music, speech and language.

 

This Transactions ceased publication in 2013. The current retitled publication is IEEE/ACM Transactions on Audio, Speech, and Language Processing.

Full Aims & Scope

Meet Our Editors

Editor-in-Chief
Li Deng
Microsoft Research