By Topic

Evolution of the performance of automatic speech recognition algorithms in transcribing conversational telephone speech

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

6 Author(s)
Padmanabhan, M. ; IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA ; Saon, G. ; Zweig, G. ; Huang, J.
more authors

Research in the speech recognition speech-to-text conversion area has been underway for a couple of decades, and a great deal of progress has been made in reducing the word error rate. In this paper, we attempt to summarize the state of the art in speech recognition algorithms. The algorithms we describe span the areas of lexicon design, feature extraction, classifier design, combination of hypotheses, and speaker adaptation of acoustic models. We will benchmark the algorithms on two main sources of speech, the first being Voicemail (conversational telephone speech from a single speaker) and the second being Switchboard (conversational telephone speech between two speakers). We also present the results of some cross-domain experiments which highlight the “brittleness” of speech recognition systems today and illustrates the need to focus research effort on improving cross-domain performance

Published in:

Instrumentation and Measurement Technology Conference, 2001. IMTC 2001. Proceedings of the 18th IEEE  (Volume:3 )

Date of Conference: