On modeling non-word events in Large Vocabulary Continuous Speech Recognition

6 Author(s)
G. Sárosi (Budapest University of Technology and Economics, Hungary); B. Tarján; A. Balog; T. Mozsolics

This paper focuses on the integration of non-word acoustic events into LVCSR (Large Vocabulary Continuous Speech Recognition). Non-word events may play an important role in cognitive, paraverbal infocommunication; however, they are often not modeled explicitly because of computational difficulties. In our experiments, a serial and a loopback WFST (Weighted Finite State Transducer) architecture were built to recognize certain non-word events and/or print them on the output. We used a Hungarian Broadcast News corpus to evaluate the results. No degradation in normal word recognition accuracy was observed compared to the baseline, in which no non-word event modeling was applied. The non-word event recognition accuracy was, however, lower than expected. One of the most likely reasons is that the manual transcription of non-word events is less consistent than that of normal words. Nonetheless, some of the non-word events were recognized correctly in most cases. The loopback architecture has a higher memory requirement but gives significantly better non-word event accuracies, without any increase in recognition time.

Published in:

Cognitive Infocommunications (CogInfoCom), 2012 IEEE 3rd International Conference on

Date of Conference:

2-5 Dec. 2012