By Topic

Improvement In Automatic Speech Recognition Performance In Noisy Environments Using Time-Domain Blind Source Separation

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Cemil Demir ; Elektrik Elektronik Mühendisli¿i Bölümü Bo¿aziçi Üniversitesi, ¿stanbul; TÜB¿TAK-UEKAE (Türkiye Bilim ve Teknoloji Ara¿tirma Kurumu-Ulusal Elektronik ve Kriptoloji Ara¿tirma Enstitüsü) p.k. 74, 41470, Gebze/Kocaeli. ; F. Kerem Harmanci

Blind source separation (BSS) methods are generally used to separate speeches of people who are simultaneously speaking in the same room by using more than one microphone to record the speeches. However, in this study, the aim is to separate noise from the speech and therefore improve speech recognition performance using a time-domain BSS method. This method uses time-domain second-order statistics and is based on non-stationarity and non-whiteness properties of speech signals. Sphinx, which is a speaker independent, continuous speech recognition engine, is used to test the recognition performance of resulting enhanced speech. The simulation results demonstrate that, there is a remarkable improvement in recognition performance in terms of sentence error rate and word error rate.

Published in:

2007 IEEE 15th Signal Processing and Communications Applications

Date of Conference:

11-13 June 2007