By Topic

Arabic Speech Recognition by Bionic Wavelet Transform and MFCC using a Multi Layer Perceptron

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Ben Nasr, M. ; Dept. of Electron., Fac. of Sci. of Tunis, Tunis, Tunisia ; Talbi, M. ; Cherif, A.

In this paper, we have proposed a new technique of Arabic Speech Recognition (ASR) with monolocutor and a reduced vocabulary. This technique consists at first step in using our proper speech database containing Arabic speech words which are recorded by a mono-locutor. The second step consists in features extracting from those recorded words. The third step is to classify those extracted features. This extraction is performed by computing at first step, the Mel Frequency Cepstral Coefficients (MFCCs) from each recorded word, then the Bionic Wavelet Transform (BWT) is applied to the vector obtained from the concatenation of the computed MFCCs. The obtained bionic wavelet coefficients are then concatenated to construct one input of a Multi-Layer Perceptual (MLP) used for features classification. In the MLP learning and test phases, we have used eleven Arabic words and each of them is repeated twenty five times by the same locutor. A simulation program is performed to test the performance of the proposed technique and shows a classification rate equals to 99.39%. We have also introduced a module of denoising as a phase of preprocessing. In this denoising module, we have treated the case of white noise and we have used the Wiener filtering. In case of SNR=5dB, the obtained recognition rate is equals to 78.7% and in case of SNR=10dB, it is equals to 93.9%.

Published in:

Sciences of Electronics, Technologies of Information and Telecommunications (SETIT), 2012 6th International Conference on

Date of Conference:

21-24 March 2012