Zero-crossing-based speech segregation and recognition for humanoid robots

3 Author(s)
Sung Jun An ; Advanced Institute of Science and Technology ; Rhee Man Kil ; Young-ik Kim

Humanoid robots attract attention because their overall appearance resembles the human body, which allows them to interact with humans and the surrounding environment. For auditory interaction with humans, it is desirable that humanoid robots have a capacity similar to that of the human auditory information processing system. This is a very difficult task, since current automatic speech recognition (ASR) systems are not robust to noise and it is hard to attend to a selected speech source. In this context, this paper presents a new method of zero-crossing-based binaural mask estimation for speech segregation and recognition when multiple sound sources are present simultaneously. The proposed method provides high speech segregation and recognition performance while requiring significantly less computation than conventional methods based on cross-correlation. We expect that this method can provide an effective tool for auditory interaction with humanoid robots using the sensory information of binaural sounds.

Published in:

IEEE Transactions on Consumer Electronics (Volume: 55, Issue: 4)