By Topic

A sound-source localization system using three-microphone array and crosspower spectrum phase

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Hung-Yan Gu ; Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei 106, Taiwan ; Shan-Siang Yang

A sound source localization system is implemented that uses only three microphones to input sound signals. This system can estimate the azimuth and elevation of a sound source in real-time and in sufficient accuracy. We add a SNR measure besides spectra entropy to help detect voiced frames. Next, synchronous FFT phase copying is adopted, and cross-power spectrum phase is calculated to estimate TDOA (time delay of arrival) for each frame. Also, to enhance the accuracy of TDOA, parabolic interpolation is adopted. Then, by comparing the estimated TDOA values with theoretic ones, the azimuth and elevation of a sound source can be determined. Since a pair of azimuth and elevation is estimated from each voiced frame, these estimated values are thereafter summed with a weighting method to give one final answer of azimuth and elevation. According to the experiment results, the average errors in estimating azimuth and elevation are 4.02 and 2.18 degrees, respectively.

Published in:

2012 International Conference on Machine Learning and Cybernetics  (Volume:5 )

Date of Conference:

15-17 July 2012