A user voice reduction algorithm based on binaural signal separation for portable digital imaging devices

4 Author(s)
Ji Hun Park ; Sch. of Inf. & Commun., Gwangju Inst. of Sci. & Technol. (GIST), Gwangju, South Korea ; Hong Kook Kim ; Myeong Bo Kim ; Sang Ryong Kim

This paper proposes a user voice reduction algorithm for portable digital imaging devices, based on a binaural signal separation approach, to improve the naturalness of user-generated video content. The algorithm first estimates the interaural time differences (ITDs) from binaural signals recorded by the microphones mounted on the device. The estimated ITDs are then used to obtain time-frequency domain masking patterns that separate the user's voice from the sound of the actual subject of the video. Finally, the user voice recorded in the video is reduced by applying the mask patterns to the binaural signals. To demonstrate its effectiveness, the algorithm is implemented on a portable digital imaging device with a clock speed of 600 MHz. A performance evaluation based on sound pressure level measurements shows that the algorithm reduces the user voice by around 10 dB.
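
The three steps in the abstract (ITD estimation, time-frequency masking, applying the mask to the binaural signals) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not say how the ITDs are estimated or how the mask is shaped, so the phase-difference ITD estimate, the hard threshold, and every name and parameter below (stft, estimate_itd, user_voice_mask, user_itd, tol, floor, the frame and hop sizes) are assumptions chosen for illustration. The sketch also ignores phase wrapping at high frequencies and omits resynthesis to the time domain.

```python
import numpy as np

def stft(x, frame=512, hop=256):
    """Hann-windowed short-time Fourier transform, shape (frames, bins)."""
    win = np.hanning(frame)
    n_frames = 1 + (len(x) - frame) // hop
    frames = np.stack([x[i * hop:i * hop + frame] * win for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def estimate_itd(L, R, fs, frame=512):
    """Per-bin ITD in seconds from the interaural phase difference.

    Positive values mean the right channel lags the left channel.
    Assumption: the phase difference is not wrapped, i.e. |ITD| < 1 / (2 f).
    """
    freqs = np.fft.rfftfreq(frame, d=1.0 / fs)
    phase_diff = np.angle(L * np.conj(R))
    itd = np.zeros_like(phase_diff)
    itd[:, 1:] = phase_diff[:, 1:] / (2 * np.pi * freqs[1:])  # skip the DC bin
    return itd

def user_voice_mask(itd, user_itd, tol, floor=0.1):
    """Attenuate bins whose ITD lies within tol of the assumed user ITD.

    floor is a hypothetical residual gain, not a value from the paper.
    """
    return np.where(np.abs(itd - user_itd) < tol, floor, 1.0)

# Synthetic demo: a 500 Hz "user" tone that reaches the right microphone
# 4 samples late, plus a 1000 Hz "subject" tone with zero delay.
fs = 16000
t = np.arange(fs) / fs
delay = 4
user_left = np.sin(2 * np.pi * 500 * t)
user_right = np.roll(user_left, delay)  # exact delay: 500 Hz is periodic here
subject = 0.5 * np.sin(2 * np.pi * 1000 * t)

L = stft(user_left + subject)
R = stft(user_right + subject)
itd = estimate_itd(L, R, fs)
mask = user_voice_mask(itd, user_itd=delay / fs, tol=1e-4)

# Apply the same mask to both channels of the binaural signal.
L_out, R_out = L * mask, R * mask
```

With fs = 16000 and a frame of 512 samples, 500 Hz and 1000 Hz fall on bins 16 and 32, so the mask attenuates the bins around the user tone while leaving the subject tone untouched. A practical system would have to estimate or calibrate the user's ITD and handle overlapping, broadband sources, which this sketch does not attempt.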

Published in:

IEEE Transactions on Consumer Electronics (Volume: 58, Issue: 2)