By Topic

Combining log-spectral mean subtraction at different frequency resolutions for handset-channel compensation in single utterance speaker verification

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $33
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
O. Buyuk ; Electr. & Electron. Eng. Dept., Bogazici Univ., Istanbul, Turkey ; L. M. Arslan

Cepstral mean subtraction (CMS) is a well-known feature domain channel compensation technique employed to eliminate the effects of convolutive channel distortion. However, as the authors use in log-spectral mean subtraction (LSMS), the compensation might be applied in spectral domain before the filter-bank analysis with a higher-frequency resolution. LSMS can also be combined with CMS to further improve the recognition performance. In this study, the authors compare the performances of LSMS and CMS methods using a multi-channel, text-dependent single utterance speaker recognition database. In the experiments, the authors observe that LSMS outperforms CMS especially in the high false acceptance region. Moreover, the accuracy is further improved when the methods are combined together. With the combination, the authors achieve 15.5% relative reduction in equal error rate for no score normalisation and 9.4% for test normalisation cases when compared with the baseline CMS experiment.

Published in:

IET Signal Processing  (Volume:6 ,  Issue: 9 )