Language Identification using Acoustic Models and Speaker Compensated Cepstral-Time Matrices
Castaldo, F.; Dalmasso, E.; Laface, P.; Colibro, D.; Vair, C.
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Volume 4, Issue , 15-20 April 2007 Page(s):IV-1013 - IV-1016
Digital Object Identifier 10.1109/ICASSP.2007.367244
Summary:This work presents two contributions to language identification. The first contribution is the definition of a set of properly selected time-frequency features that are a valid alternative to the commonly used shifted delta cepstral features. As a second contribution, we show that significant performance improvement in language recognition can be obtained estimating a subspace that represents the distortions due to inter-speaker variability within the same language, and compensating these distortions in the domain of the features. Experiments on the NIST 1996 and 2003 Language Recognition Evaluation data have been successfully used to validate the effectiveness of the proposed techniques
View citation and abstract |