A Comparative Study of Linear and Nonlinear Dimensionality Reduction for Speaker Identification
Errity, A.
McKenna, J.
Dublin City Univ., Dublin;
This paper appears in: Digital Signal Processing, 2007 15th International Conference on
Publication Date: 1-4 July 2007
On page(s): 587-590
Location: Cardiff,
ISBN: 1-4244-0882-2
INSPEC Accession Number: 9855702
Digital Object Identifier: 10.1109/ICDSP.2007.4288650
Current Version Published: 2007-08-13
Abstract
In this paper we apply linear and nonlinear dimensionality reduction methods to speech produced by a number of different speakers in an effort to yield low dimensional features capable of discriminating between speakers. The classical linear dimensionality reduction method, principal component analysis (PCA), and the nonlinear manifold learning method, Isomap, are investigated. The resulting features are evaluated in GMM-based speaker identification experiments and compared to conventional cepstral features. Isomap is shown to give the highest accuracy for very low dimensions, outperforming MFCCs and PCA transformed features. Isomap is shown to be useful for visualisation of speaker clusters. For higher dimensions, speaker identification results indicate that features resulting from PCA offer improvements over conventional MFCCs.
Index
Terms
Available to subscribers and IEEE members.
References
Available to subscribers and IEEE members.
Citing Documents
Available to subscribers and IEEE members.