Modeling of the glottal flow derivative waveform with applicationto speaker identification
Plumpe, M.D.; Quatieri, T.F.; Reynolds, D.A.
Speech and Audio Processing, IEEE Transactions on
Volume 7, Issue 5, Sep 1999 Page(s):569 - 586
Digital Object Identifier 10.1109/89.784109
Summary:An automatic technique for estimating and modeling the glottal
flow derivative source waveform from speech, and applying the model
parameters to speaker identification, is presented. The estimate of the
glottal flow derivative is decomposed into coarse structure,
representing the general flow shape, and fine structure, comprising
aspiration and other perturbations in the flow, from which model
parameters are obtained. The glottal flow derivative is estimated using
an inverse filter determined within a time interval of vocal-fold
closure that is identified through differences in formant frequency
modulation during the open and closed phases of the glottal cycle. This
formant motion is predicted by Ananthapadmanabha and Fant (1982) to be a
result of time-varying and nonlinear source/vocal tract coupling within
a glottal cycle. The glottal flow derivative estimate is modeled using
the Liljencrants-Fant (1986) model to capture its coarse structure,
while the fine structure of the flow derivative is represented through
energy and perturbation measures. The model parameters are used in a
Gaussian mixture model speaker identification (SID) system. Both coarse-
and fine-structure glottal features are shown to contain significant
speaker-dependent information. For a large TIMIT database subset,
averaging over male and female SID scores, the coarse-structure
parameters achieve about 60% accuracy, the fine-structure parameters
give about 40% accuracy, and their combination yields about 70% correct
identification. Finally, in preliminary experiments on the counterpart
telephone-degraded NTIMIT database, about a 5% error reduction in SID
scores is obtained when source features are combined with traditional
mel-cepstral measures
View citation and abstract |