Overview of compression and packet loss effects in speech biometrics
Besacier, L.
Mayorga, P.
Bonastre, J.-F.
Fredouille, C.
Meignier, S.
CLIPS/IMAG, Grenoble, France;
This paper appears in: Vision, Image and Signal Processing, IEE Proceedings -
Publication Date: 15 Dec. 2003
Volume: 150,
Issue: 6
On page(s): 372- 376
ISSN: 1350-245X
INSPEC Accession Number: 7840620
Digital Object Identifier: 10.1049/ip-vis:20031033
Current Version Published: 2004-02-06
Abstract
An overview is presented of compression and packet loss effects in speech biometrics. These new problems appear particularly in recent applications of biometrics over mobile or Internet networks. The influence of speech compression on speaker recognition performance in mobile networks is investigated. In a first experiment, it is found that the use of GSM coding degrades the performance. In a second experiment, the features for the speaker recognition system are calculated directly from the information available in the encoded bit stream. It is found that a low LPC order in GSM coding is responsible for most performance degradations. A speaker recognition system was obtained which is equivalent in performance to the original one which decodes and reanalyses speech before performing recognition. The joint packet loss and compression effects over IP networks are also studied. It is experimentally demonstrated that the adverse effects of packet loss alone are negligible, while the encoding of speech, particularly at a low bit rate, coupled with packet loss, can reduce the verification accuracy considerably.
Index
Terms
Available to subscribers and IEEE members.
References
Available to subscribers and IEEE members.
Citing Documents
Available to subscribers and IEEE members.