Animated computer graphics displays of the visible speech gestures of the human face have a number of potential applications. The paper describes a novel method for their creation by bringing together two statistically-based techniques, namely hidden Markov modelling and principal component analysis. The animations are derived from images of a real speaker's face and incorporate all the visible features of the primary articulators, including the lips, teeth and tongue, in a graphical display which does not use an artificial facial model. A pilot `video speech synthesiser' of this kind has been implemented and tested on spoken digit strings
Published in:
Speech, Image Processing and Neural Networks, 1994. Proceedings, ISSIPNN '94., 1994 International Symposium on
Date of Conference: 13-16 Apr 1994