The integration of emotional states into (spoken) human-computer interfaces has recently emerged as a field of research. In this paper we describe the enhancements and optimizations of a speech-based emotion recognizer that operates jointly with automatic speech recognition. We argue that knowledge of the textual content of an utterance can improve the recognition of its emotional content. Having outlined the experimental setup, we present results and demonstrate the feasibility of a post-processing algorithm that combines multiple speech-emotion recognizers. For dialogue management we propose a stochastic approach comprising a dialogue model and an emotional model that interact with each other in a combined dialogue-emotion model. These models are trained from dialogue corpora and, assigned different weighting factors, determine the course of the dialogue.
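As an illustration of how two weighted stochastic models might jointly select the next system action, consider the following minimal sketch. It is not the method of the paper: the log-linear combination, the action and emotion labels, the probability values, and the names `DIALOGUE_MODEL`, `EMOTION_MODEL`, and `choose_action` are all assumptions made for this example, and in practice the probabilities would be estimated from dialogue corpora.

```python
import math

# Hypothetical toy probabilities (assumed for illustration only).
# P(system action | previous user dialogue act)
DIALOGUE_MODEL = {
    "request_info": {"provide_info": 0.7, "apologize": 0.1, "ask_repeat": 0.2},
}
# P(system action | recognized user emotion)
EMOTION_MODEL = {
    "neutral": {"provide_info": 0.6, "apologize": 0.1, "ask_repeat": 0.3},
    "angry": {"provide_info": 0.2, "apologize": 0.7, "ask_repeat": 0.1},
}


def choose_action(user_act, emotion, w_dialogue=1.0, w_emotion=1.0):
    """Return the action with the highest weighted log-probability score."""
    p_dialogue = DIALOGUE_MODEL[user_act]
    p_emotion = EMOTION_MODEL[emotion]
    # Assumed log-linear combination: each model's log-probability is
    # scaled by its weighting factor and the two terms are summed.
    scores = {
        action: w_dialogue * math.log(p_dialogue[action])
        + w_emotion * math.log(p_emotion[action])
        for action in p_dialogue
    }
    return max(scores, key=scores.get)


if __name__ == "__main__":
    print(choose_action("request_info", "angry"))
    print(choose_action("request_info", "angry", w_emotion=2.0))
```

In this toy setting, raising the weight of the emotional model shifts the selected action for an angry user from providing information to apologizing, which shows how the weighting factors can steer the course of the dialogue.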
Published in:
Intelligent Environments, 2007. IE 07. 3rd IET International Conference on
Date of Conference: 24-25 Sept. 2007