Abstract:
Emotion recognition in speech signals has the objective to classify speaker emotions exactly as human listeners would do. However, common accuracy measures only consider ...Show MoreMetadata
Abstract:
Emotion recognition in speech signals has the objective to classify speaker emotions exactly as human listeners would do. However, common accuracy measures only consider the majority voting of the raters and therefore require a classification with a final hard decision. In the present paper, we thus propose a new accuracy measure for automatic speech emotion recognition, which takes into account the distribution of the labels of all raters for a speech sample or segment, and the respective distribution of the classifier's confidence output. Hence, by example of a well-known but newly labeled database we will discuss important annotation aspects regarding the design of a ground truth for emotional databases. Furthermore, we demonstrate experimental results of an example emotion recognition applying both state-of-the-art and the novel accuracy measurement approach.
Published in: Speech Communication; 11. ITG Symposium
Date of Conference: 24-26 September 2014
Date Added to IEEE Xplore: 17 October 2014
Print ISBN:978-3-8007-3640-9
Conference Location: Erlangen, Germany