Visible to the public On the modeling of natural vocal emotion expressions through binary key

TitleOn the modeling of natural vocal emotion expressions through binary key
Publication TypeConference Paper
Year of Publication2014
AuthorsLuque, J., Anguera, X.
Conference NameSignal Processing Conference (EUSIPCO), 2014 Proceedings of the 22nd European
Date PublishedSept
Keywordsacoustic descriptors, Acoustic signal processing, Acoustics, binary acoustic modeling, binary fingerprint, binary key modeling, binary value representation, correlation coefficient, dimensional emotions, Emotion modeling, emotion recognition, feature extraction, German TV talk-show, mean absolute error, natural vocal emotion expression modelling, regression analysis, speaker emotion characteristics, spectral parameters, Speech, speech features, Speech recognition, spontaneous dialogues, standard acoustic feature mapping, Support vector machines, support vector regression model, three-continuous emotional dimensions, Training, VAM corpus, Vectors
Abstract

This work presents a novel method to estimate natural expressed emotions in speech through binary acoustic modeling. Standard acoustic features are mapped to a binary value representation and a support vector regression model is used to correlate them with the three-continuous emotional dimensions. Three different sets of speech features, two based on spectral parameters and one on prosody are compared on the VAM corpus, a set of spontaneous dialogues from a German TV talk-show. The regression analysis, in terms of correlation coefficient and mean absolute error, show that the binary key modeling is able to successfully capture speaker emotion characteristics. The proposed algorithm obtains comparable results to those reported on the literature while it relies on a much smaller set of acoustic descriptors. Furthermore, we also report on preliminary results based on the combination of the binary models, which brings further performance improvements.

URLhttp://ieeexplore.ieee.org/document/6952552/
Citation Key6952552