On the modeling of natural vocal emotion expressions through binary key
Title | On the modeling of natural vocal emotion expressions through binary key |
Publication Type | Conference Paper |
Year of Publication | 2014 |
Authors | Luque, J., Anguera, X. |
Conference Name | Signal Processing Conference (EUSIPCO), 2014 Proceedings of the 22nd European |
Date Published | Sept |
Keywords | acoustic descriptors, Acoustic signal processing, Acoustics, binary acoustic modeling, binary fingerprint, binary key modeling, binary value representation, correlation coefficient, dimensional emotions, Emotion modeling, emotion recognition, feature extraction, German TV talk-show, mean absolute error, natural vocal emotion expression modelling, regression analysis, speaker emotion characteristics, spectral parameters, Speech, speech features, Speech recognition, spontaneous dialogues, standard acoustic feature mapping, Support vector machines, support vector regression model, three-continuous emotional dimensions, Training, VAM corpus, Vectors |
Abstract | This work presents a novel method to estimate natural expressed emotions in speech through binary acoustic modeling. Standard acoustic features are mapped to a binary value representation and a support vector regression model is used to correlate them with the three-continuous emotional dimensions. Three different sets of speech features, two based on spectral parameters and one on prosody are compared on the VAM corpus, a set of spontaneous dialogues from a German TV talk-show. The regression analysis, in terms of correlation coefficient and mean absolute error, show that the binary key modeling is able to successfully capture speaker emotion characteristics. The proposed algorithm obtains comparable results to those reported on the literature while it relies on a much smaller set of acoustic descriptors. Furthermore, we also report on preliminary results based on the combination of the binary models, which brings further performance improvements. |
URL | http://ieeexplore.ieee.org/document/6952552/ |
Citation Key | 6952552 |
- natural vocal emotion expression modelling
- Vectors
- VAM corpus
- Training
- three-continuous emotional dimensions
- support vector regression model
- Support vector machines
- standard acoustic feature mapping
- spontaneous dialogues
- Speech recognition
- speech features
- Speech
- spectral parameters
- speaker emotion characteristics
- regression analysis
- acoustic descriptors
- mean absolute error
- German TV talk-show
- feature extraction
- emotion recognition
- Emotion modeling
- dimensional emotions
- correlation coefficient
- binary value representation
- binary key modeling
- binary fingerprint
- binary acoustic modeling
- Acoustics
- Acoustic signal processing