Please use this identifier to cite or link to this item:
Title: New approach in quantification of emotional intensity from the speech signal: emotional temperature
Authors: Alonso Hernández, Jesús Bernardino 
Cabrera, Josué
Medina Molina, Manuel Martín 
Travieso, Carlos M. 
UNESCO Clasification: 3307 Tecnología electrónica
Keywords: Emotional speech recognition
Pattern recognition
Emotional intensity
Emotional temperature
Issue Date: 2015
Journal: Expert Systems with Applications 
Abstract: The automatic speech emotion recognition has a huge potential in applications of fields such as psychology, psychiatry and the affective computing technology. The spontaneous speech is continuous, where the emotions are expressed in certain moments of the dialogue, given emotional turns. Therefore, it is necessary that the real-time applications are capable of detecting changes in the speaker's affective state. In this paper, we emphasize on recognizing activation from speech using a few feature set obtained from a temporal segmentation of the speech signal of different language like German, English and Polish. The feature set includes two prosodic features and four paralinguistic features related to the pitch and spectral energy balance. This segmentation and feature set are suitable for real-time emotion applications because they allow detect changes in the emotional state with very low processing times. The German Corpus EMO-DB (Berlin Database of Emotional Speech), the English Corpus LDC (Emotional Prosody Speech and Transcripts database) and the Polish Emotional Speech Database are used to train the Support Vector Machine (SVM) classifier and for gender-dependent activation recognition. The results are analyzed for each speech emotion with gender-dependent separately and obtained accuracies of 94.9%, 88.32% and 90% for EMO-DB, LDC and Polish databases respectively. This new approach provides a comparable performance with lower complexity than other approaches for real-time applications, thus making it an appealing alternative, may assist in the future development of automatic speech emotion recognition systems with continuous tracking
ISSN: 0957-4174
DOI: 10.1016/j.eswa.2015.07.062
Source: Expert Systems with Applications [ISSN 0957-4174], v. 42 (24), p. 9554-9564
Appears in Collections:Artículos
Show full item record

Google ScholarTM




Export metadata

Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.