New approach in quantification of emotional intensity from the speech signal: emotional temperature

Alonso Hernández, Jesús Bernardino; Cabrera, Josué; Medina Molina, Manuel Martín; Travieso, Carlos M.

Título:	New approach in quantification of emotional intensity from the speech signal: emotional temperature
Autores/as:	Alonso Hernández, Jesús Bernardino Cabrera, Josué Medina Molina, Manuel Martín Travieso, Carlos M.
Clasificación UNESCO:	3307 Tecnología electrónica
Palabras clave:	Emotional speech recognition Pattern recognition Emotional intensity Emotional temperature
Fecha de publicación:	2015
Publicación seriada:	Expert Systems with Applications
Resumen:	The automatic speech emotion recognition has a huge potential in applications of fields such as psychology, psychiatry and the affective computing technology. The spontaneous speech is continuous, where the emotions are expressed in certain moments of the dialogue, given emotional turns. Therefore, it is necessary that the real-time applications are capable of detecting changes in the speaker's affective state. In this paper, we emphasize on recognizing activation from speech using a few feature set obtained from a temporal segmentation of the speech signal of different language like German, English and Polish. The feature set includes two prosodic features and four paralinguistic features related to the pitch and spectral energy balance. This segmentation and feature set are suitable for real-time emotion applications because they allow detect changes in the emotional state with very low processing times. The German Corpus EMO-DB (Berlin Database of Emotional Speech), the English Corpus LDC (Emotional Prosody Speech and Transcripts database) and the Polish Emotional Speech Database are used to train the Support Vector Machine (SVM) classifier and for gender-dependent activation recognition. The results are analyzed for each speech emotion with gender-dependent separately and obtained accuracies of 94.9%, 88.32% and 90% for EMO-DB, LDC and Polish databases respectively. This new approach provides a comparable performance with lower complexity than other approaches for real-time applications, thus making it an appealing alternative, may assist in the future development of automatic speech emotion recognition systems with continuous tracking
URI:	https://accedacris.ulpgc.es/handle/10553/55746
ISSN:	0957-4174
DOI:	10.1016/j.eswa.2015.07.062
Fuente:	Expert Systems with Applications [ISSN 0957-4174], v. 42 (24), p. 9554-9564
Colección:	Artículos

Vista completa

Citas SCOPUS^TM

Citas de WEB OF SCIENCE^TM
Citations

Visitas

Google Scholar^TM

Altmetric

Comparte

Exporta metadatos

Dirección

Contacto

Legal

De interés

Citas SCOPUSTM

Citas de WEB OF SCIENCETM Citations

Visitas

Google ScholarTM

Altmetric

Comparte

Exporta metadatos

Dirección

Citas SCOPUS^TM

Citas de WEB OF SCIENCE^TM
Citations

Google Scholar^TM