Continuous tracking of the emotion temperature

Alonso, Jesús B.; Cabrera  Cruz,Josué Jacob; Travieso González, Carlos Manuel; López-de-Ipiña, Karmele; Sánchez-Medina, Agustín

Título:	Continuous tracking of the emotion temperature
Autores/as:	Alonso, Jesús B. Cabrera Cruz,Josué Jacob Travieso González, Carlos Manuel López-de-Ipiña, Karmele Sánchez-Medina, Agustín
Clasificación UNESCO:	33 Ciencias tecnológicas 530602 Innovación tecnológica
Palabras clave:	Emotional speech recognition Pattern recognition Continuous tracking
Fecha de publicación:	2017
Publicación seriada:	Neurocomputing
Resumen:	The speech emotion recognition has a huge potential in human computer interaction applications in fields such as psychology, psychiatry and affective computing technology. The great majority of research works on speech emotion recognition have been made based on record repositories consisting of short sentences recorded under laboratory conditions. In this work, we researched the use of the Emotional Temperature strategy for continuous tracking in long-term samples of speech in which there are emotional changes during the speech. Emotional Temperature uses a few prosodic and paralinguistic features set obtained from a temporal segmentation of the speech signal. The simplicity and limitation of the set, previously validated under laboratory conditions, make it appropriate to be used under real conditions, where the spontaneous speech is continuous and the emotions are expressed in certain moments of the dialogue, given emotional turns. This strategy is robust, offers low computational cost, ability to detect emotional changes and improves the performance of a segmentation based on linguistic aspects. The German Corpus EMO-DB (Berlin Database of Emotional Speech), the English Corpus LDC (Emotional Prosody Speech and Transcripts database), the Polish Emotional Speech Database and RECOLA (Remote Collaborative and Affective Interactions) database are used to validate the system of continuous tracking from emotional speech. Two experimentation conditions are analyzed, dependence and independence on language and gender, using acted and spontaneous speech respectively. In acted conditions, the approach obtained accuracies of 67-97% while under spontaneous conditions, compared to annotation performed by human judges, accuracies of 41-50% were obtained. In comparison with previous studies in continuous emotion recognition, the approach improves the existing results with an accuracy of 9% higher on average. Therefore, this approach has a good performance with low complexity to develop real-time applications or continuous tracking emotional speech applications.
URI:	https://accedacris.ulpgc.es/handle/10553/35406
ISSN:	0925-2312
DOI:	10.1016/j.neucom.2016.06.093
Fuente:	Neurocomputing[ISSN 0925-2312],v. 255, p. 17-25
Colección:	Artículos

Vista completa

Citas SCOPUS^TM

Citas de WEB OF SCIENCE^TM
Citations

Visitas

Google Scholar^TM

Altmetric

Comparte

Exporta metadatos

Dirección

Contacto

Legal

De interés

Citas SCOPUSTM

Citas de WEB OF SCIENCETM Citations

Visitas

Google ScholarTM

Altmetric

Comparte

Exporta metadatos

Dirección

Citas SCOPUS^TM

Citas de WEB OF SCIENCE^TM
Citations

Google Scholar^TM