Persistent identifier to cite or link to this item:
http://hdl.handle.net/10553/55746
Title: | New approach in quantification of emotional intensity from the speech signal: emotional temperature |
Authors: | Alonso Hernández, Jesús Bernardino; Cabrera, Josué; Medina Molina, Manuel Martín; Travieso, Carlos M. |
UNESCO classification: | 3307 Electronic technology |
Keywords: | Emotional speech recognition; Pattern recognition; Emotional intensity; Emotional temperature |
Publication date: | 2015 |
Journal: | Expert Systems with Applications |
Abstract: | Automatic speech emotion recognition has great potential in fields such as psychology, psychiatry and affective computing. Spontaneous speech is continuous, and emotions are expressed at certain moments of the dialogue, giving rise to emotional turns. Real-time applications must therefore be able to detect changes in the speaker's affective state. In this paper, we focus on recognizing activation from speech using a small feature set obtained from a temporal segmentation of the speech signal in different languages: German, English and Polish. The feature set includes two prosodic features and four paralinguistic features related to pitch and spectral energy balance. This segmentation and feature set are suitable for real-time emotion applications because they allow changes in the emotional state to be detected with very low processing times. The German corpus EMO-DB (Berlin Database of Emotional Speech), the English corpus LDC (Emotional Prosody Speech and Transcripts database) and the Polish Emotional Speech Database are used to train the Support Vector Machine (SVM) classifier and to perform gender-dependent activation recognition. The results are analyzed separately for each speech emotion in the gender-dependent setting, yielding accuracies of 94.9%, 88.32% and 90% for the EMO-DB, LDC and Polish databases, respectively. This new approach offers performance comparable to other approaches with lower complexity for real-time applications, making it an appealing alternative that may assist the future development of automatic speech emotion recognition systems with continuous tracking. |
URI: | http://hdl.handle.net/10553/55746 |
ISSN: | 0957-4174 |
DOI: | 10.1016/j.eswa.2015.07.062 |
Source: | Expert Systems with Applications [ISSN 0957-4174], v. 42 (24), p. 9554-9564 |
Collection: | Articles |
Items in ULPGC accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.