Identificador persistente para citar o vincular este elemento: https://accedacris.ulpgc.es/jspui/handle/10553/154925
Campo DC Valoridioma
dc.contributor.authorSalas Cáceres, José Ignacio-
dc.contributor.authorLorenzo Navarro, José Javier-
dc.contributor.authorCastrillón Santana, Modesto Fernando-
dc.date.accessioned2026-01-13T09:59:17Z-
dc.date.available2026-01-13T09:59:17Z-
dc.date.issued2026-
dc.identifier.isbn978-3-032-10191-4-
dc.identifier.issn0302-9743-
dc.identifier.otherScopus-
dc.identifier.urihttps://accedacris.ulpgc.es/jspui/handle/10553/154925-
dc.description.abstractHuman–machine interactions are becoming increasingly common in society, making it important to improve their user experience. In this regard, an accurate emotion recognition system could substantially benefit the experience. This work presents a novel framework for multimodal emotion recognition that performs fusion at multiple levels, feature and score, to effectively combine visual, audio, and textual information. Modality-specific embeddings are extracted using VGGFace for visual data, a Wav2Vec2-Large-Robust model for audio, and BERT for text. These representations are unified via three different feature-level fusion strategies: concatenation, Embrace, and cross-attention. A subsequent score-level fusion employs an adaptive weighted sum to produce the final class probabilities. On the four-emotion classification task of the IEMOCAP dataset, our approach achieves an unweighted accuracy of 73.53%, which represents solid results comparable with some state-of-the-art baselines and demonstrates the added value of visual cues. Our experiments also analyze the impact of fusion and pooling choices, providing insights for future multimodal systems.-
dc.languageeng-
dc.relation.ispartofLecture Notes In Computer Science-
dc.sourceLecture Notes in Computer Science[ISSN 0302-9743],v. 16168 LNCS, p. 536-547, (Enero 2026)-
dc.subject120304 Inteligencia artificial-
dc.subject.otherMultimodal data fusion-
dc.subject.otherEmotion recognition-
dc.subject.otherBiometry-
dc.subject.otherHuman-Machine Interaction-
dc.titleMultimodal Emotion Recognition via Multilevel Fusion of Visual, Audio, and Textual Data-
dc.typebook_content-
dc.relation.conference23rd International Conference on Image Analysis and Processing (ICIAP2025)-
dc.identifier.doi10.1007/978-3-032-10192-1_45-
dc.identifier.scopus105028364204-
dc.contributor.orcid0009-0004-7543-3385-
dc.contributor.orcid0000-0002-2834-2067-
dc.contributor.orcid0000-0002-8673-2725-
dc.contributor.authorscopusid58745737800-
dc.contributor.authorscopusid15042453800-
dc.contributor.authorscopusid57218418238-
dc.identifier.eissn1611-3349-
dc.description.lastpage547-
dc.description.firstpage536-
dc.relation.volume16168 LNCS-
dc.investigacionIngeniería y Arquitectura-
dc.type2Artículo-
dc.utils.revision-
dc.date.coverdateEnero 2026-
dc.identifier.conferenceidevents156154-
dc.identifier.ulpgc-
dc.contributor.buulpgcBU-INF-
dc.description.sjr0,352
dc.description.sjrqQ2
dc.description.miaricds10,0
item.grantfulltextnone-
item.fulltextSin texto completo-
crisitem.event.eventsstartdate14-05-2024-
crisitem.event.eventsenddate16-05-2024-
crisitem.author.deptGIR SIANI: Inteligencia Artificial, Robótica y Oceanografía Computacional-
crisitem.author.deptIU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería-
crisitem.author.deptGIR SIANI: Inteligencia Artificial, Robótica y Oceanografía Computacional-
crisitem.author.deptIU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería-
crisitem.author.deptDepartamento de Informática y Sistemas-
crisitem.author.deptGIR SIANI: Inteligencia Artificial, Robótica y Oceanografía Computacional-
crisitem.author.deptIU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería-
crisitem.author.deptDepartamento de Informática y Sistemas-
crisitem.author.orcid0009-0004-7543-3385-
crisitem.author.orcid0000-0002-2834-2067-
crisitem.author.orcid0000-0002-8673-2725-
crisitem.author.parentorgIU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería-
crisitem.author.parentorgIU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería-
crisitem.author.parentorgIU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería-
crisitem.author.fullNameSalas Cáceres, José Ignacio-
crisitem.author.fullNameLorenzo Navarro, José Javier-
crisitem.author.fullNameCastrillón Santana, Modesto Fernando-
Colección:Actas de congresos
Vista resumida

Google ScholarTM

Verifica

Altmetric


Comparte



Exporta metadatos



Los elementos en ULPGC accedaCRIS están protegidos por derechos de autor con todos los derechos reservados, a menos que se indique lo contrario.