Identificador persistente para citar o vincular este elemento: https://accedacris.ulpgc.es/jspui/handle/10553/163430
Campo DC Valoridioma
dc.contributor.authorMedina Ramírez, Miguel Ángelen_US
dc.contributor.authorEstupiñán Ojeda, Cristian Daviden_US
dc.contributor.authorTorres Rodríguez, Victoriaen_US
dc.contributor.authorSánchez-Nielsen, Elenaen_US
dc.contributor.authorGuerra Artal, Cayetanoen_US
dc.contributor.authorHernández Tejera, Francisco Marioen_US
dc.date.accessioned2026-04-16T10:38:35Z-
dc.date.available2026-04-16T10:38:35Z-
dc.date.issued2026en_US
dc.identifier.issn2169-3536en_US
dc.identifier.urihttps://accedacris.ulpgc.es/jspui/handle/10553/163430-
dc.description.abstractParliamentary institutions generate extensive, domain-specific legislative documents, including normative texts and parliamentary debate transcripts. These documents differ in content and linguistic complexity, making automatic summarization essential for producing coherent summaries aligned with institutional standards. While large language models (LLMs) achieve high summarization quality, their computational requirements limit deployment in parliamentary and public-sector environments. In contrast, small language models (SLMs) offer a more resource-efficient alternative, but their capabilities and performance relative to LLMs, extractive methods, and other SLMs remain underexplored. In this work, we present the first comprehensive evaluation of SLMs for legislative summarization, assessing their effectiveness across document types and languages. We use two complementary datasets: EUR-LexSum, a multilingual corpus of normative texts covering six European languages, and ParcanDeb-Sum, a Spanish dataset of parliamentary debate records aligned with expert-written summaries. Summary quality is evaluated through a three-tier framework combining automatic metrics (ROUGE and BERTScore), LLMbased qualitative assessment, and expert-guided evaluation formalizing parliamentary debate summarization criteria. Our results show that: 1) instruction-tuned SLMs consistently outperform extractive baselines and, in several settings, rival LLMs with seven to eight billion parameters; 2) performance differs by document type, with fine-tuning being critical for debate transcripts, whereas instruction-tuning alone suffices for normative texts; and 3) for normative texts, SLMs establish a new benchmark for multilingual performance, while for parliamentary debates, fine-tuned SLMs achieve performance comparable to domain experts. These findings provide empirical evidence that high-quality legislative summarization can be achieved with SLMs, offering actionable guidance for selecting models that balance performance with computational constraints.en_US
dc.languageengen_US
dc.relation.ispartofIEEE Accessen_US
dc.subject33 Ciencias tecnológicasen_US
dc.subject.otherSmall language modelsen_US
dc.subject.otherlong document summarizationen_US
dc.subject.othernormative text summarizationen_US
dc.subject.otherparliamentary debate summarizationen_US
dc.subject.otherlegislative natural language processingen_US
dc.titleSmall Language Models for Legislative Summarization: An Empirical Evaluation of Performance and Suitabilityen_US
dc.typeArticleen_US
dc.identifier.doi10.1109/ACCESS.2026.3679718en_US
dc.investigacionIngeniería y Arquitecturaen_US
dc.utils.revisionen_US
dc.identifier.ulpgcen_US
dc.contributor.buulpgcBU-INFen_US
dc.description.sjr0,849
dc.description.jcr3,6
dc.description.sjrqQ1
dc.description.jcrqQ2
dc.description.scieSCIE
dc.description.miaricds10,4
item.grantfulltextopen-
item.fulltextCon texto completo-
crisitem.author.deptGIR SIANI: Inteligencia Artificial, Redes Neuronales, Aprendizaje Automático e Ingeniería de Datos-
crisitem.author.deptIU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería-
crisitem.author.deptGIR SIANI: Inteligencia Artificial, Redes Neuronales, Aprendizaje Automático e Ingeniería de Datos-
crisitem.author.deptIU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería-
crisitem.author.deptDepartamento de Informática y Sistemas-
crisitem.author.deptGIR SIANI: Inteligencia Artificial, Redes Neuronales, Aprendizaje Automático e Ingeniería de Datos-
crisitem.author.deptIU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería-
crisitem.author.deptDepartamento de Informática y Sistemas-
crisitem.author.deptGIR SIANI: Inteligencia Artificial, Redes Neuronales, Aprendizaje Automático e Ingeniería de Datos-
crisitem.author.deptIU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería-
crisitem.author.deptDepartamento de Informática y Sistemas-
crisitem.author.orcid0000-0001-6734-2257-
crisitem.author.orcid0000-0003-1381-2262-
crisitem.author.orcid0000-0001-9717-8048-
crisitem.author.parentorgIU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería-
crisitem.author.parentorgIU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería-
crisitem.author.parentorgIU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería-
crisitem.author.parentorgIU de Sistemas Inteligentes y Aplicaciones Numéricas en Ingeniería-
crisitem.author.fullNameMedina Ramírez, Miguel Ángel-
crisitem.author.fullNameEstupiñán Ojeda, Cristian David-
crisitem.author.fullNameTorres Rodríguez, Victoria-
crisitem.author.fullNameGuerra Artal, Cayetano-
crisitem.author.fullNameHernández Tejera, Francisco Mario-
Colección:Artículos
Adobe PDF (4,8 MB)
Vista resumida

Google ScholarTM

Verifica

Altmetric


Comparte



Exporta metadatos



Los elementos en ULPGC accedaCRIS están protegidos por derechos de autor con todos los derechos reservados, a menos que se indique lo contrario.