Identificador persistente para citar o vincular este elemento:
http://hdl.handle.net/10553/106914
Campo DC | Valor | idioma |
---|---|---|
dc.contributor.author | Estupiñán Ojeda, Cristian David | en_US |
dc.contributor.author | Guerra Artal, Cayetano | en_US |
dc.contributor.author | Hernández Tejera, Francisco Mario | en_US |
dc.date.accessioned | 2021-04-19T13:14:48Z | - |
dc.date.available | 2021-04-19T13:14:48Z | - |
dc.date.issued | 2021 | en_US |
dc.identifier.isbn | 978-989-758-484-8 | en_US |
dc.identifier.issn | 2184-433X | en_US |
dc.identifier.other | Scopus | - |
dc.identifier.uri | http://hdl.handle.net/10553/106914 | - |
dc.description.abstract | The use of architectures based on transformers presents a state of the art revolution in natural language processing (NLP). The employment of these architectures with high computational costs has increased in the last few months, despite the existing use of parallelization techniques. This is due to the high performance that is obtained by increasing the size of the learnable parameters for these kinds of architectures, while maintaining the models' predictability. This relates to the fact that it is difficult to do research with limited computational resources. A restrictive element is the memory usage, which seriously affects the replication of experiments. We are presenting a new architecture called Informer, which seeks to exploit the concept of information organization. For the sake of evaluation, we use a neural machine translation (NMT) dataset, the English-Vietnamese IWSLT15 dataset (Luong and Manning, 2015). In this paper, we also compare this proposal with architectures that reduce the computational cost to O(n · r), such as Linformer (Wang et al., 2020). In addition, we have managed to improve the SOTA of the BLEU score from 33.27 to 35.11. | en_US |
dc.language | eng | en_US |
dc.publisher | SciTePress Digital Library | en_US |
dc.relation.ispartof | ICAART (Setúbal) | en_US |
dc.source | ICAART 2021 - Proceedings of the 13th International Conference on Agents and Artificial Intelligence [ISSN 2184-433X] ,v. 2, p. 381-389, (Enero 2021) | en_US |
dc.subject | 120304 Inteligencia artificial | en_US |
dc.subject.other | Convolution | en_US |
dc.subject.other | Deep learning | en_US |
dc.subject.other | Informer | en_US |
dc.subject.other | Linear transformer | en_US |
dc.subject.other | Neural machine translation | en_US |
dc.subject.other | Organization | en_US |
dc.subject.other | Self attention | en_US |
dc.title | Informer, an information organization transformer architecture | en_US |
dc.type | info:eu-repo/semantics/conferenceObject | en_US |
dc.type | ConferenceObject | en_US |
dc.relation.conference | 13th International Conference on Agents and Artificial Intelligence (ICAART 2021) | en_US |
dc.identifier.doi | 10.5220/0010372703810389 | en_US |
dc.identifier.scopus | 85103812520 | - |
dc.contributor.authorscopusid | 57222726987 | - |
dc.contributor.authorscopusid | 57222721527 | - |
dc.contributor.authorscopusid | 57222720474 | - |
dc.description.lastpage | 389 | en_US |
dc.description.firstpage | 381 | en_US |
dc.relation.volume | 2 | en_US |
dc.investigacion | Ingeniería y Arquitectura | en_US |
dc.type2 | Actas de congresos | en_US |
dc.utils.revision | Sí | en_US |
dc.date.coverdate | Enero 2021 | en_US |
dc.identifier.conferenceid | events128729 | - |
dc.identifier.ulpgc | Sí | en_US |
dc.contributor.buulpgc | BU-INF | en_US |
item.grantfulltext | none | - |
item.fulltext | Sin texto completo | - |
crisitem.event.eventsstartdate | 12-09-2019 | - |
crisitem.event.eventsenddate | 14-09-2019 | - |
crisitem.author.dept | GIR SIANI: Inteligencia Artificial, Redes Neuronales, Aprendizaje Automático e Ingeniería de Datos | - |
crisitem.author.dept | IU Sistemas Inteligentes y Aplicaciones Numéricas | - |
crisitem.author.dept | Departamento de Informática y Sistemas | - |
crisitem.author.dept | GIR SIANI: Inteligencia Artificial, Redes Neuronales, Aprendizaje Automático e Ingeniería de Datos | - |
crisitem.author.dept | IU Sistemas Inteligentes y Aplicaciones Numéricas | - |
crisitem.author.dept | Departamento de Informática y Sistemas | - |
crisitem.author.dept | GIR SIANI: Inteligencia Artificial, Redes Neuronales, Aprendizaje Automático e Ingeniería de Datos | - |
crisitem.author.dept | IU Sistemas Inteligentes y Aplicaciones Numéricas | - |
crisitem.author.dept | Departamento de Informática y Sistemas | - |
crisitem.author.orcid | 0000-0003-1381-2262 | - |
crisitem.author.orcid | 0000-0001-9717-8048 | - |
crisitem.author.parentorg | IU Sistemas Inteligentes y Aplicaciones Numéricas | - |
crisitem.author.parentorg | IU Sistemas Inteligentes y Aplicaciones Numéricas | - |
crisitem.author.parentorg | IU Sistemas Inteligentes y Aplicaciones Numéricas | - |
crisitem.author.fullName | Estupiñán Ojeda, Cristian David | - |
crisitem.author.fullName | Guerra Artal, Cayetano | - |
crisitem.author.fullName | Hernández Tejera, Francisco Mario | - |
Colección: | Actas de congresos |
Citas SCOPUSTM
1
actualizado el 09-feb-2025
Citas de WEB OF SCIENCETM
Citations
1
actualizado el 02-feb-2025
Visitas
269
actualizado el 24-ago-2024
Google ScholarTM
Verifica
Altmetric
Comparte
Exporta metadatos
Los elementos en ULPGC accedaCRIS están protegidos por derechos de autor con todos los derechos reservados, a menos que se indique lo contrario.