Please use this identifier to cite or link to this item:
http://hdl.handle.net/10553/42678
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Carreras-Riudavets, Francisco J. | en_US |
dc.contributor.author | Rodríguez-del-Pino, Juan Carlos | en_US |
dc.contributor.author | Hernández-Figueroa, Zenón | en_US |
dc.contributor.author | Rodríguez-Rodríguez, Gustavo | en_US |
dc.date.accessioned | 2018-11-21T10:38:40Z | - |
dc.date.available | 2018-11-21T10:38:40Z | - |
dc.date.issued | 2012 | en_US |
dc.identifier.isbn | 978-3-642-28603-2 | en_US |
dc.identifier.issn | 0302-9743 | en_US |
dc.identifier.uri | http://hdl.handle.net/10553/42678 | - |
dc.description.abstract | This paper presents a morphological analyzer for the Spanish language (MAHT). This system is mainly based on the storage of words and its morphological information, leading to a lexical knowledge base that has almost five million words. The lexical knowledge base practically covers the whole morphological casuistry of the Spanish language. However, the analyzer solves the processing of prefixes and of enclitic pronouns by easy rules, since the words that can include these elements are much and some of them are neologisms. MAHT reaches a processing average speed over 275,000 words per second. This one is possible because it uses hash tables in main memory. MAHT has been designed to isolate the data from the algorithms that analyze words, even with their irregular forms. This design is very important for an irregular and highly inflectional language, like Spanish, to simplify the insertion of new words and the maintenance of program code. | en_US |
dc.language | eng | en_US |
dc.relation | Caracterización Objetiva de la Dificultad General de Los Originales. | en_US |
dc.relation.ispartof | Lecture Notes in Computer Science | en_US |
dc.source | Gelbukh A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2012. Lecture Notes in Computer Science, vol 7181. Springer, Berlin, Heidelberg | en_US |
dc.subject | 570104 Lingüística informatizada | en_US |
dc.subject | 570503 Lexicografía | en_US |
dc.subject.other | Lingüística computacional | en_US |
dc.subject.other | Procesamiento de texto | en_US |
dc.subject.other | Español | en_US |
dc.subject.other | Computational linguistics | en_US |
dc.subject.other | Natural language processing systems | en_US |
dc.subject.other | Text processing | en_US |
dc.subject.other | Lematización | en_US |
dc.title | A morphological analyzer using hash tables in main memory (MAHT) and a lexical knowledge base | en_US |
dc.type | info:eu-repo/semantics/conferenceObject | es |
dc.type | ConferenceObject | es |
dc.relation.conference | 13th Annual Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2012 | |
dc.identifier.doi | 10.1007/978-3-642-28604-9_7 | |
dc.identifier.scopus | 84858316186 | - |
dc.contributor.authorscopusid | 14031088300 | - |
dc.contributor.authorscopusid | 55096895200 | - |
dc.contributor.authorscopusid | 57213994414 | |
dc.contributor.authorscopusid | 56630335700 | - |
dc.contributor.authorscopusid | 22735188900 | - |
dc.description.lastpage | 91 | - |
dc.description.firstpage | 80 | - |
dc.relation.volume | 7181 | - |
dc.investigacion | Ingeniería y Arquitectura | en_US |
dc.type2 | Actas de congresos | en_US |
dc.identifier.eisbn | 978-3-642-28604-9 | - |
dc.utils.revision | Sí | en_US |
dc.date.coverdate | Marzo 2012 | |
dc.identifier.conferenceid | events121429 | |
dc.identifier.ulpgc | Sí | es |
dc.description.sjr | 0,323 | |
dc.description.sjrq | Q3 | |
dc.description.ggs | 3 | |
item.grantfulltext | none | - |
item.fulltext | Sin texto completo | - |
crisitem.project.principalinvestigator | Muñoz Martín, Ricardo | - |
crisitem.event.eventsstartdate | 11-03-2012 | - |
crisitem.event.eventsenddate | 17-03-2012 | - |
crisitem.author.dept | GIR IATEXT: Cognition, linguistic, text and information processing | - |
crisitem.author.dept | IU de Análisis y Aplicaciones Textuales | - |
crisitem.author.dept | Departamento de Informática y Sistemas | - |
crisitem.author.dept | Departamento de Informática y Sistemas | - |
crisitem.author.dept | GIR IATEXT: Cognition, linguistic, text and information processing | - |
crisitem.author.dept | IU de Análisis y Aplicaciones Textuales | - |
crisitem.author.dept | Departamento de Informática y Sistemas | - |
crisitem.author.dept | GIR IATEXT: Cognition, linguistic, text and information processing | - |
crisitem.author.dept | IU de Análisis y Aplicaciones Textuales | - |
crisitem.author.orcid | 0000-0001-9221-664X | - |
crisitem.author.orcid | 0000-0001-7126-0406 | - |
crisitem.author.orcid | 0000-0002-1657-4020 | - |
crisitem.author.orcid | 0000-0001-6299-1813 | - |
crisitem.author.parentorg | IU de Análisis y Aplicaciones Textuales | - |
crisitem.author.parentorg | IU de Análisis y Aplicaciones Textuales | - |
crisitem.author.parentorg | IU de Análisis y Aplicaciones Textuales | - |
crisitem.author.fullName | Carreras Riudavets, Francisco Javier | - |
crisitem.author.fullName | Rodríguez Del Pino, Juan Carlos | - |
crisitem.author.fullName | Hernández Figueroa, Zenón José | - |
crisitem.author.fullName | Rodríguez Rodríguez,Gustavo | - |
Appears in Collections: | Actas de congresos |
Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.