Identificador persistente para citar o vincular este elemento: http://hdl.handle.net/10553/42678
Título: A morphological analyzer using hash tables in main memory (MAHT) and a lexical knowledge base
Autores/as: Carreras-Riudavets, Francisco J. 
Rodríguez-del-Pino, Juan Carlos 
Hernández-Figueroa, Zenón 
Rodríguez-Rodríguez, Gustavo 
Clasificación UNESCO: 570104 Lingüística informatizada
570503 Lexicografía
Palabras clave: Lingüística computacional
Procesamiento de texto
Español
Computational linguistics
Natural language processing systems, et al.
Fecha de publicación: 2012
Proyectos: Caracterización Objetiva de la Dificultad General de Los Originales. 
Publicación seriada: Lecture Notes in Computer Science 
Conferencia: 13th Annual Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2012 
Resumen: This paper presents a morphological analyzer for the Spanish language (MAHT). This system is mainly based on the storage of words and its morphological information, leading to a lexical knowledge base that has almost five million words. The lexical knowledge base practically covers the whole morphological casuistry of the Spanish language. However, the analyzer solves the processing of prefixes and of enclitic pronouns by easy rules, since the words that can include these elements are much and some of them are neologisms. MAHT reaches a processing average speed over 275,000 words per second. This one is possible because it uses hash tables in main memory. MAHT has been designed to isolate the data from the algorithms that analyze words, even with their irregular forms. This design is very important for an irregular and highly inflectional language, like Spanish, to simplify the insertion of new words and the maintenance of program code.
URI: http://hdl.handle.net/10553/42678
ISBN: 978-3-642-28603-2
ISSN: 0302-9743
DOI: 10.1007/978-3-642-28604-9_7
Fuente: Gelbukh A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2012. Lecture Notes in Computer Science, vol 7181. Springer, Berlin, Heidelberg
Colección:Actas de congresos
Vista completa

Google ScholarTM

Verifica

Altmetric


Comparte



Exporta metadatos



Los elementos en ULPGC accedaCRIS están protegidos por derechos de autor con todos los derechos reservados, a menos que se indique lo contrario.