Please use this identifier to cite or link to this item: http://hdl.handle.net/10553/43859
Title: Text classification in natural language using Wikipedia
Authors: Quinteiro-González, Jose María 
Martel-Jordán, Ernestina 
Hernández-Morera, Pablo 
Ligero-Fleitas, Juan A.
López-Rodriguez, Aaron 
UNESCO Clasification: 3307 Tecnología electrónica
Issue Date: 2011
Publisher: 1646-9895
Journal: RISTI - Revista Iberica de Sistemas e Tecnologias de Informacao 
Abstract: Automatic Text Classifiers are needed in environments where the amount of data to handle is so high that human classification would be ineffective. In our study, the proposed classifier takes advantage of the Wikipedia to generate the corpus defining each category. The text is then analyzed syntactically using Natural Language Processing software. The proposed classifier is highly accurate and outperforms Machine Learning trained classifiers.
URI: http://hdl.handle.net/10553/43859
ISSN: 1646-9895
Source: RISTI - Revista Iberica de Sistemas e Tecnologias de Informacao[ISSN 1646-9895], p. 39-52
Appears in Collections:Artículos
Show full item record

Google ScholarTM

Check


Share



Export metadata



Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.