Please use this identifier to cite or link to this item:
http://hdl.handle.net/10553/43859
Title: | Text classification in natural language using Wikipedia | Authors: | Quinteiro-González, Jose María Martel-Jordán, Ernestina Hernández-Morera, Pablo Ligero-Fleitas, Juan A. López-Rodriguez, Aaron |
UNESCO Clasification: | 3307 Tecnología electrónica | Issue Date: | 2011 | Publisher: | 1646-9895 | Journal: | RISTI - Revista Iberica de Sistemas e Tecnologias de Informacao | Abstract: | Automatic Text Classifiers are needed in environments where the amount of data to handle is so high that human classification would be ineffective. In our study, the proposed classifier takes advantage of the Wikipedia to generate the corpus defining each category. The text is then analyzed syntactically using Natural Language Processing software. The proposed classifier is highly accurate and outperforms Machine Learning trained classifiers. | URI: | http://hdl.handle.net/10553/43859 | ISSN: | 1646-9895 | Source: | RISTI - Revista Iberica de Sistemas e Tecnologias de Informacao[ISSN 1646-9895], p. 39-52 |
Appears in Collections: | Artículos |
Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.