Please use this identifier to cite or link to this item:
http://hdl.handle.net/10553/47784
Title: Self-organizing maps in the design of anti-spam filters: a proposal based on thematic categories
Authors: Cabrera-León, Ylermi; García Báez, Patricio; Suárez-Araujo, Carmen Paz
UNESCO Classification: 120304 Artificial intelligence
Keywords: Anti-spam; Artificial neural networks; Ham; Inverse category or class frequency; Self-organizing maps (SOMs); et al.
Issue Date: 2016
Conference: 8th International Joint Conference on Computational Intelligence, IJCCI 2016
Abstract: Spam, or unsolicited messages sent massively, is one of the threats that affects email and other media. Its high volume generates substantial time and economic losses. A solution to this problem is presented: a hybrid anti-spam filter based on unsupervised Artificial Neural Networks (ANNs). It consists of two steps, preprocessing and processing, both based on different computation models: programmed and neural (using Kohonen SOM). This system has been optimized using, as a data corpus, ham from "Enron Email" and spam from two different sources: traditional (user's inbox) and spamtrap-honeypot. It has been proved that thematic categories can be found both in spam and ham words. 1260 system configurations were analyzed, comparing their quality and performance with the most used metrics. All of them achieved AUC > 0.90 and the best 204 AUC > 0.95, despite just using 13 attributes for the input vectors of the SOM, one for each thematic category. Results were similar to other researchers' over the same corpus, though they make use of different Machine Learning (ML) methods and a number of attributes several orders of magnitude greater. It was further tested with datasets not utilized during design, obtaining 0.77 < AUC < 0.96 with normalized data.
URI: http://hdl.handle.net/10553/47784
ISBN: 978-989-758-201-1
DOI: 10.5220/0006041400210032
Source: IJCCI 2016 - Proceedings of the 8th International Joint Conference on Computational Intelligence, v. 3, p. 21-32
Appears in Collections: Actas de congresos (Conference proceedings)
Scopus citations: 7 (checked on Dec 1, 2024)
Web of Science citations: 3 (checked on Nov 24, 2024)
Page views: 176 (checked on Aug 10, 2024)
Items in accedaCRIS are protected by copyright, with all rights reserved, unless otherwise indicated.