SpamHunting:: An instance-based reasoning system for spam labelling and filtering

被引:39
作者
Fdez-Riverola, F.
Iglesias, E. L.
Diaz, F.
Mendez, J. R.
Corchado, J. M.
机构
[1] Univ Vigo, Escuela Super Ingn Informat, Dept Informat, Orense 32004, Spain
[2] Univ Valladolid, Dept Informat, Escuela Univ Informat, Segovia 40005, Spain
[3] Univ Salamanca, Dept Informat & Automat, E-37008 Salamanca, Spain
关键词
IBR system; automatic reasoning; anti-spam filtering; model comparison;
D O I
10.1016/j.dss.2006.11.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we show an instance-based reasoning e-mail filtering model that outperforms classical machine learning techniques and other successful lazy learners approaches in the domain of anti-spam filtering. The architecture of the learning-based anti-spain filter is based on a tuneable enhanced instance retrieval network able to accurately generalize e-mail representations. The reuse of similar messages is carried out by a simple unanimous voting mechanism to determine whether the target case is spam or not. Previous to the final response of the system, the revision stage is only performed when the assigned class is spam whereby the system employs general knowledge in the form of meta-rules. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:722 / 736
页数:15
相关论文
共 38 条
[1]  
AAMODT A, 1994, AI COMMUN, V7, P39
[2]  
Aha D., 1997, LAZY LEARNING
[3]  
Androutsopoulos I., 2000, P WORKSH MACH LEARN
[4]  
Androutsopoulos I, 20042 NCSR
[5]  
[Anonymous], 2005, Data Mining Pratical Machine Learning Tools and Techniques
[6]  
[Anonymous], 1997, APPL CASE BASED REAS
[7]  
[Anonymous], P ICCBR 03 WORKSH LO
[8]   Context-sensitive learning methods for text categorization [J].
Cohen, WW ;
Singer, Y .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1999, 17 (02) :141-173
[9]  
CORCHADO ES, 2004, J EXPT THEORETICAL A, V6, P145
[10]  
Cristiani N., 2000, INTRO SUPPORT VECTOR