A comparative study on feature selection and adaptive strategies for email foldering using the ABC-DynF framework

Cited by: 4
Authors
Carmona-Cejudo, Jose M. [1]
Castillo, Gladys [3]
Baena-Garcia, Manuel [2]
Morales-Bueno, Rafael [1]
Affiliations
[1] Univ Malaga, Dept Comp Sci, E-29071 Malaga, Spain
[2] Clin Rincon, Malaga, Spain
[3] Univ Aveiro, Dept Math, P-3810193 Aveiro, Portugal
Keywords
Email foldering; Adaptive systems; Text mining; Feature selection; Feature extraction
DOI
10.1016/j.knosys.2013.03.006
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Email foldering is a challenging problem mainly due to its high dimensionality and dynamic nature. This work presents ABC-DynF, an adaptive learning framework with dynamic feature space that we use to compare several incremental and adaptive strategies to cope with these two difficulties. Several studies have been carried out using datasets from the ENRON email corpus and different configuration settings of the framework. The main aim is to study how feature ranking methods, concept drift monitoring, adaptive strategies and the implementation of a dynamic feature space can affect the performance of Bayesian email classification systems. © 2013 Elsevier B.V. All rights reserved.
Pages: 81-94
Page count: 14