Adaptive spam filtering using dynamic feature space

被引:0
|
作者
Zhou, Y [1 ]
Mulekar, MS [1 ]
Nerellapalli, P [1 ]
机构
[1] Univ S Alabama, Sch CIS, Mobile, AL 36688 USA
来源
ICTAI 2005: 17TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS | 2005年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsolicited bulk e-mail, also known as spam, has been an increasing problem for the e-mail society. This paper presents a new spam filtering strategy that 1) uses a practical entropy coding technique, Huffman coding, to dynamically encode the feature space of e-mail collections over time and, 2) applies an online algorithm to adaptively enhance the learned spam concept as new e-mail data becomes available. The contributions of this work include a highly efficient spam filtering algorithm in which the input space is radically reduced to a single-dimension input vector and an adaptive learning technique that is robust to vocabulary change, concept drifting and skewed data distribution. We compare our technique to several existing off-line learning techniques including Support Vector Machine, Naive Bayes, k-Nearest Neighbor C4.5 decision tree, RBFNetwork, Boosted decision tree and Stacking, and demonstrate the effectiveness of our technique by presenting the experimental results on the e-mail data that is publicly available.
引用
收藏
页码:302 / 309
页数:8
相关论文
共 50 条
  • [1] Adaptive spam filtering using dynamic feature spaces
    Zhou, Yan
    Mulekar, Madhuri S.
    Nerellapalli, Praveen
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2007, 16 (04) : 627 - 646
  • [2] Dynamic feature selection for spam filtering using Support Vector Machine
    Islam, Md. Rafiqul
    Zhou, Wanlei
    Choudhury, Morshed U.
    6TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, PROCEEDINGS, 2007, : 757 - +
  • [3] Adaptive filtering of SPAM
    Pelletier, L
    Almhana, J
    Choulakian, V
    SECOND ANNUAL CONFERENCE ON COMMUNICATION NETWORKS AND SERVICES RESEARCH, PROCEEDINGS, 2004, : 218 - 224
  • [4] Feature selection for spam filtering
    Menghour, Kamilia
    Souici-Meslati, Labiba
    CORIA 2010: Actes de la COnference en Recherche d'Information et Applications - Proceedings of the Conference on Information Retrieval and Applications, 2010, : 349 - 360
  • [5] Adaptive spam mail filtering using genetic algorithm
    Sanpakdee, U
    Walairacht, A
    Walairacht, S
    8th International Conference on Advanced Communication Technology, Vols 1-3: TOWARD THE ERA OF UBIQUITOUS NETWORKS AND SOCIETIES, 2006, : U441 - U445
  • [6] Efficient spam email filtering using adaptive ontology
    Youn, Seongwook
    McLeod, Dennis
    INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, : 249 - +
  • [7] Spam filtering using semantic similarity approach and adaptive BPNN
    Li, Cheng Hua
    Huang, Jimmy Xiangji
    NEUROCOMPUTING, 2012, 92 : 88 - 97
  • [8] Adaptive spam mail filtering system
    Lee, K
    Wyee, J
    Kim, W
    Kim, M
    ICOMP '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON INTERNET COMPUTING, 2005, : 402 - 408
  • [9] FEATURE SELECTION USING PARTICLE SWARM OPTIMIZATION WITH APPLICATION IN SPAM FILTERING
    Lai, Chih-Chin
    Wu, Chih-Hung
    Tsai, Ming-Chi
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (02): : 423 - 432
  • [10] Efficient Feature Set for Spam Email Filtering
    Varghese, Reshma
    Dhanya, K. A.
    2017 7TH IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2017, : 732 - 737