Content-based concept drift detection for Email spam filtering

被引:9
|
作者
Zi Hayat M. [1 ]
Basiri J. [1 ]
Seyedhossein L. [1 ]
Shakery A. [1 ]
机构
[1] School of Electrical and Computer Engineering, University of Tehran, Tehran
关键词
Concept drift; KL divergence; Language model; Spam filtering;
D O I
10.1109/ISTEL.2010.5734082
中图分类号
学科分类号
摘要
The continued growth of Email usage, which is naturally followed by an increase in unsolicited emails so called spams, motivates research in spam filtering area. In the context of spam filtering systems, addressing the evolving nature of spams, which leads to obsolete the related models, has been always a challenge. In this paper an adaptive spam filtering system based on language model is proposed which can detect concept drift based on computing the deviation in email contents distribution. The proposed method can be used along with any existing classifier; particularly in this paper we use Naïve Bayes method as classifier. The proposed method has been evaluated with Enron data set. The results indicate the efficiency of the method in detecting concept drift and its superiority over Naïve Bayes classifier in terms of accuracy. © 2010 IEEE.
引用
收藏
页码:531 / 536
页数:5
相关论文
共 50 条
  • [21] Adaptive email spam filtering based on information theory
    Zhang, Xin
    Dai, Wenyuan
    Xue, Gui-Rong
    Yu, Yong
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2007, PROCEEDINGS, 2007, 4831 : 159 - 170
  • [22] A Collaborative Abstraction Based Email Spam Filtering with Fingerprints
    P. Rajendran
    A. Tamilarasi
    R. Mynavathi
    Wireless Personal Communications, 2022, 123 : 1913 - 1923
  • [23] Using Metadata in Detection Spam Email with Pornography Content
    Wardani, Dewi
    Siwi, Retisa
    Harjito, Bambang
    Marshallia, Maysa
    2018 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND COMPUTER SCIENCE (ICECOS), 2018, : 213 - 217
  • [24] Content-Based Email Classification at Scale
    Early, Kirstin
    O'Hare, Neil
    LuVogt, Christopher
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 4559 - 4566
  • [25] Selected methods of spam filtering in email
    Miszalska, Izabella
    Zabierowski, Wojciech
    Napieralski, Andrzej
    2007 PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON THE EXPERIENCE OF DESIGNING AND APPLICATION OF CAD SYSTEMS IN MICROELECTRONICS, 2007, : 507 - 513
  • [26] Improving Email Spam Detection Using Content Based Feature Engineering Approach
    Hijawi, Wadi'
    Faris, Hossam
    Alqatawna, Ja'far
    Al-Zoubi, Ala' M.
    Aljarah, Ibrahim
    2017 IEEE JORDAN CONFERENCE ON APPLIED ELECTRICAL ENGINEERING AND COMPUTING TECHNOLOGIES (AEECT), 2017,
  • [27] Filtering and email pricing as solutions to spam
    Eaton, B. Curtis
    MacDonald, Ian A.
    Meriluoto, Laura
    CANADIAN JOURNAL OF ECONOMICS-REVUE CANADIENNE D ECONOMIQUE, 2013, 46 (03): : 881 - 899
  • [28] An email geographic path-based technique for spam filtering
    Jiang, Yu
    Zhang, Ni
    Fang, Binxing
    CIS: 2007 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PROCEEDINGS, 2007, : 750 - +
  • [29] A survey of learning-based techniques of email spam filtering
    Blanzieri, Enrico
    Bryl, Anton
    ARTIFICIAL INTELLIGENCE REVIEW, 2008, 29 (01) : 63 - 92
  • [30] A survey of learning-based techniques of email spam filtering
    Enrico Blanzieri
    Anton Bryl
    Artificial Intelligence Review, 2008, 29 : 63 - 92