Catching the drift: Using feature-free case-based reasoning for spam filtering

被引:0
|
作者
Delany, Sarah Jane [1 ]
Bridge, Derek [2 ]
机构
[1] Dublin Inst Technol, Dublin, Ireland
[2] Univ Coll Cork, Cork, Ireland
来源
CASE-BASED REASONING RESEARCH AND DEVELOPMENT, PROCEEDINGS | 2007年 / 4626卷
关键词
TRACKING CONCEPT DRIFT; SELECTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we compare case-based spam filters, focusing on their resilience to concept drift. In particular, we evaluate how to track concept drift using a case-based spam filter that uses a feature-free distance measure based on text compression. In our experiments, we compare two ways to normalise such a distance measure, finding that the one proposed in [1] performs better. We show that a policy as simple as retaining misclassified examples has a hugely beneficial effect on handling concept drift in spam but, on its own, it results in the case base growing by over 30%. We then compare two different retention policies and two different forgetting policies (one a form of instance selection, the other a form of instance weighting) and find that they perform roughly as well as each other while keeping the case base size constant. Finally, we compare a feature-based textual case-based spam filter with our feature-free approach. In the face of concept drift, the feature-based approach requires the case base to be rebuilt periodically so that we can select a new feature set that better predicts the target concept. We find feature-free approaches to have lower error rates than their feature-based equivalents.
引用
收藏
页码:314 / +
页数:4
相关论文
共 48 条
  • [21] Case-based reasoning approach for supporting building green retrofit decisions
    Zhao, Xue
    Tan, Yongtao
    Shen, Liyin
    Zhang, Guomin
    Wang, Jinhuan
    BUILDING AND ENVIRONMENT, 2019, 160
  • [22] Intelligent green retrofitting of existing buildings based on case-based reasoning and random forest
    Liu, Tianyi
    Ma, Guofeng
    Wang, Ding
    Pan, Xinming
    AUTOMATION IN CONSTRUCTION, 2024, 162
  • [23] A hybrid approach of rough set and case-based reasoning to remanufacturing process planning
    Jiang, Zhigang
    Jiang, Ya
    Wang, Yan
    Zhang, Hua
    Cao, Huajun
    Tian, Guangdong
    JOURNAL OF INTELLIGENT MANUFACTURING, 2019, 30 (01) : 19 - 32
  • [24] Bankruptcy prediction modeling with hybrid case-based reasoning and genetic algorithms approach
    Ahn, Hyunchul
    Kim, Kyoung-Jae
    APPLIED SOFT COMPUTING, 2009, 9 (02) : 599 - 607
  • [25] Analysis of moral reasoning in dentistry students through case-based learning (CBL)
    Macpherson, Ignacio
    Arregui, Maria
    Marchini, Leonardo
    Giner-Tarrida, Luis
    JOURNAL OF DENTAL EDUCATION, 2022, 86 (04) : 416 - 424
  • [26] Learning Project Management Decisions: A Case Study with Case-Based Reasoning versus Data Farming
    Menzies, Tim
    Brady, Adam
    Keung, Jacky
    Hihn, Jairus
    Williams, Steven
    El-Rawas, Oussama
    Green, Phillip
    Boehm, Barry
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2013, 39 (12) : 1698 - 1713
  • [27] Achieving quality assurance functionality in the food industry using a hybrid case-based reasoning and fuzzy logic approach
    Lao, S. I.
    Choy, K. L.
    Ho, G. T. S.
    Yam, Richard C. M.
    Tsim, Y. C.
    Poon, T. C.
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (05) : 5251 - 5261
  • [28] An online optimization strategy for a fluid catalytic cracking process using a case-based reasoning method based on big data technology
    Ni, Peng
    Liu, Bin
    He, Ge
    RSC ADVANCES, 2021, 11 (46) : 28557 - 28564
  • [29] Extending Choosing-by-Advantages Decision-Making Model with Case-Based Reasoning
    Olde Scholtenhuis, Leon L.
    Naderpajouh, Nader
    Chen Goh, Kai
    Hayes, Jan
    Doree, Andre
    JOURNAL OF MANAGEMENT IN ENGINEERING, 2025, 41 (03)
  • [30] A case-based reasoning approach for solving schedule delay problems in prefabricated construction projects
    Xie, Linlin
    Wu, Sisi
    Chen, Yajiao
    Chang, Ruidong
    Chen, Xiaoyan
    AUTOMATION IN CONSTRUCTION, 2023, 154