Catching the drift: Using feature-free case-based reasoning for spam filtering

Cited by: 0
Authors
Delany, Sarah Jane [1]
Bridge, Derek [2]
Affiliations
[1] Dublin Inst Technol, Dublin, Ireland
[2] Univ Coll Cork, Cork, Ireland
Source
CASE-BASED REASONING RESEARCH AND DEVELOPMENT, PROCEEDINGS | 2007 / Vol. 4626
Keywords
TRACKING CONCEPT DRIFT; SELECTION;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we compare case-based spam filters, focusing on their resilience to concept drift. In particular, we evaluate how to track concept drift using a case-based spam filter that uses a feature-free distance measure based on text compression. In our experiments, we compare two ways to normalise such a distance measure, finding that the one proposed in [1] performs better. We show that a policy as simple as retaining misclassified examples has a hugely beneficial effect on handling concept drift in spam but, on its own, it results in the case base growing by over 30%. We then compare two different retention policies and two different forgetting policies (one a form of instance selection, the other a form of instance weighting) and find that they perform roughly as well as each other while keeping the case base size constant. Finally, we compare a feature-based textual case-based spam filter with our feature-free approach. In the face of concept drift, the feature-based approach requires the case base to be rebuilt periodically so that we can select a new feature set that better predicts the target concept. We find feature-free approaches to have lower error rates than their feature-based equivalents.
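The ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes `zlib` as the compressor, and it uses two commonly cited normalisations of compression distance, the normalised compression distance (NCD) and the compression-based dissimilarity measure (CDM). This record does not say which normalisations the paper compared, nor which one [1] refers to. The function names, the majority-vote k-NN classifier, and the `classify_and_retain` helper are hypothetical and chosen only for illustration.

```python
import zlib
from collections import Counter


def csize(text: str) -> int:
    """Compressed size in bytes. zlib is an assumed stand-in compressor."""
    return len(zlib.compress(text.encode("utf-8"), 9))


def ncd(x: str, y: str) -> float:
    """Normalised compression distance: (C(xy) - min(C(x), C(y))) / max(C(x), C(y))."""
    cx, cy = csize(x), csize(y)
    return (csize(x + y) - min(cx, cy)) / max(cx, cy)


def cdm(x: str, y: str) -> float:
    """Compression-based dissimilarity measure: C(xy) / (C(x) + C(y))."""
    return csize(x + y) / (csize(x) + csize(y))


def classify(message, case_base, k=1, distance=cdm):
    """k-NN over raw message text; no feature extraction is involved."""
    nearest = sorted(case_base, key=lambda case: distance(message, case[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def classify_and_retain(message, true_label, case_base, **kwargs):
    """Classify, then retain the message in the case base only if it was misclassified."""
    predicted = classify(message, case_base, **kwargs)
    if predicted != true_label:
        case_base.append((message, true_label))
    return predicted
```

Because the distance is computed on the raw text, there is no feature set to reselect when the concept drifts, which is the contrast with the feature-based filter that the abstract draws. Used alone, the retention step only ever grows the case base; a forgetting policy of the kind the abstract mentions would be needed to hold its size constant and is not shown here.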
Pages: 314 / +
Page count: 4