A rough set-based case-based reasoner for text categorization

被引:34
|
作者
Li, Y
Shiu, SCK [1 ]
Pal, SK
Liu, JNK
机构
[1] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
[2] Indian Stat Inst, Machine Intelligence Unit, Kolkata 700035, W Bengal, India
关键词
text categorization (TC); case-based reasoning (CBR); rough set; case coverage; case reachability;
D O I
10.1016/j.ijar.2005.06.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel rough set-based case-based reasoner For Use in text categorization (TC). The reasoner has four main components: feature term extractor, document representor, case selector, and case retriever. It operates by first reducing the number of feature terms in the documents Using the rough set technique. Then, the number of documents is reduced using a new document selection approach based on the case-based reasoning (CBR) concepts of coverage and reachability. As a result, both the number of feature terms and documents are reduced with only minimal loss of information. Finally, this smaller set of documents with fewer feature terms is Used in TC. The proposed rough set-based case-based reasoner wits tested on the Reuters21578 text datasets. The experimental results demonstrate its effectiveness and efficiency as it significantly reduced feature terms and documents, important for improving the efficiency of TC, while preserving and even improving classification accuracy. (C) 2005 Elsevier Inc. All rights reserved.
引用
收藏
页码:229 / 255
页数:27
相关论文
共 50 条
  • [21] Dominance-based rough set approach to case-based reasoning
    Greco, S
    Matarazzo, B
    Slowinski, R
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, 2006, 3885 : 7 - 18
  • [22] A MODIFIED CASE-BASED REASONING METHOD BASED ON THE ROUGH SET THEORY
    Kovalenko, I. I.
    Shved, A., V
    Koval, N., V
    RADIO ELECTRONICS COMPUTER SCIENCE CONTROL, 2018, (04) : 106 - 112
  • [23] On the Definability of a Set and Rough Set-Based Rule Generation
    Sakai, Hiroshi
    Wu, Mao
    Yamaguchi, Naoto
    2014 IIAI 3RD INTERNATIONAL CONFERENCE ON ADVANCED APPLIED INFORMATICS (IIAI-AAI 2014), 2014, : 122 - 125
  • [24] An Algorithm of Text Categorization Based on Similar Rough Set and Fuzzy Cognitive Map
    Zhou, Xin
    Zhang, Huaxiang
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 3, PROCEEDINGS, 2008, : 127 - 131
  • [25] Dimensions of Case-Based Reasoner Quality Management
    Bierer, Annett
    Hofmann, Marcus
    CASE-BASED REASONING RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2009, 5650 : 90 - 104
  • [26] A hybrid case-based reasoner for footwear design
    Main, J
    Dillon, TS
    CASE-BASED REASONING RESEARCH AND DEVELOPMENT, 1999, 1650 : 497 - 509
  • [27] Smart case-based indexing in worsted roving process: Combination of rough set and case-based reasoning
    Liu, Gui
    Yu, Weidong
    APPLIED MATHEMATICS AND COMPUTATION, 2009, 214 (01) : 280 - 286
  • [28] The Rough Set-Based Algorithm for Two Steps
    Liao, Shu-Hsien
    Chen, Yin-Ju
    Ho, Shiu-Hwei
    NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 63 - +
  • [29] Rough set-based ANFIS control strategies
    Li, TY
    Zhang, CM
    ISTM/2005: 6TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-9, CONFERENCE PROCEEDINGS, 2005, : 7519 - 7521
  • [30] Rough set-based feature selection method
    Zhan, YM
    Zeng, XY
    Sun, JC
    PROGRESS IN NATURAL SCIENCE-MATERIALS INTERNATIONAL, 2005, 15 (03) : 280 - 284