Data mining the PIMA dataset using rough set theory with a special emphasis on rule reduction

被引:0
|
作者
Khan, A [1 ]
Revett, K [1 ]
机构
[1] Univ Luton, CIS Dept, Luton, Beds, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes how rough set theory can be utilized as a tool for analyzing relatively complex decision tables like the Pima Indian Diabetes Database (PIDD). We utilized Rosetta, a public domain implementation of rough sets on the PIDD in order to determine how we could generate a predictive rule set with the highest accuracy and the fewest number of rules. Having a reduced rule set is advantageous as it provides focus on the salient attributes and makes application in clinical practice more efficient (and likely). In this paper, we report the use of a genetic algorithm based rough set approach to classification of diabetes and achieved a success rate on the test set of 83%. This classification accuracy favors highly compared to other, reported results, which ranged from 65% to 75%. In addition, we were able to achieve this accuracy with less than 100 rules. The high accuracy and low rule number provides support to the use of rough sets as a datamining tool in biological databases.
引用
收藏
页码:334 / 339
页数:6
相关论文
共 50 条
  • [1] The Dynamic Data Reduction and Association Rule Parallel Mining Based on Rough Set
    He Youquan
    Wang Lijun
    2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 2803 - 2806
  • [2] Data mining and rough set theory
    Grzymala-Busse, JW
    Ziarko, W
    COMMUNICATIONS OF THE ACM, 2000, 43 (04) : 108 - 109
  • [3] An Knowledge Reduction Algorithms in Data Mining Based on Rough Set Theory
    Liu Tieying
    Jia Ru
    Ye Jianchun
    ADVANCES IN MANAGEMENT OF TECHNOLOGY, PT 1, 2009, : 562 - +
  • [4] Data mining an EEG dataset with an emphasis on dimensionality reduction
    Jahankhani, Pari
    Revett, Kenneth
    Kodogiannis, Vassilis
    2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 405 - 412
  • [5] Rough set theory with applications to data mining
    Grzymala-Busse, JW
    REAL WORLD APPLICATIONS OF COMPUTATIONAL INTELLIGENCE, 2005, 179 : 223 - 244
  • [6] Using data mining techniques and rough set theory for language modeling
    Chen, Yong
    Chan, Kwok-Ping
    ACM Transactions on Asian Language Information Processing, 2007, 6 (01):
  • [7] Improving intrusion detection performance using rough set theory and association rule mining
    Wang Xuren
    He Famei
    2006 INTERNATIONAL CONFERENCE ON HYBRID INFORMATION TECHNOLOGY, VOL 2, PROCEEDINGS, 2006, : 114 - +
  • [8] Multi-level rough set reduction for decision rule mining
    Ye, Mingquan
    Wu, Xindong
    Hu, Xuegang
    Hu, Donghui
    APPLIED INTELLIGENCE, 2013, 39 (03) : 642 - 658
  • [9] Multi-level rough set reduction for decision rule mining
    Mingquan Ye
    Xindong Wu
    Xuegang Hu
    Donghui Hu
    Applied Intelligence, 2013, 39 : 642 - 658
  • [10] Variable precision rough set model based dataset partition and association rule mining
    Wang, QD
    Wang, XJ
    Wang, XP
    2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 2175 - 2179