Research on Application of a Naive Bayes Algorithm Based on the Rough Set Approach

被引:0
作者
Xu, Xu-Dong [1 ]
Kong, Ling-Tao [1 ]
Wang, Qun [1 ]
机构
[1] Beijing Univ Technol, Coll Comp Sci, Beijing, Peoples R China
来源
INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMMUNICATION ENGINEERING (CSCE 2015) | 2015年
关键词
Rough set; Naive bayes; Medical case classification; System attribute significance;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A Naive Bayes algorithm based on the rough set approach is proposed in this manuscriptand successfully applied in the hospital medical caseclassification. The core of the algorithm is to filter the redundant data and parameterize the key data in the medical cases through the approaches ofattribute reduction and system attribute significance in the rough set theory, and calculate the weights in order to achieve more satisfactory accuracyof medical case classification through the Naive Bayes algorithm. The specific steps are: firstly removing the redundant phrases from the medical cases by attribute reduction of the rough set, then calculating the weight of each attribute based onsystem attribute significance, and finally constructing the Naive Bayes algorithm model based on the weighted system attributes. Experiments have shown that the above rough set approach-based Naive Bayes algorithm model is a better classification model that achieves higher classification accuracy than other Naive Bayes algorithm models.
引用
收藏
页码:57 / 64
页数:8
相关论文
共 8 条
  • [1] [程克非 CHENG Kefei], 2006, [计算机仿真, Computer Simulation], V23, P92
  • [2] Han Duoqian, 2008, ROUGH SETS THEORY AL
  • [3] Han J.W., 2007, data mining concepts and technology
  • [4] KONONENKO I, 1991, LECT NOTES ARTIF INT, V482, P206, DOI 10.1007/BFb0017015
  • [5] Li Huaxiong, 6 INT C ROUGH SETS K, P466
  • [6] Pawlak Z, 2003, INT J INTELLIGENT SY, V18
  • [7] Wang Guocai, 2010, RES APPL NAIVE BAYES
  • [8] Learning weighted naive Bayes with accurate ranking
    Zhang, H
    Sheng, SL
    [J]. FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 567 - 570