Naive Bayes Classification Algorithm Based on Optimized Training Data

被引:5
作者
Zhu, Xiaodan [1 ]
Su, Jinsong [1 ]
Wu, Qingfeng [1 ]
Dong, Huailin [1 ]
机构
[1] Xiamen Univ, Software Sch, Xiamen, Peoples R China
来源
MECHATRONICS AND INTELLIGENT MATERIALS II, PTS 1-6 | 2012年 / 490-495卷
关键词
optimized training data; effectiveness; Naive Bayes;
D O I
10.4028/www.scientific.net/AMR.490-495.460
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Naive Bayes classification algorithm is an effective simple classification algorithm. Most researches in traditional Naive Bayes classification focus on the improvement of the classification algorithm, ignoring the selection of training data which has a great effect on the performance of classifier. And so a method is proposed to optimize the selection of training data in this paper. Adopting this method, the noisy instances in training data are eliminated by user-defined effectiveness threshold, improving the performance of classifier. Experimental results on large-scale data show that our approach significantly outperforms the baseline classifier.
引用
收藏
页码:460 / 464
页数:5
相关论文
共 8 条
  • [1] [程克非 CHENG Kefei], 2006, [计算机仿真, Computer Simulation], V23, P92
  • [2] DENG Wei-Bin, 2007, COMPUTER SCI, V34
  • [3] Harry Z, 2004, P ICDM 2004
  • [4] Langley P, P AAAI 1990
  • [5] Pankaj M, 1992, ARTIFICIAL NEURAL NE
  • [6] Qin ZC, 2006, ICMLA 2006: 5TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, P34
  • [7] Quinlan J. R., 1993, C4 5 PROGRAM MACHINE
  • [8] WANG Yuzhen, 2007, MICROCOMPUTER APPL, V28, p664~668