Selective Bayesian Classifier Based on Semi-supervised Clustering

被引:0
作者
Cheng Yuhu [1 ]
Tong Yaoyao [1 ]
Wang Xuesong [1 ]
机构
[1] China Univ Min & Technol, Sch Informat & Elect Engn, Xuzhou 221116, Peoples R China
来源
CHINESE JOURNAL OF ELECTRONICS | 2012年 / 21卷 / 01期
基金
中国国家自然科学基金;
关键词
Minimal-redundancy-maximal-relevance; Clustering; Semi-supervised; Selective Bayesian classifier; Markov blanket;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Small scale of labeled samples results in incorrect of computation of mutual information, which may lower the classification accuracy of minimal-redundancy-maximal-relevance (mRMR) selective Bayesian classifiers. In order to solve the above problem, a kind of selective Bayesian classifier based on semi-supervised clustering algorithm is proposed. At first, a new semi-supervised K-representative clustering algorithm is designed by using the Bayesian posterior probability, which is applied to labeling the unlabeled samples so as to enlarge the scale of labeled samples. Then a novel feature selection criterion is proposed by combining the mRMR and the concept of Markov blanket to automatically determine a reasonably compact subset of features. In addition, a risk-regulation factor is introduced into the feature selection criterion to reduce the risk of mislabeling. At last, a Bayesian classifier is constructed based on the preprocessed samples. Experimental results indicate that the proposed Bayesian classifier can select optimal features to obtain high classification accuracy.
引用
收藏
页码:73 / 77
页数:5
相关论文
共 12 条
  • [1] Abraham R., 2007, Proceedings of the 10th International Conference on Information Technology (ICIT'07), P44, DOI [10.1109/ICIT.2007.41, DOI 10.1109/ICIT.2007.41, 10.1109/ICIT.2007, DOI 10.1109/ICIT.2007]
  • [2] [Anonymous], 2006, ESANN
  • [3] Bayesian network classifiers[J]. Friedman, N;Geiger, D;Goldszmidt, M. MACHINE LEARNING, 1997(2-3)
  • [4] Hong C.S., 2009, STAT PAP, V52, P17
  • [5] Lin Zhi-qing, 2002, Acta Electronica Sinica, V30, P1804
  • [6] Ohn Mar San, 2004, International Journal of Applied Mathematics and Computer Science, V14, P241
  • [7] Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy[J]. Peng, HC;Long, FH;Ding, C. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005(08)
  • [8] Clustering-based Feature Selection in Semi-supervised Problems[J]. Quinzan, Ianisse;Sotoca, Jose M.;Pla, Filiberto. 2009 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2009
  • [9] Ren JT, 2008, LECT NOTES ARTIF INT, V5012, P970
  • [10] Wang Na, 2010, Acta Electronica Sinica, V38, P172