How to use negative class information for Naive Bayes classification

被引:24
作者
Ko, Youngjoong [1 ]
机构
[1] Dong A Univ, Comp Engn, Busan 604714, South Korea
关键词
Naive Bayes classifier; Negative class information; Odds of class probabilities; Text classification; PROBABILITY RANKING PRINCIPLE; TEXT CATEGORIZATION;
D O I
10.1016/j.ipm.2017.07.005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Naive Bayes (NB) classifier is a popular classifier for text classification problems due to its simple, flexible framework and its reasonable performance. In this paper, we present how to effectively utilize negative class information to improve NB classification. As opposed to information retrieval, supervised learning based text classification already obtains class information, a negative class as well as a positive class, from a labeled training dataset. Since the negative class can also provide significant information to improve the NB classifier, the negative class information is applied to the NB classifier through two phases of indexing and class prediction tasks. As a result, the new classifier using the negative class information consistently performs better than the traditional multinomial NB classifier. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1255 / 1268
页数:14
相关论文
共 38 条
[1]  
Aggarwal C. C., 2012, MINING TEXT DATA
[2]  
[Anonymous], 1998, P AAAI 98 WORKSH LEA, DOI DOI 10.1109/TSMC.1985.6313426
[3]  
Bai J., 2004, P AS INF RETR S C
[4]  
Banerjee A, 2007, PROCEEDINGS OF THE SEVENTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, P431
[5]   Learning to construct knowledge bases from the World Wide Web [J].
Craven, M ;
DiPasquo, D ;
Freitag, D ;
McCallum, A ;
Mitchell, T ;
Nigam, K ;
Slattery, S .
ARTIFICIAL INTELLIGENCE, 2000, 118 (1-2) :69-113
[6]  
Debole F, 2004, STUD FUZZ SOFT COMP, V138, P81
[7]  
Dumais S., 1998, Proceedings of the 1998 ACM CIKM International Conference on Information and Knowledge Management, P148, DOI 10.1145/288627.288651
[8]   An overview of ensemble methods for binary classifiers in multi-class problems: Experimental study on one-vs-one and one-vs-all schemes [J].
Galar, Mikel ;
Fernandez, Alberto ;
Barrenechea, Edurne ;
Bustince, Humberto ;
Herrera, Francisco .
PATTERN RECOGNITION, 2011, 44 (08) :1761-1776
[9]  
Gliozzo Alfio., 2005, HLT 05, P129
[10]  
GORDON MD, 1991, J AM SOC INFORM SCI, V42, P703, DOI 10.1002/(SICI)1097-4571(199112)42:10<703::AID-ASI3>3.0.CO