Chinese News Text Multi Classification Based on Naive Bayes Algorithm

被引:1
作者
Wang, Fei [1 ]
Deng, Xin [1 ]
Hou, Lunqing [1 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software, Shenzhen, Peoples R China
来源
ISCSIC'18: PROCEEDINGS OF THE 2ND INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROL | 2018年
关键词
Naive Bayes; text classification; multi classification; MODEL;
D O I
10.1145/3284557.3284704
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the development of Internet, there are more and more text data appear, the companies face the challenge to organize the content and the users feel confused about what is useful content for them. If the text data can be classified will make a contribution to solve the problem. It has been a long time, text classification work is done by human beings, like editors. So text classification become a hot topic in nature language processing field, especially for Chinese text classification. Sentiment classification just need to classify two classes, but there are more situations where we need to do multi classification. Such as the news editors have to give an article tags manually. There are several ways to solve the text classification problem: (1) Naive Bayes algorithm (2) support vector machine algorithm (3) neural network (4) k nearest neighbors (5) decision tree [1][2] [3] [4] [5]. Naive Bayes applies Bayes' theorem with strong(naive) independence assumptions between the features. This paper proposes to use Naive Bayes to finish a Chinese news text multi classification with nine classes.
引用
收藏
页数:5
相关论文
共 20 条
[1]  
[Anonymous], 2010, ARXIV10031795
[2]  
[Anonymous], 2011, J MACH LEARN TECHNOL
[3]  
[Anonymous], 1998, LEARNING TEXT CATEGO
[4]  
[Anonymous], 2008, P 3 INT JOINT C NAT
[5]  
Feng Z., 2006, P 5 WSEAS INT C APPL, P1010
[6]  
Guo GD, 2003, LECT NOTES COMPUT SC, V2888, P986
[7]   A decision-tree-based symbolic rule induction system for text categorization [J].
Johnson, DE ;
Oles, FJ ;
Zhang, T ;
Goetz, T .
IBM SYSTEMS JOURNAL, 2002, 41 (03) :428-437
[8]  
Korde V., 2012, INT J ARTIFICAL INTE, V3, P85, DOI [10.5121/ijaia.2012.3208, DOI 10.5121/IJAIA.2012.3208]
[9]  
Lai SW, 2015, AAAI CONF ARTIF INTE, P2267
[10]  
Miah Muhammed, 2009, Proceedings of the 2009 International Conference on Data Mining. DMIN 2009, P434