Short Text Classification Based on Rule and Information Entropy

被引:0
作者
Jin, Hua [1 ]
Zhu, Yat-Tao [1 ]
Jin, Zhi-Qiang [1 ]
机构
[1] Agr Univ Hebei, Coll Informat Sci & Technol, Baoding, Hebei, Peoples R China
来源
PROCEEDINGS OF THE 2013 ASIA-PACIFIC COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY CONFERENCE | 2013年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the development of computer technology and the progress of the means of information dissemination, microblog, social networking and other information in the form of short text develop rapidly. Mining useful information and filter harmful information from short text become particularly important and urgent. In recent years, technology of classification and filter from lengthy document has made a considerable development, but for short text classification, traditional text classification techniques are no longer applicable due to the small number of words, short length of text and less information contained. In this paper, taking the topic classification of microblogs of Sina as application background, short text classification algorithm for rule-based information entropy is presented, as an effective algorithm to improve the effect of short text classification and meet the needs of it through experimental analysis.
引用
收藏
页码:193 / 199
页数:7
相关论文
共 8 条
[1]  
[Anonymous], 2010, PREDICTING FUTURE SO
[2]  
[Anonymous], P 5 ANN WORKSH COMP
[3]   Large-scale Bayesian logistic regression for text categorization [J].
Genkin, Alexander ;
Lewis, David D. ;
Madigan, David .
TECHNOMETRICS, 2007, 49 (03) :291-304
[4]  
QUINLAN JR, 1993, P 1993 EUR C MACH LE, P3
[5]  
Sakaki T., 2010, P 19 INT C WORLD WID, P851
[6]  
Shannon, 1948, BELL SYST TECH J, V27, p[379, 623]
[7]  
Sriram B., 2010, P ACM SIGIR 2010
[8]  
Yin XX, 2003, SIAM PROC S, P331