Improving text classification with weighted word embeddings via a multi-channel TextCNN model

被引:145
作者
Guo, Bao [1 ]
Zhang, Chunxia [1 ]
Liu, Junmin [1 ]
Ma, Xiaoyi [2 ]
机构
[1] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Shaanxi, Peoples R China
[2] Univ Colorado, Sch Art & Sci, Boulder, CO 80310 USA
基金
中国国家自然科学基金;
关键词
Text classification; Term weighting; Word embedding; Convolutional neural network; Term frequency-inverse document frequency (TF-IDF); NEURAL-NETWORKS;
D O I
10.1016/j.neucom.2019.07.052
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, convolutional neural networks (CNNs) have gained considerable attention in text classification because of the remarkable good performance they achieved in various situations. The usual practice is to first perform word embedding (i.e., mapping each word into a word vector), and then employ a CNN to perform classification. To improve classification accuracy, term weighting approaches have been proven to be quite effective. But to the best of our knowledge, almost all these methods assign only one weight to each term (word). Considering the fact that one term generally has different importance in documents with different class labels, we propose in this paper a novel term weighting scheme to be combined with word embeddings to enhance the classification performance of CNNs. In the novel method, multiple weights are assigned to each term and these weights are applied to the word embeddings of the words separately. Subsequently, the transformed features are fed into a multi-channel CNN model to predict the label of the sentence. By comparing the novel method with several other baseline methods with five benchmark data sets, the results manifest that the classification accuracy of the proposed method exceeds that of other methods by an amazing margin. Moreover, the weights assigned by different weighting schemes are also analyzed to get more insights of their working mechanism. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:366 / 374
页数:9
相关论文
共 33 条
[1]  
Bengio Y, 2001, ADV NEUR IN, V13, P932
[2]   Improving sentiment analysis via sentence type classification using BiLSTM-CRF and CNN [J].
Chen, Tao ;
Xu, Ruifeng ;
He, Yulan ;
Wang, Xuan .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 72 :221-230
[3]  
Cho K., 2014, ARXIV140610783V3
[4]  
Collobert R, 2011, J MACH LEARN RES, V12, P2493
[5]  
Conneau A, 2017, 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, P1107
[6]  
Debole F, 2004, STUD FUZZ SOFT COMP, V138, P81
[7]  
Devlin J., 2018, C N AM CHAPT ASS COM
[8]  
Harris D., 2010, Digital design and computer architecture
[9]   Detection of review spam: A survey [J].
Heydari, Atefeh ;
Tavakoli, Mohammad Ali ;
Salim, Naomie ;
Heydari, Zahra .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (07) :3634-3642
[10]  
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]