Text Classification Based on Word2vec and Convolutional Neural Network

被引:5
作者
Li, Lin [1 ]
Xiao, Linlong [1 ]
Jin, Wenzhen [1 ]
Zhu, Hong [1 ]
Yang, Guocai [1 ]
机构
[1] Southwest Univ, Sch Comp & Informat Sci, Chongqing, Peoples R China
来源
NEURAL INFORMATION PROCESSING (ICONIP 2018), PT V | 2018年 / 11305卷
关键词
Text classification; Text representation; Word2vec; Convolutional neural network;
D O I
10.1007/978-3-030-04221-9_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text representations in text classification usually have high dimensionality and are lack of semantics, resulting in poor classification effect. In this paper, TF-IDF is optimized by using optimization factors, then word2vec with semantic information is weighted, and the single-text representation model CD_STR is obtained. Based on the CD_STR model, the latent semantic index (LSI) and the TF-IDF weighted vector space model (T_VSM) are merged to obtain a fusion model, CD_MTR, which is more efficient. The text classification method MTR_MCNN of the fusion model CD_MTR combined with convolutional neural network is further proposed. This method first designs convolution kernels of different sizes and numbers, allowing them to extract text features from different aspects. Then the text vectors trained by the CD_MTR model are used as the input to the improved convolutional neural network. Tests on two datasets have verified that the performance of the two models, CD_STR and CD_MTR, is superior to other comparable textual representation models. The classification effect of MTR_MCNN method is better than that of other comparison methods, and the classification accuracy is higher than that of CD_MTR model.
引用
收藏
页码:450 / 460
页数:11
相关论文
共 50 条
  • [31] Improving convolutional neural network for text classification by recursive data pruning
    Li, Qi
    Li, Pengfei
    Mao, Kezhi
    Lo, Edmond Yat-Man
    NEUROCOMPUTING, 2020, 414 : 143 - 152
  • [32] Word2vec based deep learning network for DNA N4-methylcytosine sites identification
    Fang, Guanyun
    Zeng, Feng
    Li, Xingcun
    Yao, Lan
    2020 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS (IIKI2020), 2021, 187 : 270 - 277
  • [33] Text Classification of Public Feedbacks using Convolutional Neural Network Based on Differential Evolution Algorithm
    Zhang, S.
    Chen, Y.
    Huang, X. L.
    Cai, Y. S.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2019, 14 (01) : 124 - 134
  • [34] Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification
    Wang, Peng
    Xu, Bo
    Xu, Jiaming
    Tian, Guanhua
    Liu, Cheng-Lin
    Hao, Hongwei
    NEUROCOMPUTING, 2016, 174 : 806 - 814
  • [35] Automatic text classification algorithm based on Gauss improved convolutional neural network
    Du, Jian-hai
    JOURNAL OF COMPUTATIONAL SCIENCE, 2017, 21 : 195 - 200
  • [36] Covariance Matrix Adaptation Evolution Strategy for Convolutional Neural Network in Text Classification
    Toledano-Lopez, Orlando Grabiel
    Madera, Julio
    Gonzalez, Hector
    Simon Cuevas, Alfredo
    PROGRESS IN ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION, 2021, 13055 : 69 - 78
  • [37] A Framework for Text Classification Using Evolutionary Contiguous Convolutional Neural Network and Swarm Based Deep Neural Network
    Prabhakar, Sunil Kumar
    Rajaguru, Harikumar
    So, Kwangsub
    Won, Dong-Ok
    FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2022, 16
  • [38] APPLICATION OF CONVOLUTIONAL NEURAL NETWORK (CNN) IN MICROBLOG TEXT CLASSIFICATION
    Wang, Xiaoming
    Li, Jianping
    Liu, Yifei
    2018 15TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2018, : 127 - 130
  • [39] Convolutional Neural Network Based Text Steganalysis
    Wen, Juan
    Zhou, Xuejing
    Zhong, Ping
    Xue, Yiming
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (03) : 460 - 464
  • [40] Convolutional Recurrent Neural Networks for Text Classification
    Wang, Ruishuang
    Li, Zhao
    Cao, Jian
    Chen, Tong
    Wang, Lei
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,