Text Classification Based on Word2vec and Convolutional Neural Network

被引:5
作者
Li, Lin [1 ]
Xiao, Linlong [1 ]
Jin, Wenzhen [1 ]
Zhu, Hong [1 ]
Yang, Guocai [1 ]
机构
[1] Southwest Univ, Sch Comp & Informat Sci, Chongqing, Peoples R China
来源
NEURAL INFORMATION PROCESSING (ICONIP 2018), PT V | 2018年 / 11305卷
关键词
Text classification; Text representation; Word2vec; Convolutional neural network;
D O I
10.1007/978-3-030-04221-9_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text representations in text classification usually have high dimensionality and are lack of semantics, resulting in poor classification effect. In this paper, TF-IDF is optimized by using optimization factors, then word2vec with semantic information is weighted, and the single-text representation model CD_STR is obtained. Based on the CD_STR model, the latent semantic index (LSI) and the TF-IDF weighted vector space model (T_VSM) are merged to obtain a fusion model, CD_MTR, which is more efficient. The text classification method MTR_MCNN of the fusion model CD_MTR combined with convolutional neural network is further proposed. This method first designs convolution kernels of different sizes and numbers, allowing them to extract text features from different aspects. Then the text vectors trained by the CD_MTR model are used as the input to the improved convolutional neural network. Tests on two datasets have verified that the performance of the two models, CD_STR and CD_MTR, is superior to other comparable textual representation models. The classification effect of MTR_MCNN method is better than that of other comparison methods, and the classification accuracy is higher than that of CD_MTR model.
引用
收藏
页码:450 / 460
页数:11
相关论文
共 50 条
  • [41] Convolutional Recurrent Neural Networks for Text Classification
    Lyu, Shengfei
    Liu, Jiaqi
    JOURNAL OF DATABASE MANAGEMENT, 2021, 32 (04) : 65 - 82
  • [42] A Neural Network Based Text Classification with Attention Mechanism
    Lu SiChen
    PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2019), 2019, : 333 - 338
  • [43] Sentiment classification with word localization based on weakly supervised learning with a convolutional neural network
    Lee, Gichang
    Jeong, Jaeyun
    Seo, Seungwan
    Kim, CzangYeob
    Kang, Pilsung
    KNOWLEDGE-BASED SYSTEMS, 2018, 152 : 70 - 82
  • [44] Comparative Analysis of Convolutional Neural Network and LSTM in Text-Based Sentiment Classification
    Kalaivani, M. S.
    Jayalakshmi, S.
    PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 1205 - 1211
  • [45] Categorizing Customer Notifications with an Artificial Intelligence Method Word2vec
    Tunc, Ali
    Altun, Adem Alpaslan
    Tasdemir, Sakir
    2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2020, : 68 - 73
  • [46] A New Method for Sentence Vector Normalization Using Word2vec
    Abdolahi, Mohamad
    Zahedi, Morteza
    INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2019, 10 (02): : 87 - 96
  • [47] Text Classification Based on Convolutional Neural Networks and Word Embedding for Low-Resource Languages: Tigrinya
    Fesseha, Awet
    Xiong, Shengwu
    Emiru, Eshete Derb
    Diallo, Moussa
    Dahou, Abdelghani
    INFORMATION, 2021, 12 (02) : 1 - 17
  • [48] Combining a Bi-LSTM-Based Siamese Network with Word2Vec Algorithm for Classifying High-Dimensional Dataset
    Lin, Xian-Zhong
    Yeh, Chia-Hsuan
    Jea, Kuen-Fang
    2021 INTERNATIONAL CONFERENCE ON SECURITY AND INFORMATION TECHNOLOGIES WITH AI, INTERNET COMPUTING AND BIG-DATA APPLICATIONS, 2023, 314 : 201 - 211
  • [49] Gender Classification Based on the Convolutional Neural Network
    Lu, Qingqing
    Lu, Jianfeng
    Yu, Dongjun
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 1962 - 1965
  • [50] Convolutional Neural Network based for Automatic Text Summarization
    Alquliti, Wajdi Homaid
    Ghani, Norjihan Binti Abdul
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (04) : 200 - 211