Text Classification Based on Word2vec and Convolutional Neural Network

被引:5
作者
Li, Lin [1 ]
Xiao, Linlong [1 ]
Jin, Wenzhen [1 ]
Zhu, Hong [1 ]
Yang, Guocai [1 ]
机构
[1] Southwest Univ, Sch Comp & Informat Sci, Chongqing, Peoples R China
来源
NEURAL INFORMATION PROCESSING (ICONIP 2018), PT V | 2018年 / 11305卷
关键词
Text classification; Text representation; Word2vec; Convolutional neural network;
D O I
10.1007/978-3-030-04221-9_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text representations in text classification usually have high dimensionality and are lack of semantics, resulting in poor classification effect. In this paper, TF-IDF is optimized by using optimization factors, then word2vec with semantic information is weighted, and the single-text representation model CD_STR is obtained. Based on the CD_STR model, the latent semantic index (LSI) and the TF-IDF weighted vector space model (T_VSM) are merged to obtain a fusion model, CD_MTR, which is more efficient. The text classification method MTR_MCNN of the fusion model CD_MTR combined with convolutional neural network is further proposed. This method first designs convolution kernels of different sizes and numbers, allowing them to extract text features from different aspects. Then the text vectors trained by the CD_MTR model are used as the input to the improved convolutional neural network. Tests on two datasets have verified that the performance of the two models, CD_STR and CD_MTR, is superior to other comparable textual representation models. The classification effect of MTR_MCNN method is better than that of other comparison methods, and the classification accuracy is higher than that of CD_MTR model.
引用
收藏
页码:450 / 460
页数:11
相关论文
共 50 条
  • [21] TextConvoNet: a convolutional neural network based architecture for text classification
    Soni, Sanskar
    Chouhan, Satyendra Singh
    Rathore, Santosh Singh
    APPLIED INTELLIGENCE, 2023, 53 (11) : 14249 - 14268
  • [22] News Text Classification Based on an Improved Convolutional Neural Network
    Tao, Wenjing
    Chang, Dan
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2019, 26 (05): : 1400 - 1409
  • [23] Text Classification with Topic-based Word Embedding and Convolutional Neural Networks
    Xu, Haotian
    Dong, Ming
    Zhu, Dongxiao
    Kotov, Alexander
    Carcone, April Idalski
    Naar-King, Sylvie
    PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2016, : 88 - 97
  • [24] The new deep learning architecture based on GRU and word2vec
    Atassi, Abdelhamid
    El Azami, Ikram
    Sadiq, Abdelalim
    2018 INTERNATIONAL CONFERENCE ON ELECTRONICS, CONTROL, OPTIMIZATION AND COMPUTER SCIENCE (ICECOCS), 2018,
  • [25] Application of an Improved Convolutional Neural Network Algorithm in Text Classification
    Peng, Jing
    Huo, Shuquan
    JOURNAL OF WEB ENGINEERING, 2024, 23 (03): : 315 - 340
  • [26] Document Classification Using Word2Vec and Chi-square on Apache Spark
    Choi, Mijin
    Jin, Rize
    Chung, Tae-Sun
    ADVANCES IN COMPUTER SCIENCE AND UBIQUITOUS COMPUTING, 2017, 421 : 867 - 872
  • [27] A morpheme sequence and convolutional neural network based Kazakh text classification
    Parhat, Sardar
    Ting, Gao
    Ablimit, Mijit
    Hamdulla, Askar
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1903 - 1906
  • [28] Semantic Template-based Convolutional Neural Network for Text Classification
    Chang, Yung-Chun
    Ng, Siu Hin
    Chen, Jung-Peng
    Liang, Yu-Chi
    Hsu, Wen-Lian
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (11)
  • [29] Research on application of article recommendation algorithm based on Word2Vec and Tfidf
    Wang, Rui
    Shi, Yuliang
    2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 454 - 457
  • [30] Automated Classification of Exchange Information Requirements for Construction Projects Using Word2Vec and SVM
    Mitera-Kielbasa, Ewelina
    Zima, Krzysztof
    INFRASTRUCTURES, 2024, 9 (11)