Automatic text classification algorithm based on Gauss improved convolutional neural network

被引:13
|
作者
Du, Jian-hai [1 ]
机构
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China
关键词
Clustering algorithm; Text classification; Parallel computation; Natural language; Neural network;
D O I
10.1016/j.jocs.2017.06.010
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The traditional KNN query is a kind of algorithm with good stability and accuracy performance. However, when the sample size is too large, the computational efficiency of the algorithm is affected greatly. Therefore, a kind of parallel MKNN text classification algorithm based on clustering center text series has been proposed. Firstly, the effective dimensionality reduction of similarity calculation amount of the algorithm is realized based on the clustering center, and the original large-scale document samples are replaced with a relatively small number of clustering sample centers to realize improvement of the KNN query process. Secondly, MapReduce parallel framework is used to meet real-time demand of large-scale text classification and calculation combined with features of text classification, and to effectively overcome slow speed of the KNN query process and ensure accuracy of text classification as higher as possible. Finally, the classification speed of proposed algorithm can be effectively improved under the premise of ensuring sufficient accuracy through comparison in experiment of text classification accuracy and algorithmic efficiency with the similar single-threaded algorithm. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:195 / 200
页数:6
相关论文
共 50 条
  • [1] Application of an Improved Convolutional Neural Network Algorithm in Text Classification
    Peng, Jing
    Huo, Shuquan
    JOURNAL OF WEB ENGINEERING, 2024, 23 (03): : 315 - 340
  • [2] News Text Classification Based on an Improved Convolutional Neural Network
    Tao, Wenjing
    Chang, Dan
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2019, 26 (05): : 1400 - 1409
  • [3] Application of Improved Convolutional Neural Network in Text Classification
    Ronghui, Liu
    Xinhong, Wei
    IAENG International Journal of Computer Science, 2022, 49 (03)
  • [4] Convolutional Neural Network Algorithm-Based Novel Automatic Text Classification Framework for Construction Accident Reports
    Luo, Xixi
    Li, Xinchun
    Song, Xuefeng
    Liu, Quanlong
    JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2023, 149 (12)
  • [5] Convolutional Neural Network based for Automatic Text Summarization
    Alquliti, Wajdi Homaid
    Ghani, Norjihan Binti Abdul
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (04) : 200 - 211
  • [6] Automatic Classification of Microscopic Hair Images Based on Improved Convolutional Neural Network
    Jiang Xiaojia
    Gao Shuhui
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (23)
  • [7] An Improved BP Neural Network Algorithm for Text Classification
    Lei, Fei
    Yu, Yongbin
    Guo, Yuxin
    Tashi, Nyima
    Zhang, Huan
    Dang, Bo
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 4474 - 4478
  • [8] Fault Text Classification Based on Convolutional Neural Network
    Wang, Lixia
    Zhang, Botao
    2020 IEEE 7TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND APPLICATIONS (ICIEA 2020), 2020, : 937 - 941
  • [9] Convolutional-neural-network-based Multilabel Text Classification for Automatic Discrimination of Legal Documents
    Qiu, Ming
    Zhang, Yiru
    Ma, Tianqi
    Wu, Qingfeng
    Jin, Fanzhu
    SENSORS AND MATERIALS, 2020, 32 (08) : 2659 - 2672
  • [10] Text Classification Based on Convolutional Neural Network and Attention Model
    Yang, Shuang
    Tang, Yan
    2020 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA (ICAIBD 2020), 2020, : 67 - 73