The Automatic Text Classification Method Based on BERT and Feature Union

被引:20
|
作者
Li, Wenting [1 ]
Gao, Shangbing [1 ]
Zhou, Hong [1 ]
Huang, Zihe [1 ]
Zhang, Kewen [1 ]
Li, Wei [1 ]
机构
[1] Huaiyin Inst Technol, Fac Comp & Software Engn, Huaian 223003, Peoples R China
来源
2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS) | 2019年
基金
国家重点研发计划;
关键词
NLP; BERT; BiLSTM; CNN; Feature Union; Text classification;
D O I
10.1109/ICPADS47876.2019.00114
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
For the traditional model based on the deep learning method most used CNN(convolutional neural networks) or RNN(Recurrent neural Network) model and is based on the dynamic character-level embedding or word-level embedding as input, so there is a problem that the text feature extraction is not comprehensive. In the development environment of the Internet of Things, A method of Automatic text classification based on BERT(Bidirectional Encoder Representations from Transformers) and Feature Fusion was proposed in this paper. Firstly, the text-to-dynamic character-level embedding is transformed by the BERT model, and the BiLSTM(Bi-directional Long-Short Term Memory) and CNN output features are combined and merged to make full use of CNN to extract the advantages of local features and to use BiLSTM to have the advantage of memory to link the extracted context features to better represent the text, so as to improve the accuracy of text classification task. A comparative study with state-of-the-art approaches manifests the proposed method outperforms the state-of-the-art methods in accuracy. It can effectively improve the accuracy of tag prediction for text data with sequence features and obvious local features.
引用
收藏
页码:774 / 777
页数:4
相关论文
共 50 条
  • [31] Emotion Classification of Text Based on BERT and Broad Learning System
    Peng, Sancheng
    Zeng, Rong
    Liu, Hongzhan
    Chen, Guanghao
    Wu, Ruihuan
    Yang, Aimin
    Yu, Shui
    WEB AND BIG DATA, APWEB-WAIM 2021, PT I, 2021, 12858 : 382 - 396
  • [32] Few-shot Text Classification Method Based on Feature Optimization
    Peng, Jing
    Huo, Shuquan
    JOURNAL OF WEB ENGINEERING, 2023, 22 (03): : 497 - 514
  • [33] A feature selection method based on synonym merging in text classification system
    Yao, Haipeng
    Liu, Chong
    Zhang, Peiying
    Wang, Luyao
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2017,
  • [34] Chinese News Text Classification Method Based On Attention Mechanism
    Ruan, Jinjun
    Caballero, Jonathan M.
    Juanatas, Ronaldo A.
    2022 7TH INTERNATIONAL CONFERENCE ON BUSINESS AND INDUSTRIAL RESEARCH (ICBIR2022), 2022, : 330 - 334
  • [35] Chinese Text Feature Extraction and Classification Based on Deep Learning
    Wang, Ruishuang
    Li, Zhao
    Cao, Jian
    Chen, Tong
    PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2019), 2019,
  • [36] Effective text classification using BERT, MTM LSTM, and DT
    Jamshidi, Saman
    Mohammadi, Mahin
    Bagheri, Saeed
    Najafabadi, Hamid Esmaeili
    Rezvanian, Alireza
    Gheisari, Mehdi
    Ghaderzadeh, Mustafa
    Shahabi, Amir Shahab
    Wu, Zongda
    DATA & KNOWLEDGE ENGINEERING, 2024, 151
  • [37] Bert-Enhanced Text Graph Neural Network for Classification
    Yang, Yiping
    Cui, Xiaohui
    ENTROPY, 2021, 23 (11)
  • [38] Research on Intelligent Classification Method of Seismic Information Text Based on BERT-BiLSTM Optimization Algorithm
    Wang Zhonghao
    Li Chenxi
    Huang Meng
    Liu Shuai
    2022 IEEE 2ND INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE (CCAI 2022), 2022, : 55 - 59
  • [39] Analyzing the Performance of BERT for the Sentiment Classification Task in Bengali Text
    Banshal, Sumit Kumar
    Uddin, Ashraf
    Piryani, Rajesh
    ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT III, 2024, 2092 : 273 - 285
  • [40] A Chi-square Statistics Based Feature Selection Method in Text Classification
    Zhai, Yujia
    Song, Wei
    Liu, Xianjun
    Liu, Lizhen
    Zhao, Xinlei
    PROCEEDINGS OF 2018 IEEE 9TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2018, : 160 - 163