The Automatic Text Classification Method Based on BERT and Feature Union

Cited by: 20
Authors
Li, Wenting [1]
Gao, Shangbing [1]
Zhou, Hong [1]
Huang, Zihe [1]
Zhang, Kewen [1]
Li, Wei [1]
Affiliations
[1] Huaiyin Inst Technol, Fac Comp & Software Engn, Huaian 223003, Peoples R China
Source
2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS) | 2019
Funding
National Key Research and Development Program of China;
Keywords
NLP; BERT; BiLSTM; CNN; Feature Union; Text classification;
DOI
10.1109/ICPADS47876.2019.00114
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Most traditional deep-learning models for text classification use a CNN (convolutional neural network) or an RNN (recurrent neural network) and take dynamic character-level or word-level embeddings as input, so their text feature extraction is not comprehensive. In the context of Internet of Things development, this paper proposes an automatic text classification method based on BERT (Bidirectional Encoder Representations from Transformers) and feature fusion. First, the BERT model transforms the text into dynamic character-level embeddings. The output features of a BiLSTM (bidirectional long short-term memory) network and a CNN are then combined, exploiting the CNN's strength in extracting local features and the BiLSTM's memory, which links the extracted contextual features, to represent the text better and thereby improve the accuracy of the text classification task. A comparative study shows that the proposed method outperforms state-of-the-art approaches in accuracy. It effectively improves the accuracy of label prediction for text data with sequential features and pronounced local features.
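The feature-fusion step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: random vectors stand in for BERT's per-token embeddings, all weights are untrained, and the choice to run the BiLSTM and CNN branches in parallel over the same embeddings is an assumption that the abstract suggests but does not spell out. Every name and size here (`classify`, `init_params`, the hidden size, the filter count) is hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_last_state(x, W, U, b):
    """Run a one-direction LSTM over x of shape (T, d); return the final hidden state."""
    hidden = U.shape[0]
    h = np.zeros(hidden)
    c = np.zeros(hidden)
    for x_t in x:
        z = x_t @ W + h @ U + b            # pre-activations for the 4 gates, shape (4*hidden,)
        i, f, o, g = np.split(z, 4)        # input, forget, output gates and candidate
        c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
        h = sigmoid(o) * np.tanh(c)
    return h

def bilstm_features(x, fwd, bwd):
    """Concatenate the final state of a forward pass and of a pass over the reversed sequence."""
    return np.concatenate([lstm_last_state(x, *fwd), lstm_last_state(x[::-1], *bwd)])

def cnn_features(x, filters):
    """1D convolution over time, ReLU, then max-over-time pooling (local features)."""
    n_filters, k, d = filters.shape
    windows = np.stack([x[t:t + k] for t in range(x.shape[0] - k + 1)])  # (T-k+1, k, d)
    conv = np.einsum("wkd,fkd->wf", windows, filters)                    # (T-k+1, n_filters)
    return np.maximum(conv, 0.0).max(axis=0)                             # (n_filters,)

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def init_params(d, hidden, n_filters, k, n_classes):
    """Random, untrained parameters; sizes are illustrative assumptions."""
    def lstm():
        return (rng.normal(0, 0.1, (d, 4 * hidden)),
                rng.normal(0, 0.1, (hidden, 4 * hidden)),
                np.zeros(4 * hidden))
    return {
        "fwd": lstm(),
        "bwd": lstm(),
        "filters": rng.normal(0, 0.1, (n_filters, k, d)),
        "W_out": rng.normal(0, 0.1, (2 * hidden + n_filters, n_classes)),
        "b_out": np.zeros(n_classes),
    }

def classify(x, params):
    """Fuse BiLSTM and CNN features by concatenation, then apply a softmax layer."""
    fused = np.concatenate([
        bilstm_features(x, params["fwd"], params["bwd"]),
        cnn_features(x, params["filters"]),
    ])
    return softmax(fused @ params["W_out"] + params["b_out"]), fused

# Stand-in for BERT output: 12 tokens, 768 dimensions each (random, not real embeddings).
x = rng.normal(size=(12, 768))
params = init_params(d=768, hidden=64, n_filters=32, k=3, n_classes=4)
probs, fused = classify(x, params)
print(fused.shape, probs.shape)
```

In this sketch the fused vector has 2 x 64 BiLSTM dimensions plus 32 CNN dimensions. Concatenation is the simplest way to combine the two branches; the abstract does not say whether the authors use concatenation or another merge operation.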
Pages: 774 - 777
Page count: 4
Related papers
50 in total
  • [1] Chinese Text Classification Method Based on BERT Word Embedding
    Wang, Ziniu
    Huang, Zhilin
    Gao, Jianling
    2020 5TH INTERNATIONAL CONFERENCE ON MATHEMATICS AND ARTIFICIAL INTELLIGENCE (ICMAI 2020), 2020, : 66 - 71
  • [2] Research on feature selection method in Chinese text automatic classification
    Hong, Ying
    Geng, Zengmin
    ENERGY SCIENCE AND APPLIED TECHNOLOGY, 2016, : 359 - 361
  • [3] Research on Feature Selection Method in Chinese Text Automatic Classification
    Hong, Ying
    Shao, Xiwen
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON APPLIED SCIENCE AND ENGINEERING INNOVATION, 2015, 12 : 1759 - 1763
  • [4] A NEW FEATURE SELECTION METHOD BASED ON CONCEPT EXTRACTION IN AUTOMATIC CHINESE TEXT CLASSIFICATION
    Liao, Shasha
    Jiang, Minghu
    NEW MATHEMATICS AND NATURAL COMPUTATION, 2007, 3 (03) : 331 - 347
  • [5] A New Method of Improving BERT for Text Classification
    Zheng, Shaomin
    Yang, Meng
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING, PT II, 2019, 11936 : 442 - 452
  • [6] Research on Public Service Request Text Classification Based on BERT-BiLSTM-CNN Feature Fusion
    Xiong, Yunpeng
    Chen, Guolian
    Cao, Junkuo
    APPLIED SCIENCES-BASEL, 2024, 14 (14):
  • [7] Reduce the medical burden: An automatic medical triage system using text classification BERT based on Transformer structure
    Wang, Xinyuan
    Tao, Make
    Wang, Runpu
    Zhang, Likui
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 679 - 685
  • [8] Federated Freeze BERT for text classification
    Omar Galal
    Ahmed H. Abdel-Gawad
    Mona Farouk
    Journal of Big Data, 11
  • [9] Federated Freeze BERT for text classification
    Galal, Omar
    Abdel-Gawad, Ahmed H.
    Farouk, Mona
    JOURNAL OF BIG DATA, 2024, 11 (01)
  • [10] A Multiscale Interactive Attention Short Text Classification Model Based on BERT
    Zhou, Lu
    Wang, Peng
    Zhang, Huijun
    Wu, Shengbo
    Zhang, Tao
    IEEE ACCESS, 2024, 12 : 160992 - 161001