The Automatic Text Classification Method Based on BERT and Feature Union

被引:20
|
作者
Li, Wenting [1 ]
Gao, Shangbing [1 ]
Zhou, Hong [1 ]
Huang, Zihe [1 ]
Zhang, Kewen [1 ]
Li, Wei [1 ]
机构
[1] Huaiyin Inst Technol, Fac Comp & Software Engn, Huaian 223003, Peoples R China
来源
2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS) | 2019年
基金
国家重点研发计划;
关键词
NLP; BERT; BiLSTM; CNN; Feature Union; Text classification;
D O I
10.1109/ICPADS47876.2019.00114
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
For the traditional model based on the deep learning method most used CNN(convolutional neural networks) or RNN(Recurrent neural Network) model and is based on the dynamic character-level embedding or word-level embedding as input, so there is a problem that the text feature extraction is not comprehensive. In the development environment of the Internet of Things, A method of Automatic text classification based on BERT(Bidirectional Encoder Representations from Transformers) and Feature Fusion was proposed in this paper. Firstly, the text-to-dynamic character-level embedding is transformed by the BERT model, and the BiLSTM(Bi-directional Long-Short Term Memory) and CNN output features are combined and merged to make full use of CNN to extract the advantages of local features and to use BiLSTM to have the advantage of memory to link the extracted context features to better represent the text, so as to improve the accuracy of text classification task. A comparative study with state-of-the-art approaches manifests the proposed method outperforms the state-of-the-art methods in accuracy. It can effectively improve the accuracy of tag prediction for text data with sequence features and obvious local features.
引用
收藏
页码:774 / 777
页数:4
相关论文
共 50 条
  • [41] A new feature selection method based on frequent and associated itemsets for text classification
    Farghaly, Heba Mamdouh
    Abd El-Hafeez, Tarek
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (25)
  • [42] Hybrid Support Vector Machine based Feature Selection Method for Text Classification
    Sabbah, Thabit
    Ayyash, Mosab
    Ashraf, Mahmood
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2018, 15 (3A) : 599 - 609
  • [43] SSMBERT: A Space Science Mission Requirement Classification Method Based on BERT
    Zhu, Yiming
    Zhang, Yuzhu
    Peng, Xiaodong
    Xue, Changbin
    Chen, Bin
    Cao, Yu
    AEROSPACE, 2024, 11 (12)
  • [44] Feature Extraction based Text Classification: A review
    Shaker, Saif Safaa
    Alhajim, Dhafer
    Al-Khazaali, Ahmed Ali Talib
    Hussein, Hussein Aqeel
    Athab, Ali F.
    JOURNAL OF ALGEBRAIC STATISTICS, 2022, 13 (01) : 646 - 653
  • [45] A New Filter Feature Selection Method for Text Classification
    Cekik, Rasim
    IEEE ACCESS, 2024, 12 : 139316 - 139335
  • [46] Statera: A Balanced Feature Selection Method for Text Classification
    Gama Bispo, Braian Varjao
    Rios, Tatiane Nogueira
    2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2018, : 260 - 265
  • [47] A parallel feature selection method study for text classification
    Zhao Li
    Wei Lu
    Zhanquan Sun
    Weiwei Xing
    Neural Computing and Applications, 2017, 28 : 513 - 524
  • [48] A novel probabilistic feature selection method for text classification
    Uysal, Alper Kursat
    Gunal, Serkan
    KNOWLEDGE-BASED SYSTEMS, 2012, 36 : 226 - 235
  • [49] Three-Branch BERT-Based Text Classification Network for Gastroscopy Diagnosis Text
    Wang Z.
    Zheng X.
    Zhang J.
    Zhang M.
    International Journal of Crowd Science, 2024, 8 (01) : 56 - 63
  • [50] A parallel feature selection method study for text classification
    Li, Zhao
    Lu, Wei
    Sun, Zhanquan
    Xing, Weiwei
    NEURAL COMPUTING & APPLICATIONS, 2017, 28 : S513 - S524