BERT-based chinese text classification for emergency management with a novel loss function

被引:22
|
作者
Wang, Zhongju [1 ,2 ]
Wang, Long [1 ,2 ,3 ]
Huang, Chao [1 ,2 ]
Sun, Shutong [4 ]
Luo, Xiong [1 ,2 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Beijing Key Lab Knowledge Engn Mat Sci, Beijing 100083, Peoples R China
[3] Univ Sci & Technol Beijing, Shunde Grad Sch, Foshan, Peoples R China
[4] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
关键词
Natural language processing; Deep learning; Text classification; Emergency management; SMOTE; DRIVEN;
D O I
10.1007/s10489-022-03946-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an automatic Chinese text categorization method for solving the emergency event report classification problem. Since the bidirectional encoder representations from transformers (BERT) has achieved great success in the natural language processing domain, it is employed to derive emergency text features in this study. To overcome the data imbalance problem in the distribution of emergency event categories, a novel loss function is proposed to improve the performance of the BERT-based model. Meanwhile, in order to avoid the negative impacts of the extreme learning rate, the Adabound optimization algorithm that achieves a gradual smooth transition from Adam optimizer to stochastic gradient descent optimizer is employed to learn the parameters of the model. The feasibility and competitiveness of the proposed method are validated on both imbalanced and balanced datasets. Furthermore, the generic BERT, BERT ensemble LSTM-BERT (BERT-LB), Attention-based BiLSTM fused CNN with gating mechanism (ABLG-CNN), TextRCNN, Att-BLSTM, and DPCNN are used as benchmarks on these two datasets. Meanwhile, sampling methods, including random sampling, ADASYN, synthetic minority over-sampling techniques (SMOTE), and Borderline-SMOTE, are employed to verify the performance of the proposed loss function on the imbalance dataset. Compared with benchmarking methods, the proposed method has achieved the best performance in terms of accuracy, weighted average precision, weighted average recall, and weighted average F1 values. Therefore, it is promising to employ the proposed method for real applications in smart emergency management systems.
引用
收藏
页码:10417 / 10428
页数:12
相关论文
共 50 条
  • [41] Financial causal sentence recognition based on BERT-CNN text classification
    Wan, Chang-Xuan
    Li, Bo
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (05) : 6503 - 6527
  • [42] Research on Intelligent Classification Method of Seismic Information Text Based on BERT-BiLSTM Optimization Algorithm
    Wang Zhonghao
    Li Chenxi
    Huang Meng
    Liu Shuai
    2022 IEEE 2ND INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE (CCAI 2022), 2022, : 55 - 59
  • [43] Text classification for distribution substation inspection based on BERT-TextRCNN model
    Lu, Jiangang
    Zhao, Ruifeng
    Yu, Zhiwen
    Dai, Yue
    Shu, Jiawei
    Yang, Ting
    FRONTIERS IN ENERGY RESEARCH, 2024, 12
  • [44] Multi-label Classification of Chinese Judicial Documents based on BERT
    Dai, Mian
    Liu, Chao-Lin
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 1866 - 1867
  • [45] Text classification for evaluating digital technology adoption maturity based on BERT: An evidence of Industrial AI from China
    Wang, Yanhong
    Gong, Chen
    Ji, Xiaodong
    Yuan, Qi
    TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2025, 211
  • [46] BERT-based Prediction Model of Management Sales Forecast Error using Japanese Firms' Earnings Meeting Transcripts
    Bao, Siya
    Jin, Yiqun
    2024 IEEE 21ST CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2024, : 1066 - 1067
  • [47] News Text Classification and Recommendation Technology Based on Wide & Deep-Bert Model
    Wu Jing
    Yang Bailong
    2021 IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SOFTWARE ENGINEERING (ICICSE 2021), 2021, : 209 - 216
  • [48] A Chinese text classification algorithm based on granular computing
    Qiu, Taorong
    Huang, Houkuan
    Liu, Qing
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 4042 - +
  • [49] The Instructional Design of Chinese Text Classification based on SVM
    Wei, Sichao
    Guo, Jianyi
    Yu, Zhengtao
    Chen, Peng
    Xian, Yantuan
    2013 25TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2013, : 5114 - 5117
  • [50] The Research of Chinese Text Automatic Classification Based on Multiple
    Zhang, Shengli
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 1543 - 1548