BERT-based chinese text classification for emergency management with a novel loss function

被引:22
|
作者
Wang, Zhongju [1 ,2 ]
Wang, Long [1 ,2 ,3 ]
Huang, Chao [1 ,2 ]
Sun, Shutong [4 ]
Luo, Xiong [1 ,2 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
[2] Beijing Key Lab Knowledge Engn Mat Sci, Beijing 100083, Peoples R China
[3] Univ Sci & Technol Beijing, Shunde Grad Sch, Foshan, Peoples R China
[4] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
关键词
Natural language processing; Deep learning; Text classification; Emergency management; SMOTE; DRIVEN;
D O I
10.1007/s10489-022-03946-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an automatic Chinese text categorization method for solving the emergency event report classification problem. Since the bidirectional encoder representations from transformers (BERT) has achieved great success in the natural language processing domain, it is employed to derive emergency text features in this study. To overcome the data imbalance problem in the distribution of emergency event categories, a novel loss function is proposed to improve the performance of the BERT-based model. Meanwhile, in order to avoid the negative impacts of the extreme learning rate, the Adabound optimization algorithm that achieves a gradual smooth transition from Adam optimizer to stochastic gradient descent optimizer is employed to learn the parameters of the model. The feasibility and competitiveness of the proposed method are validated on both imbalanced and balanced datasets. Furthermore, the generic BERT, BERT ensemble LSTM-BERT (BERT-LB), Attention-based BiLSTM fused CNN with gating mechanism (ABLG-CNN), TextRCNN, Att-BLSTM, and DPCNN are used as benchmarks on these two datasets. Meanwhile, sampling methods, including random sampling, ADASYN, synthetic minority over-sampling techniques (SMOTE), and Borderline-SMOTE, are employed to verify the performance of the proposed loss function on the imbalance dataset. Compared with benchmarking methods, the proposed method has achieved the best performance in terms of accuracy, weighted average precision, weighted average recall, and weighted average F1 values. Therefore, it is promising to employ the proposed method for real applications in smart emergency management systems.
引用
收藏
页码:10417 / 10428
页数:12
相关论文
共 50 条
  • [21] BERT-Based Joint Model for Aspect Term Extraction and Aspect Polarity Detection in Arabic Text
    Chouikhi, Hasna
    Alsuhaibani, Mohammed
    Jarray, Fethi
    ELECTRONICS, 2023, 12 (03)
  • [22] A BERT-Based Multi-Criteria Recommender System for Hotel Promotion Management
    Zhuang, Yuanyuan
    Kim, Jaekyeong
    SUSTAINABILITY, 2021, 13 (14)
  • [23] Research on news text classification based on improved BERT-UNet model
    Li, Zeqin
    Liu, Jianwen
    Lin, Jin
    Tan, Deli
    Gong, Ruyue
    Wang, Linglin
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MODELING, NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING, CMNM 2024, 2024, : 1 - 7
  • [24] BERT-TFBS: a novel BERT-based model for predicting transcription factor binding sites by transfer learning
    Wang, Kai
    Zeng, Xuan
    Zhou, Jingwen
    Liu, Fei
    Luan, Xiaoli
    Wang, Xinglong
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (03)
  • [25] Research on Chinese Keyword Recognition Based on BERT Binary Classification Algorithm
    Zhu, Chunling
    Wu, Di
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON MACHINE INTELLIGENCE AND DIGITAL APPLICATIONS, MIDA2024, 2024, : 689 - 695
  • [26] BRsyn-Caps: Chinese Text Classification Using Capsule Network Based on Bert and Dependency Syntax
    Luo, Jie
    He, Chengwan
    Luo, Hongwei
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107 (02) : 212 - 219
  • [27] A Methodology for Emergency Calls Severity Prediction: From Pre-processing to BERT-Based Classifiers
    Kanaan, Marianne Abi
    Couchot, Jean-Francois
    Guyeux, Christophe
    Laiymani, David
    Atechian, Talar
    Darazi, Rony
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2023, PT I, 2023, 675 : 329 - 342
  • [28] Text Classification Research Based on Bert Model and Bayesian Network
    Liu, Songsong
    Tao, Haijun
    Feng, Shiling
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 5842 - 5846
  • [29] The Automatic Text Classification Method Based on BERT and Feature Union
    Li, Wenting
    Gao, Shangbing
    Zhou, Hong
    Huang, Zihe
    Zhang, Kewen
    Li, Wei
    2019 IEEE 25TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2019, : 774 - 777
  • [30] Cross-Domain Text Classification Based on BERT Model
    Zhang, Kuan
    Hei, Xinhong
    Fei, Rong
    Guo, Yufan
    Jiao, Rui
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS: DASFAA 2021 INTERNATIONAL WORKSHOPS, 2021, 12680 : 197 - 208