Research on news text classification based on improved BERT-UNet model

被引:0
作者
Li, Zeqin [1 ]
Liu, Jianwen [1 ]
Lin, Jin [1 ]
Tan, Deli [1 ]
Gong, Ruyue [1 ]
Wang, Linglin [1 ]
机构
[1] Neusoft Inst Guangdong Foshan, Foshan, Peoples R China
来源
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MODELING, NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING, CMNM 2024 | 2024年
关键词
BERT-UNet model; Deep learning; News text classification; Natural language processing;
D O I
10.1145/3677779.3677780
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The classification of news text is crucial for various applications such as information retrieval, sentiment analysis, and intelligence gathering. In light of the limitations of conventional convolutional neural networks in news text classification, this study introduces an enhanced BERT-UNet model for improved long-distance text feature extraction. Initially, the model leverages BERT for pre-training the text word vectors, followed by embedding and mapping them onto the UNet architecture to extract contextual key features. The Softmax function is then utilized for news opinion text categorization. To validate the model's performance, comparative experiments are conducted on the THUCNews dataset. The results indicate that the BERT-UNET model outperforms the standard TextCNN model and standalone BERT approach with a 3.11% and 0.29% increase in macro average F1 value, respectively. These findings demonstrate the effectiveness of the enhanced BERT-UNet model in capturing textual relationships, offering a fresh perspective on enhancing traditional news text classification methods.
引用
收藏
页码:1 / 7
页数:7
相关论文
共 18 条
[1]  
Chen Xin, 2021, 2021 3rd International Conference on Natural Language Processing (ICNLP), P14, DOI 10.1109/ICNLP52887.2021.00008
[2]   Chinese News Text Classification Method via Key Feature Enhancement [J].
Ge, Bin ;
He, Chunhui ;
Xu, Hao ;
Wu, Jibing ;
Tang, Jiuyang .
APPLIED SCIENCES-BASEL, 2023, 13 (09)
[3]  
Hu W., 2019, 2019 IEEE 4 INT C DA, P284
[4]  
Huang LZ, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3444
[5]  
Imad J., 2023, International Journal on Electrical Engineering and Informatics, V15, P277, DOI [10.15676/ijeei.2023.15.2.7, DOI 10.15676/IJEEI.2023.15.2.7]
[6]  
Kim Y., 2014, ACL, DOI DOI 10.3115/V1/D14-1181
[7]   Novel heuristic-based hybrid ResNeXt with recurrent neural network to handle multi class classification of sentiment analysis [J].
Krosuri, Lakshmi Revathi ;
Aravapalli, Rama Satish .
MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2023, 4 (01)
[8]  
Lai SW, 2015, AAAI CONF ARTIF INTE, P2267
[9]   Research on the Classification of New Energy Industry Policy Texts Based on BERT Model [J].
Li, Qian ;
Xiao, Zezhong ;
Zhao, Yanyun .
SUSTAINABILITY, 2023, 15 (14)
[10]  
Lin Y, 2014, 2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), P842, DOI 10.1109/ICSESS.2014.6933697