Implicit Offensive Speech Detection Based on Multi-feature Fusion

被引:0
作者
Guo, Tengda [1 ]
Lin, Lianxin [1 ]
Liu, Hang [1 ]
Zheng, Chengping [1 ]
Tu, Zhijian [1 ]
Wang, Haizhou [1 ]
机构
[1] Sichuan Univ, Sch Cyber Sci & Engn, Chengdu 610207, Peoples R China
来源
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT II, KSEM 2023 | 2023年 / 14118卷
关键词
Implicit Offensive Speech; Classification; Deep Learning; Natural Language Processing; Weibo;
D O I
10.1007/978-3-031-40286-9_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As social media platforms become increasingly strict in censorship of user posts, aggressive and insulting language has gradually shifted to implicit expressions using homophones, metaphors, and other forms of camouflage. This not only intensifies the satirical attacks of negative posts but also leads to a significant decrease in the effectiveness of models designed to detect offensive speech. This paper aims to achieve high accurate detection of implicit offensive speech. We conducted targeted investigations on the speech characteristics on Weibo, which is one of the largest social media platform in China. Based on the identified features of implicit offensive speech, including semantic, emotional, metaphorical, and fallacy characteristics, this paper constructs a BERT-based Multi-Task learning model named BMA (BERT-Mate-Ambiguity) to accurately detect the implicit offensive speech in the real world. Additionally, this paper establishes a dataset based on posts of Weibo that contain implicit offensive speech and conducts various comparative, robustness, and ablation experiments. The effectiveness of the model is demonstrated by comparing it to existing models that perform well in this field. Finally, this paper discusses some of its limitations and proposes future research work.
引用
收藏
页码:27 / 38
页数:12
相关论文
共 20 条
[1]   Mean Birds: Detecting Aggression and Bullying on Twitter [J].
Chatzakou, Despoina ;
Kourtellis, Nicolas ;
Blackburn, Jeremy ;
De Cristofaro, Emiliano ;
Stringhini, Gianluca ;
Vakali, Athena .
PROCEEDINGS OF THE 2017 ACM WEB SCIENCE CONFERENCE (WEBSCI '17), 2017, :13-22
[2]  
Chen Y., 2015, CONVOLUTIONAL NEURAL
[3]  
Davidson T, 2017, Proceedings of the International AAAI Conference on Web and Social Media, V11, P512, DOI [10.1609/icwsm.v11i1.14955, 10.1609/icwsm.v11i1.14955, DOI 10.1609/ICWSM.V11I1.14955]
[4]  
Deng JW, 2022, Arxiv, DOI [arXiv:2201.06025, 10.48550/arXiv.2201.06025]
[5]  
Dhanya L., 2021, P 1 IEEE INT C COMM, P1
[6]  
Gui T, 2019, AAAI CONF ARTIF INTE, P110
[7]  
Kavatagi S., 2021, P 1 IEEE INT C SMART, P1
[8]  
Kennedy G.W., 2017, P 1 WORKSH AB LANG O, P73, DOI DOI 10.18653/V1/W17-3011
[9]  
Kumar A., 2022, P 6 IEEE INT C INT C, P1
[10]  
Lu DM, 2022, PROCEEDINGS OF THE 16TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2022, P186