Domain-Enhanced Prompt Learning for Chinese Implicit Hate Speech Detection

Cited by: 2

Authors:

Zhang, Yaosheng [1]

Zhong, Tiegang [2]

Yi, Tingjun [1]

Li, Haoming [2]

Affiliations:

[1] Liaoning Tech Univ, Sch Elect & Informat Engn, Huludao 125105, Peoples R China

[2] Henan Univ Anim Husb & Econ, Sch Informat Engn, Zhengzhou 450000, Peoples R China

Source:

IEEE ACCESS | 2024 / Vol. 12

Keywords:

Hate speech; Task analysis; Feature extraction; Speech recognition; Training; Support vector machines; Semantics; Few-shot learning; Hate speech detection; domain feature; prompt learning; few-shot;

DOI:

10.1109/ACCESS.2024.3351804

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Hate speech detection, which aims to identify the harmful speech that is widespread on social networks, is a long-standing research field. Despite its significance, previous efforts have focused almost exclusively on English, leaving a notable scarcity of datasets for hate speech detection in Chinese. Moreover, two forms of hate speech have emerged under stringent regulatory environments: 1) domain specificity, which manifests as nuanced, harder-to-detect aggressive rhetoric particular to individual domains; and 2) implicitness, characterized by indirect, abstract, and ambiguous cold language. This evolution presents additional complexities for multi-domain implicit hate speech detection in Chinese. To fill this gap, we construct an implicit hate speech detection dataset of size 20,000 covering nine domains. Furthermore, we introduce a Domain-enhanced Prompt Learning (DePL) approach tailored to multi-domain and data-limited scenarios. The method combines domain feature fusion, which encodes the domain-specific features of hate speech, with recent advances in prompt learning, addressing the dual challenges of domain diversity and data scarcity. Experimental results demonstrate that DePL achieves state-of-the-art (SOTA) results on our benchmark dataset in both few-shot and full-scale scenarios.


Pages: 13773-13782

Page count: 10

