Ex-ThaiHate: A Generative Multi-task Framework for Sentiment and Emotion Aware Hate Speech Detection with Explanation in Thai

Cited by: 0

Authors:

Maity, Krishanu [1]

Bhattacharya, Shaubhik [1]

Phosit, Salisa [2]

Kongsamlit, Sawarod [2]

Saha, Sriparna [1]

Pasupa, Kitsuchart [2]

Affiliations:

[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Patna 801103, Bihar, India

[2] King Mongkuts Inst Technol Ladkrabang, Sch Informat Technol, Bangkok 10520, Thailand

Source:

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI | 2023 / Vol. 14174

Keywords:

Hate Speech; Sentiment; Emotion; Explainability; Thai; Multi-task

DOI:

10.1007/978-3-031-43427-3_9

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

Social media platforms have both positive and negative impacts on users in diverse societies. One of the adverse effects of social media platforms is the usage of hate and offensive language, which not only fosters prejudice but also harms the vulnerable. Additionally, a person's sentiment and emotional state heavily influence the intended content of any social media post. Despite extensive research being conducted to detect online hate speech in English, there is a lack of similar studies on low-resource languages such as Thai. The recent enactment of laws like the "right to explanations" in the General Data Protection Regulation has stimulated the development of interpretable models rather than solely focusing on performance. Motivated by this, we created the first benchmark hate speech corpus, called Ex-ThaiHate, in the Thai language. Each post is annotated with four labels, namely hate, sentiment, emotion, and rationales (explainability), which specify the phrases that are responsible for annotating the post as hate. In order to investigate the effect of sentiment and emotional information on detecting hate speech posts, we propose a unified generative framework called GenX, which redefines this multi-task problem as a text-to-text generation task to simultaneously solve four tasks: hate-speech identification, rationale detection, sentiment, and emotion detection. Our extensive experiments demonstrate that GenX significantly outperforms all baselines and state-of-the-art models, thereby highlighting its effectiveness in detecting hate speech and identifying the rationales in low-resource languages. The code and dataset are available at https://github.com/dsmlr/Ex-ThaiHate. Disclaimer: The article contains offensive text and profanity. This is due to the nature of the work and does not reflect any opinion or stance of the authors.
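The abstract's idea of recasting four classification tasks as one text-to-text generation task can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the `Annotation` class, the `to_target` and `parse_target` helpers, the `|` and `;` separators, and the label strings are hypothetical and are not taken from the GenX code or the Ex-ThaiHate dataset. The sketch only shows how four labels for one post might be serialized into a single target string for a sequence-to-sequence model, and parsed back from generated text.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Annotation:
    """Four labels for one post (hypothetical field names and label values)."""
    hate: str
    sentiment: str
    emotion: str
    rationales: List[str]


def to_target(a: Annotation) -> str:
    """Serialize all four labels into one target string for a seq2seq model."""
    # Posts without rationale phrases get the placeholder "none".
    rationale = " ; ".join(a.rationales) if a.rationales else "none"
    return (
        f"hate: {a.hate} | sentiment: {a.sentiment} | "
        f"emotion: {a.emotion} | rationale: {rationale}"
    )


def parse_target(text: str) -> Annotation:
    """Recover the four labels from a generated string in the same format."""
    parts = {}
    for chunk in text.split(" | "):
        key, _, value = chunk.partition(": ")
        parts[key.strip()] = value.strip()
    rationale = parts.get("rationale", "none")
    rationales = (
        [] if rationale == "none"
        else [r.strip() for r in rationale.split(" ; ")]
    )
    return Annotation(
        hate=parts.get("hate", ""),
        sentiment=parts.get("sentiment", ""),
        emotion=parts.get("emotion", ""),
        rationales=rationales,
    )


if __name__ == "__main__":
    example = Annotation("hate", "negative", "anger", ["phrase one", "phrase two"])
    target = to_target(example)
    print(target)
    print(parse_target(target) == example)
```

With a format of this kind, a single generated sequence carries the predictions for all four tasks, so one model and one training objective can cover them jointly. The actual input and output templates used by the authors should be checked against the linked repository.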


Pages: 139-156

Page count: 18

9 items in total

[1] Towards Analyzing the Efficacy of Multi-task Learning in Hate Speech Detection
Maity, Krishanu
Balaji, Gokulapriyan
Saha, Sriparna
NEURAL INFORMATION PROCESSING, ICONIP 2023, PT VI, 2024, 14452 : 317 - 328
[2] HHSD: Hindi Hate Speech Detection Leveraging Multi-Task Learning
Kapil, Prashant
Kumari, Gitanjali
Ekbal, Asif
Pal, Santanu
Chatterjee, Arindam
Vinutha, B. N.
IEEE ACCESS, 2023, 11 : 101460 - 101473
[3] FADOHS: Framework for Detection and Integration of Unstructured Data of Hate Speech on Facebook Using Sentiment and Emotion Analysis
Rodriguez, Axel
Chen, Yi-Ling
Argueta, Carlos
IEEE ACCESS, 2022, 10 : 22400 - 22419
[4] A Multitask Framework for Sentiment, Emotion and Sarcasm aware Cyberbullying Detection from Multi-modal Code-Mixed Memes
Maity, Krishanu
Jha, Prince
Saha, Sriparna
Bhattacharyya, Pushpak
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1739 - 1749
[5] Generalizing Hate Speech Detection Using Multi-Task Learning: A Case Study of Political Public Figures
Yuan, Lanqin
Rizoiu, Marian-Andrei
COMPUTER SPEECH AND LANGUAGE, 2025, 89
[6] Arabic Offensive and Hate Speech Detection Using a Cross-Corpora Multi-Task Learning Model
Aldjanabi, Wassen
Dahou, Abdelghani
Al-qaness, Mohammed A. A.
Abd Elaziz, Mohamed
Helmi, Ahmed Mohamed
Damasevicius, Robertas
INFORMATICS-BASEL, 2021, 8 (04):
[7] A transformer-based multi-task framework for joint detection of aggression and hate on social media data
Ghosh, Soumitra
Priyankar, Amit
Ekbal, Asif
Bhattacharyya, Pushpak
NATURAL LANGUAGE ENGINEERING, 2023, 29 (06) : 1495 - 1515
[8] Spanish MTLHateCorpus 2023: Multi-task learning for hate speech detection to identify speech type, target, target group and intensity
Pan, Ronghao
Garcia-Diaz, Jose Antonio
Valencia-Garcia, Rafael
COMPUTER STANDARDS & INTERFACES, 2025, 94
[9] A multi-task learning framework for politeness and emotion detection in dialogues for mental health counselling and legal aid
Priya, Priyanshu
Firdaus, Mauajama
Ekbal, Asif
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 224
