Ex-ThaiHate: A Generative Multi-task Framework for Sentiment and Emotion Aware Hate Speech Detection with Explanation in Thai

被引:0
|
作者
Maity, Krishanu [1 ]
Bhattacharya, Shaubhik [1 ]
Phosit, Salisa [2 ]
Kongsamlit, Sawarod [2 ]
Saha, Sriparna [1 ]
Pasupa, Kitsuchart [2 ]
机构
[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Patna 801103, Bihar, India
[2] King Mongkuts Inst Technol Ladkrabang, Sch Informat Technol, Bangkok 10520, Thailand
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VI | 2023年 / 14174卷
关键词
Hate Speech; Sentiment; Emotion; Explainability; Thai; Multi-task;
D O I
10.1007/978-3-031-43427-3_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media platforms have both positive and negative impacts on users in diverse societies. One of the adverse effects of social media platforms is the usage of hate and offensive language, which not only fosters prejudice but also harms the vulnerable. Additionally, a person's sentiment and emotional state heavily influence the intended content of any social media post. Despite extensive research being conducted to detect online hate speech in English, there is a lack of similar studies on low-resource languages such as Thai. The recent enactment of laws like the "right to explanations" in the General Data Protection Regulation has stimulated the development of interpretable models rather than solely focusing on performance. Motivated by this, we created the first benchmark hate speech corpus, called Ex-ThaiHate, in the Thai language. Each post is annotated with four labels, namely hate, sentiment, emotion, and rationales (explainability), which specify the phrases that are responsible for annotating the post as hate. In order to investigate the effect of sentiment and emotional information on detecting hate speech posts, we propose a unified generative framework called GenX, which redefines this multi-task problem as a text-to-text generation task to simultaneously solve four tasks: hate-speech identification, rationale detection, sentiment, and emotion detection. Our extensive experiments demonstrate that GenX significantly outperforms all baselines and state-of-the-art models, thereby highlighting its effectiveness in detecting hate speech and identifying the rationales in low-resource languages. The code and dataset are available at https://github.com/dsmlr/Ex-ThaiHate. Disclaimer: The article contains offensive text and profanity. This is due to the nature of the work and does not reflect any opinion or stance of the authors.
引用
收藏
页码:139 / 156
页数:18
相关论文
共 9 条
  • [1] Towards Analyzing the Efficacy of Multi-task Learning in Hate Speech Detection
    Maity, Krishanu
    Balaji, Gokulapriyan
    Saha, Sriparna
    NEURAL INFORMATION PROCESSING, ICONIP 2023, PT VI, 2024, 14452 : 317 - 328
  • [2] HHSD: Hindi Hate Speech Detection Leveraging Multi-Task Learning
    Kapil, Prashant
    Kumari, Gitanjali
    Ekbal, Asif
    Pal, Santanu
    Chatterjee, Arindam
    Vinutha, B. N.
    IEEE ACCESS, 2023, 11 : 101460 - 101473
  • [3] FADOHS: Framework for Detection and Integration of Unstructured Data of Hate Speech on Facebook Using Sentiment and Emotion Analysis
    Rodriguez, Axel
    Yi-Ling Chen
    Argueta, Carlos
    IEEE ACCESS, 2022, 10 : 22400 - 22419
  • [4] A Multitask Framework for Sentiment, Emotion and Sarcasm aware Cyberbullying Detection from Multi-modal Code-Mixed Memes
    Maity, Krishanu
    Jha, Prince
    Saha, Sriparna
    Bhattacharyya, Pushpak
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1739 - 1749
  • [5] Generalizing Hate Speech Detection Using Multi-Task Learning: A Case Study of Political Public Figures
    Yuan, Lanqin
    Rizoiu, Marian-Andrei
    COMPUTER SPEECH AND LANGUAGE, 2025, 89
  • [6] Arabic Offensive and Hate Speech Detection Using a Cross-Corpora Multi-Task Learning Model
    Aldjanabi, Wassen
    Dahou, Abdelghani
    Al-qaness, Mohammed A. A.
    Abd Elaziz, Mohamed
    Helmi, Ahmed Mohamed
    Damasevicius, Robertas
    INFORMATICS-BASEL, 2021, 8 (04):
  • [7] A transformer-based multi-task framework for joint detection of aggression and hate on social media data
    Ghosh, Soumitra
    Priyankar, Amit
    Ekbal, Asif
    Bhattacharyya, Pushpak
    NATURAL LANGUAGE ENGINEERING, 2023, 29 (06) : 1495 - 1515
  • [8] Spanish MTLHateCorpus 2023: Multi-task learning for hate speech detection to identify speech type, target, target group and intensity
    Pan, Ronghao
    Garcia-Diaz, Jose Antonio
    Valencia-Garcia, Rafael
    COMPUTER STANDARDS & INTERFACES, 2025, 94
  • [9] A multi-task learning framework for politeness and emotion detection in dialogues for mental health counselling and legal aid
    Priya, Priyanshu
    Firdaus, Mauajama
    Ekbal, Asif
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 224