Hate speech and abusive language detection in Indonesian social media: Progress and challenges

被引：6

作者：

Ibrohim, Muhammad Okky ^{[1
]}

Budi, Indra ^{[2
]}

机构：

[1] Univ Torino, Dipartimento Informat, Turin, Italy

[2] Univ Indonesia, Fac Comp Sci, Depok, Indonesia

来源：

HELIYON | 2023年 / 9卷 / 08期

关键词：

Hate speech; Abusive language; Indonesian social media;

D O I：

10.1016/j.heliyon.2023.e18647

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Nowadays Hate Speech and Abusive Language (HSAL) have spread extensively over social media. The easy use of social media allows people to abuse the media to spread HSAL. Hate speech and abusive language in social media must be detected because they can trigger conflict among citizens. Not only in social media, but HSAL also often trigger conflict in real life. In recent years, many scholars have researched HSAL detection in various languages and media. However, there are still many tasks on HSAL detection that need to be done to develop a better HSAL detection system. This paper discusses a summary of Indonesian HSAL detection research, conducted by utilizing the Kitchenham systematic literature review method. Based on our summary, we found that most Indonesian HSAL research still uses the classic machine-learning approach with classic text representation features that experimented on the Twitter text dataset. We also found several challenges and tasks that need to be addressed to build a better HSAL detection system in Indonesian social media that can detect the hate speech target, category, and levels; and the hate speech buzzer, thread starter, and fake account spreader.

引用

页数：16

共 50 条

[21] Multi-label text classification on unbalanced Twitter with monolingual model and hyperparameter optimization for hate speech and abusive language detection
Alzahrani, Ahmad A.
Bramantoro, Arif
Permana, Asep
INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2024, 11 (05): : 177 - 185
[22] A Measurement Study of Hate Speech in Social Media
Mondal, Mainack
Silva, Leandro Araujo
Benevenuto, Fabricio
PROCEEDINGS OF THE 28TH ACM CONFERENCE ON HYPERTEXT AND SOCIAL MEDIA (HT'17), 2017, : 85 - 94
[23] HATE SPEECH ON SOCIAL MEDIA - CROATIAN EXPERIENCE
Tomisa, Mario
Milkovic, Marin
Vusic, Damir
Pavicic, Ivona
ECONOMIC AND SOCIAL DEVELOPMENT (ESD 2019), 2019, : 256 - 263
[24] Spread of Hate Speech in Online Social Media
Mathew, Binny
Dutt, Ritam
Goyal, Pawan
Mukherjee, Animesh
PROCEEDINGS OF THE 11TH ACM CONFERENCE ON WEB SCIENCE (WEBSCI'19), 2019, : 173 - 182
[25] Hate Speech Classification in Indonesian Language Tweets Convolutional Neural Network
Taradhita, Dewa Ayu Nadia
Putra, I. Ketut Gede Darma
JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2021, 14 (03) : 225 - 239
[26] Offensive Language and Hate Speech Detection for Danish
Sigurbergsson, Gudbjartur Ingi
Derczynski, Leon
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3498 - 3508
[27] Online Multilingual Hate Speech Detection: Experimenting with Hindi and English Social Media
Vashistha, Neeraj
Zubiaga, Arkaitz
INFORMATION, 2021, 12 (01) : 1 - 16
[28] Leveraging Transfer Learning for Hate Speech Detection in Portuguese Social Media Posts
Ramos, Gil
Batista, Fernando
Ribeiro, Ricardo
Fialho, Pedro
Moro, Sergio
Fonseca, Antonio
Guerra, Rita
Carvalho, Paula
Marques, Catarina
Silva, Claudia
IEEE ACCESS, 2024, 12 : 101374 - 101389
[29] Hate Speech Detection in Indonesian Language on Instagram Comment Section Using Deep Neural Network Classification Method
Perdana, Sakti Putra B. B.
Irawan, Budhi
Setianingsih, Casi
2019 IEEE ASIA PACIFIC CONFERENCE ON WIRELESS AND MOBILE (APWIMOB), 2019, : 143 - 149
[30] Combating the challenges of social media hate speech in a polarized society A Twitter ego lexalytics approach
Udanor, Collins
Anyanwu, Chinatu C.
DATA TECHNOLOGIES AND APPLICATIONS, 2019, 53 (04) : 501 - 527

← 1 2 3 4 5 →