Hate speech and abusive language detection in Indonesian social media: Progress and challenges

被引:6
|
作者
Ibrohim, Muhammad Okky [1 ]
Budi, Indra [2 ]
机构
[1] Univ Torino, Dipartimento Informat, Turin, Italy
[2] Univ Indonesia, Fac Comp Sci, Depok, Indonesia
关键词
Hate speech; Abusive language; Indonesian social media;
D O I
10.1016/j.heliyon.2023.e18647
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Nowadays Hate Speech and Abusive Language (HSAL) have spread extensively over social media. The easy use of social media allows people to abuse the media to spread HSAL. Hate speech and abusive language in social media must be detected because they can trigger conflict among citizens. Not only in social media, but HSAL also often trigger conflict in real life. In recent years, many scholars have researched HSAL detection in various languages and media. However, there are still many tasks on HSAL detection that need to be done to develop a better HSAL detection system. This paper discusses a summary of Indonesian HSAL detection research, conducted by utilizing the Kitchenham systematic literature review method. Based on our summary, we found that most Indonesian HSAL research still uses the classic machine-learning approach with classic text representation features that experimented on the Twitter text dataset. We also found several challenges and tasks that need to be addressed to build a better HSAL detection system in Indonesian social media that can detect the hate speech target, category, and levels; and the hate speech buzzer, thread starter, and fake account spreader.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Multi-label text classification on unbalanced Twitter with monolingual model and hyperparameter optimization for hate speech and abusive language detection
    Alzahrani, Ahmad A.
    Bramantoro, Arif
    Permana, Asep
    INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2024, 11 (05): : 177 - 185
  • [22] A Measurement Study of Hate Speech in Social Media
    Mondal, Mainack
    Silva, Leandro Araujo
    Benevenuto, Fabricio
    PROCEEDINGS OF THE 28TH ACM CONFERENCE ON HYPERTEXT AND SOCIAL MEDIA (HT'17), 2017, : 85 - 94
  • [23] HATE SPEECH ON SOCIAL MEDIA - CROATIAN EXPERIENCE
    Tomisa, Mario
    Milkovic, Marin
    Vusic, Damir
    Pavicic, Ivona
    ECONOMIC AND SOCIAL DEVELOPMENT (ESD 2019), 2019, : 256 - 263
  • [24] Spread of Hate Speech in Online Social Media
    Mathew, Binny
    Dutt, Ritam
    Goyal, Pawan
    Mukherjee, Animesh
    PROCEEDINGS OF THE 11TH ACM CONFERENCE ON WEB SCIENCE (WEBSCI'19), 2019, : 173 - 182
  • [25] Hate Speech Classification in Indonesian Language Tweets Convolutional Neural Network
    Taradhita, Dewa Ayu Nadia
    Putra, I. Ketut Gede Darma
    JOURNAL OF ICT RESEARCH AND APPLICATIONS, 2021, 14 (03) : 225 - 239
  • [26] Offensive Language and Hate Speech Detection for Danish
    Sigurbergsson, Gudbjartur Ingi
    Derczynski, Leon
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3498 - 3508
  • [27] Online Multilingual Hate Speech Detection: Experimenting with Hindi and English Social Media
    Vashistha, Neeraj
    Zubiaga, Arkaitz
    INFORMATION, 2021, 12 (01) : 1 - 16
  • [28] Leveraging Transfer Learning for Hate Speech Detection in Portuguese Social Media Posts
    Ramos, Gil
    Batista, Fernando
    Ribeiro, Ricardo
    Fialho, Pedro
    Moro, Sergio
    Fonseca, Antonio
    Guerra, Rita
    Carvalho, Paula
    Marques, Catarina
    Silva, Claudia
    IEEE ACCESS, 2024, 12 : 101374 - 101389
  • [29] Hate Speech Detection in Indonesian Language on Instagram Comment Section Using Deep Neural Network Classification Method
    Perdana, Sakti Putra B. B.
    Irawan, Budhi
    Setianingsih, Casi
    2019 IEEE ASIA PACIFIC CONFERENCE ON WIRELESS AND MOBILE (APWIMOB), 2019, : 143 - 149
  • [30] Combating the challenges of social media hate speech in a polarized society A Twitter ego lexalytics approach
    Udanor, Collins
    Anyanwu, Chinatu C.
    DATA TECHNOLOGIES AND APPLICATIONS, 2019, 53 (04) : 501 - 527