Hate speech and abusive language detection in Indonesian social media: Progress and challenges

被引:6
|
作者
Ibrohim, Muhammad Okky [1 ]
Budi, Indra [2 ]
机构
[1] Univ Torino, Dipartimento Informat, Turin, Italy
[2] Univ Indonesia, Fac Comp Sci, Depok, Indonesia
关键词
Hate speech; Abusive language; Indonesian social media;
D O I
10.1016/j.heliyon.2023.e18647
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Nowadays Hate Speech and Abusive Language (HSAL) have spread extensively over social media. The easy use of social media allows people to abuse the media to spread HSAL. Hate speech and abusive language in social media must be detected because they can trigger conflict among citizens. Not only in social media, but HSAL also often trigger conflict in real life. In recent years, many scholars have researched HSAL detection in various languages and media. However, there are still many tasks on HSAL detection that need to be done to develop a better HSAL detection system. This paper discusses a summary of Indonesian HSAL detection research, conducted by utilizing the Kitchenham systematic literature review method. Based on our summary, we found that most Indonesian HSAL research still uses the classic machine-learning approach with classic text representation features that experimented on the Twitter text dataset. We also found several challenges and tasks that need to be addressed to build a better HSAL detection system in Indonesian social media that can detect the hate speech target, category, and levels; and the hate speech buzzer, thread starter, and fake account spreader.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Separating Hate Speech from Abusive Language on Indonesian Twitter
    Ibrahim, Muhammad Amien
    Sagala, Noviyanti Tri Maretta
    Arifin, Samsul
    Nariswari, Rinda
    Murnaka, Nerru Pranuta
    Prasetyo, Puguh Wahyu
    2022 INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ITS APPLICATIONS (ICODSA), 2022, : 187 - 191
  • [2] Identification of Hate Speech and Abusive Language on Indonesian Twitter Using theWord2vec, Part of Speech and Emoji Features
    Ibrohim, Muhammad Okky
    Setiadi, Muhammad Akbar
    Budi, Indra
    PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION SCIENCE AND SYSTEM, AISS 2019, 2019,
  • [3] Multi-label Classification for Hate Speech and Abusive Language in Indonesian-Local Languages
    Asti, Ajeng Dwi
    Budi, Indra
    Ibrohim, Muhammad Okky
    13TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS 2021), 2021, : 325 - 330
  • [4] Moralized language predicts hate speech on social media
    Solovev, Kirill
    Proellochs, Nicolas
    PNAS NEXUS, 2023, 2 (01):
  • [5] Challenges of Hate Speech Detection in Social Media: Data Scarcity, and Leveraging External Resources
    Kovács G.
    Alonso P.
    Saini R.
    SN Computer Science, 2021, 2 (2)
  • [6] Transfer learning for hate speech detection in social media
    Yuan, Lanqin
    Wang, Tianyu
    Ferraro, Gabriela
    Suominen, Hanna
    Rizoiu, Marian-Andrei
    JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2023, 6 (02): : 1081 - 1101
  • [7] Multimodal Hate Speech Detection in Greek Social Media
    Perifanos, Konstantinos
    Goutsos, Dionysis
    MULTIMODAL TECHNOLOGIES AND INTERACTION, 2021, 5 (07)
  • [8] Hate and offensive speech detection on Arabic social media
    Alsafari S.
    Sadaoui S.
    Mouhoub M.
    Online Social Networks and Media, 2020, 19
  • [9] Transfer learning for hate speech detection in social media
    Lanqin Yuan
    Tianyu Wang
    Gabriela Ferraro
    Hanna Suominen
    Marian-Andrei Rizoiu
    Journal of Computational Social Science, 2023, 6 : 1081 - 1101
  • [10] Hate Speech on Social Media
    Guiora, Amos
    Park, Elizabeth A.
    PHILOSOPHIA, 2017, 45 (03) : 957 - 971