Hate Speech Detection in Twitter using Transformer Methods

被引:0
|
作者
Mutanga, Raymond T. [1 ]
Naicker, Nalindren [1 ]
Olugbara, Oludayo O. [1 ]
机构
[1] Durban Univ Technol, ICT & Soc Res Grp, Dept Informat Syst, ZA-4000 Durban, South Africa
关键词
Attention transformer; deep learning; neural network; recurrent network; sequence transduction;
D O I
10.14569/IJACSA.2020.0110972
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Social media networks such as Twitter are increasingly utilized to propagate hate speech while facilitating mass communication. Recent studies have highlighted a strong correlation between hate speech propagation and hate crimes such as xenophobic attacks. Due to the size of social media and the consequences of hate speech in society, it is essential to develop automated methods for hate speech detection in different social media platforms. Several studies have investigated the application of different machine learning algorithms for hate speech detection. However, the performance of these algorithms is generally hampered by inefficient sequence transduction. The Vanilla recurrent neural networks and recurrent neural networks with attention have been established as state-of-the-art methods for the assignments of sequence modeling and sequence transduction. Unfortunately, these methods suffer from intrinsic problems such as long-term dependency and lack of parallelization. In this study, we investigate a transformer-based method and tested it on a publicly available multiclass hate speech corpus containing 24783 labeled tweets. DistilBERT transformer method was compared against attention-based recurrent neural networks and other transformer baselines for hate speech detection in Twitter documents. The study results show that DistilBERT transformer outperformed the baseline algorithms while allowing parallelization.
引用
收藏
页码:614 / 620
页数:7
相关论文
共 50 条
  • [1] Twitter Hate Speech Detection using Machine Learning
    Janardhan, G.
    Saikiran, Bollu
    Reddy, Inugala Swanith
    Abhishek, Mogilicherla
    2024 4TH INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2024, 2024, : 270 - 278
  • [2] Hate speech detection on Twitter using transfer learning
    Ali, Raza
    Farooq, Umar
    Arshad, Umair
    Shahzad, Waseem
    Beg, Mirza Omer
    COMPUTER SPEECH AND LANGUAGE, 2022, 74
  • [3] Automated Hate Speech Detection on Twitter
    Koushik, Garima
    Rajeswari, K.
    Muthusamy, Suresh Kannan
    2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2019,
  • [4] Levantine hate speech detection in twitter
    AbdelHamid, Medyan
    Jafar, Assef
    Rahal, Yasser
    SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)
  • [5] Levantine hate speech detection in twitter
    Medyan AbdelHamid
    Assef Jafar
    Yasser Rahal
    Social Network Analysis and Mining, 2022, 12
  • [6] Multilingual Hate Speech Detection Using Ensemble of Transformer Models
    Jahan, Md. Saroar
    Hassan, Fadi
    Aransa, Walid
    Bouchekif, Abdessalam
    CEUR Workshop Proceedings, 2023, 3681 : 588 - 597
  • [7] Hate speech detection on multilingual twitter using convolutional neural networks
    Elouali A.
    Elberrichi Z.
    Elouali N.
    Elouali, Aya (n.elouali@esi-sba.dz), 1600, International Information and Engineering Technology Association (34): : 81 - 88
  • [8] Automated Detection of Hate Speech towards Woman on Twitter
    Sahi, Havvanur
    Kilic, Yasemin
    Saglam, Rahime Belen
    2018 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2018, : 533 - 536
  • [9] Twitter Hate Speech Detection: A Systematic Review of Methods, Taxonomy Analysis, Challenges, and Opportunities
    Mansur, Zainab
    Omar, Nazlia
    Tiun, Sabrina
    IEEE ACCESS, 2023, 11 : 16226 - 16249
  • [10] NAIJAHATE: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data
    Tonneau, Manuel
    de Castro, Pedro Vitor Quinta
    Lasri, Karim
    Farouq, Ibrahim
    Subramanian, Lakshminarayanan
    Orozco-Olvera, Victor
    Fraiberger, Samuel P.
    arXiv,