Deep learning-based approaches for abusive content detection and classification for multi-class online user-generated data

被引:2
|
作者
Kaur S. [1 ]
Singh S. [1 ]
Kaushal S. [1 ]
机构
[1] University Institute of Engineering and Technology, Panjab University, Chandigarh
来源
International Journal of Cognitive Computing in Engineering | 2024年 / 5卷
关键词
Abusive language; Deep learning models; Gated recurrent unit; Long short term memory; Offensive language; Offensive language categorization; Target identification;
D O I
10.1016/j.ijcce.2024.02.002
中图分类号
学科分类号
摘要
With the rapid growth of social media culture, the use of offensive or hateful language has surged, which necessitates the development of effective abusive language detection models for online platforms. This paper focuses on developing a multi-class classification model to identify different types of offensive language. The input data is taken in the form of labeled tweets and is classified into offensive language detection, offensive language categorization, and offensive language target identification. The data undergoes pre-processing, which removes NaN value and punctuation, as well as performs tokenization followed by the generation of a word cloud to assess data quality. Further, the tf-idf technique is used for the selection of features. In the case of classifiers, multiple deep learning techniques, namely, bidirectional gated recurrent unit, multi-dense long short-term memory, bidirectional long short-term memory, gated recurrent unit, and long short-term memory, are applied where it has been found that all the models, except long short-term memory, achieved a high accuracy of 99.9 % for offensive language target identification. Bidirectional LSTM and multi-dense LSTM obtained the lowest loss and RMSE values of 0.01 and 0.1, respectively. This research provides valuable insights and contributes to the development of effective abusive language detection methods to promote a safe and respectful online environment. The insights gained can aid platform administrators in efficiently moderating content and taking appropriate actions against offensive language. © 2024
引用
收藏
页码:104 / 122
页数:18
相关论文
共 50 条
  • [1] Abusive Content Detection in Online User-Generated Data: A survey
    Kaur, Simrat
    Singh, Sarbjeet
    Kaushal, Sakshi
    AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 274 - 281
  • [2] Deep learning-based image classification for online multi-coal and multi-class sorting
    Liu, Yang
    Zhang, Zelin
    Liu, Xiang
    Wang, Lei
    Xia, Xuhui
    COMPUTERS & GEOSCIENCES, 2021, 157
  • [3] Deep learning based sentiment classification on user-generated big data
    Kumar A.
    Jaiswal A.
    Jaiswal, Arunima (arunimajaiswal@gmail.com), 1600, Bentham Science Publishers (13): : 1047 - 1056
  • [4] Ensemble Deep Learning for Multilabel Binary Classification of User-Generated Content
    Haralabopoulos, Giannis
    Anagnostopoulos, Ioannis
    McAuley, Derek
    ALGORITHMS, 2020, 13 (04)
  • [5] Deep Learning-Based Multi-Class Classification of Breast Digital Pathology Images
    Mi, Weiming
    Li, Junjie
    Guo, Yucheng
    Ren, Xinyu
    Liang, Zhiyong
    Zhang, Tao
    Zou, Hao
    CANCER MANAGEMENT AND RESEARCH, 2021, 13 : 4605 - 4617
  • [6] A Semantics-based Approach to Disclosure Classification in User-Generated Online Content
    Akiti, Chandan
    Squicciarini, Anna
    Rajtmajer, Sarah
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020,
  • [7] On the Machine Learning-based Multi-class Classification of Microscopic Colitis
    Tara, Vivek
    Mitra, Dipankar
    Muduganti, Aditi
    Mali, Padmavathi
    Maiti, Srabana
    Dey, Shuvashis
    Gomes, Rahul
    2024 IEEE INTERNATIONAL CONFERENCE ON ELECTRO INFORMATION TECHNOLOGY, EIT 2024, 2024, : 38 - 43
  • [8] An active learning-based SVM multi-class classification model
    Guo, Husheng
    Wang, Wenjian
    PATTERN RECOGNITION, 2015, 48 (05) : 1577 - 1597
  • [9] Deep Learning-Based Skin Lesion Multi-class Classification with Global Average Pooling Improvement
    Raghavendra, Paravatham V. S. P.
    Charitha, C.
    Begum, K. Ghousiya
    Prasath, V. B. S.
    JOURNAL OF DIGITAL IMAGING, 2023, 36 (05) : 2227 - 2248
  • [10] An Effective Deep Learning Based Multi-Class Classification of DoS and DDoS Attack Detection
    Silivery, Arun Kumar
    Rao, Kovvur Ram Mohan
    Kumar, L. K. Suresh
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (04) : 421 - 431