Deep learning-based approaches for abusive content detection and classification for multi-class online user-generated data

被引：2

作者：

Kaur S. ^{[1
]}

Singh S. ^{[1
]}

Kaushal S. ^{[1
]}

机构：

[1] University Institute of Engineering and Technology, Panjab University, Chandigarh

来源：

International Journal of Cognitive Computing in Engineering | 2024年 / 5卷

关键词：

Abusive language; Deep learning models; Gated recurrent unit; Long short term memory; Offensive language; Offensive language categorization; Target identification;

D O I：

10.1016/j.ijcce.2024.02.002

中图分类号：

学科分类号：

摘要：

With the rapid growth of social media culture, the use of offensive or hateful language has surged, which necessitates the development of effective abusive language detection models for online platforms. This paper focuses on developing a multi-class classification model to identify different types of offensive language. The input data is taken in the form of labeled tweets and is classified into offensive language detection, offensive language categorization, and offensive language target identification. The data undergoes pre-processing, which removes NaN value and punctuation, as well as performs tokenization followed by the generation of a word cloud to assess data quality. Further, the tf-idf technique is used for the selection of features. In the case of classifiers, multiple deep learning techniques, namely, bidirectional gated recurrent unit, multi-dense long short-term memory, bidirectional long short-term memory, gated recurrent unit, and long short-term memory, are applied where it has been found that all the models, except long short-term memory, achieved a high accuracy of 99.9 % for offensive language target identification. Bidirectional LSTM and multi-dense LSTM obtained the lowest loss and RMSE values of 0.01 and 0.1, respectively. This research provides valuable insights and contributes to the development of effective abusive language detection methods to promote a safe and respectful online environment. The insights gained can aid platform administrators in efficiently moderating content and taking appropriate actions against offensive language. © 2024

引用

页码：104 / 122

页数：18

共 50 条

[1] Abusive Content Detection in Online User-Generated Data: A survey
Kaur, Simrat
Singh, Sarbjeet
Kaushal, Sakshi
AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 274 - 281
[2] Deep learning-based image classification for online multi-coal and multi-class sorting
Liu, Yang
Zhang, Zelin
Liu, Xiang
Wang, Lei
Xia, Xuhui
COMPUTERS & GEOSCIENCES, 2021, 157
[3] Deep learning based sentiment classification on user-generated big data
Kumar A.
Jaiswal A.
Jaiswal, Arunima (arunimajaiswal@gmail.com), 1600, Bentham Science Publishers (13): : 1047 - 1056
[4] Ensemble Deep Learning for Multilabel Binary Classification of User-Generated Content
Haralabopoulos, Giannis
Anagnostopoulos, Ioannis
McAuley, Derek
ALGORITHMS, 2020, 13 (04)
[5] Deep Learning-Based Multi-Class Classification of Breast Digital Pathology Images
Mi, Weiming
Li, Junjie
Guo, Yucheng
Ren, Xinyu
Liang, Zhiyong
Zhang, Tao
Zou, Hao
CANCER MANAGEMENT AND RESEARCH, 2021, 13 : 4605 - 4617
[6] A Semantics-based Approach to Disclosure Classification in User-Generated Online Content
Akiti, Chandan
Squicciarini, Anna
Rajtmajer, Sarah
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020,
[7] On the Machine Learning-based Multi-class Classification of Microscopic Colitis
Tara, Vivek
Mitra, Dipankar
Muduganti, Aditi
Mali, Padmavathi
Maiti, Srabana
Dey, Shuvashis
Gomes, Rahul
2024 IEEE INTERNATIONAL CONFERENCE ON ELECTRO INFORMATION TECHNOLOGY, EIT 2024, 2024, : 38 - 43
[8] An active learning-based SVM multi-class classification model
Guo, Husheng
Wang, Wenjian
PATTERN RECOGNITION, 2015, 48 (05) : 1577 - 1597
[9] Deep Learning-Based Skin Lesion Multi-class Classification with Global Average Pooling Improvement
Raghavendra, Paravatham V. S. P.
Charitha, C.
Begum, K. Ghousiya
Prasath, V. B. S.
JOURNAL OF DIGITAL IMAGING, 2023, 36 (05) : 2227 - 2248
[10] An Effective Deep Learning Based Multi-Class Classification of DoS and DDoS Attack Detection
Silivery, Arun Kumar
Rao, Kovvur Ram Mohan
Kumar, L. K. Suresh
INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (04) : 421 - 431

← 1 2 3 4 5 →