A deep dive into automated sexism detection using fine-tuned deep learning and large language models

被引：1

作者：

Vetagiri, Advaitha ^{[1
]}

Pakray, Partha ^{[1
]}

Das, Amitava ^{[2
,3
]}

机构：

[1] Natl Inst Technol Silchar, Comp Sci & Engn, Silchar 7 88010, Assam, India

[2] UofSC, Artificial Intelligence Inst, Columbia, SC USA

[3] Wipro AI Lab, Bangalore, Karnataka, India

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2025年 / 145卷

关键词：

Online sexism; Sexism classification; MultiHate dataset; Machine learning; Deep learning; Convolutional Neural Networks-Bidirectional; Long Short-Term Memory; Generative Pre-trained Transformer 2; HATE SPEECH DETECTION; ONLINE;

D O I：

10.1016/j.engappai.2025.110167

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The issue of sexism in online content has recently been a significant concern. With the increasing number of online interactions and the rise of social media platforms, the need for automated techniques to identify and classify sexism has become more critical than ever. This paper addresses this problem by fine-tuning deep-learning models for sexism classification using "MultiHate". It is a comprehensive dataset created by curating ten different datasets on sexism. The dataset consists of 1.76 M English texts labelled as sexist and not sexist, then fine-tuned two deep learning models, Convolutional Neural Networks-Bidirectional Long Short-Term Memory and Generative Pre-trained Transformer 2, which accurately detect and classify sexism. A comparative analysis has been conducted on several machine learning and deep learning models using the MultiHate dataset. Investigation reveals that the Generative Pre-trained Transformer 2 model outperforms other models with an accuracy of 92%, while the Convolutional Neural Networks-Bidirectional Long Short-Term Memory model achieved an accuracy of 90% using precision, recall, and F1 scores as performance metrics. The models' performances are promising, indicating that automated techniques can be employed to classify sexist content effectively. A comprehensive error analysis of the models' performance has been presented, highlighting their limitations and challenges. The computational time required for training and testing the models is a significant challenge, especially for larger datasets.

引用

页数：17

共 50 条

[1] Fine-Grained Multi-label Sexism Classification Using a Semi-Supervised Multi-level Neural Approach [J].

Abburi, Harika ;

Parikh, Pulkit ;

Chhaya, Niyati ;

Varma, Vasudeva .

DATA SCIENCE AND ENGINEERING, 2021, 6 (04) :359-379

[2]

2023, Arxiv, DOI arXiv:2303.08774

[3]

Ahuir Vicent, 2022, IBERLEF SEPLN CORUNA, P1107

[4]

Alfina I, 2017, INT C ADV COMP SCI I, P233, DOI 10.1109/ICACSIS.2017.8355039

[5] Deep Learning for Hate Speech Detection in Tweets [J].

Badjatiya, Pinkesh ;

Gupta, Shashank ;

Gupta, Manish ;

Varma, Vasudeva .

WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, :759-760

[6]

Barnwal SK, 2022, PROCEEDINGS OF THE 16TH INTERNATIONAL WORKSHOP ON SEMANTIC EVALUATION, SEMEVAL-2022, P733

[7] Benevolent and hostile sexism in a shifting global context [J].

Barreto, Manuela ;

Doyle, David Matthew .

NATURE REVIEWS PSYCHOLOGY, 2023, 2 (02) :98-111

[8] Nuanced Metrics for Measuring Unintended Bias with Real Data for Text Classification [J].

Borkan, Daniel ;

Dixon, Lucas ;

Sorensen, Jeffrey ;

Thain, Nithum ;

Vasserman, Lucy .

COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2019 ), 2019, :491-500

[9] Development of a scale measuring online sexual harassment: Examining gender differences and the emotional impact of sexual harassment victimization online [J].

Buchanan, Niall ;

Mahoney, Adam .

LEGAL AND CRIMINOLOGICAL PSYCHOLOGY, 2022, 27 (01) :63-81

[10] A literature survey on multimodal and multilingual automatic hate speech identification [J].

Chhabra, Anusha ;

Vishwakarma, Dinesh Kumar .

MULTIMEDIA SYSTEMS, 2023, 29 (03) :1203-1230

← 1 2 3 4 5 →