Unintended bias evaluation: An analysis of hate speech detection and gender bias mitigation on social media using ensemble learning

被引：14

作者：

Nascimento, Francimaria R. S. ^{[1
]}

Cavalcanti, George D. C. ^{[1
]}

Da Costa-Abreu, Marjory ^{[2
]}

机构：

[1] Univ Fed Pernambuco UFPE, Ctr Informat CIn, Av Jornalista Anibal Fernandes S-N, Recife, PE, Brazil

[2] Sheffield Hallam Univ, Dept Comp, Sheffield, S Yorkshire, England

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2022年 / 201卷

关键词：

Hate speech detection; Ensemble learning; Gender bias; Multi-features; TWITTER;

D O I：

10.1016/j.eswa.2022.117032

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Hate speech on online social media platforms is now at a level that has been considered a serious concern by governments, media outlets, and scientists, especially because it is easily spread, promoting harm to individuals and society, and made it virtually impossible to tackle with using just human analysis. Automatic approaches using machine learning and natural language processing are helpful for detection. For such applications, amongst several different approaches, it is essential to investigate the systems' robustness to deal with biases towards identity terms (gender, race, religion, for example). In this work, we analyse gender bias in different datasets and proposed a ensemble learning approach based on different feature spaces for hate speech detection with the aim that the model can learn from different abstractions of the problem, namely unintended bias evaluation metrics. We have used nine different feature spaces to train the pool of classifiers and evaluated our approach on a publicly available corpus, and our results demonstrate its effectiveness compared to state-of-the-art solutions.

引用

页数：14

共 71 条

[1] Combating hate speech using an adaptive ensemble learning model with a case study on COVID-19 [J].

Agarwal, Shivang ;

Chowdary, C. Ravindranath .

EXPERT SYSTEMS WITH APPLICATIONS, 2021, 185

[2] Using Word Embedding and Ensemble Learning for Highly Imbalanced Data Sentiment Analysis in Short Arabic Text [J].

Al-Azani, Sadam ;

El-Alfy, El-Sayed M. .

8TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT-2017) AND THE 7TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT 2017), 2017, 109 :359-366

[3] Automatic hate speech detection using killer natural language processing optimizing ensemble deep learning approach [J].

Al-Makhadmeh, Zafer ;

Tolba, Amr .

COMPUTING, 2020, 102 (02) :501-522

[4] Supervised Classifiers to Identify Hate Speech on English and Spanish Tweets [J].

Almatarneh, Sattam ;

Gamallo, Pablo ;

Ribadas Pena, Francisco J. ;

Alexeev, Alexey .

DIGITAL LIBRARIES AT THE CROSSROADS OF DIGITAL INFORMATION FOR THE FUTURE, ICADL 2019, 2019, 11853 :23-30

[5]

[Anonymous], 2013, P WORKSHOP INT C LEA

[6] A survey of Twitter research: Data model, graph structure, sentiment analysis and attacks? [J].

Antonakaki, Despoina ;

Fragopoulou, Paraskevi ;

Ioannidis, Sotiris .

EXPERT SYSTEMS WITH APPLICATIONS, 2021, 164

[7] Stereotypical Bias Removal for Hate Speech Detection Task using Knowledge-based Generalizations [J].

Badjatiya, Pinkesh ;

Gupta, Manish ;

Varma, Vasudeva .

WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, :49-59

[8]

Basile Valerio, 2019, P 13 INT WORKSH SEM, P54, DOI 10.18653/v1/S19-2007

[9]

Bojanowski P., 2017, Trans. Assoc. Comput. Linguistics, V5, P135, DOI [DOI 10.1162/TACLA00051, 10.1162/tacl_a_00051, DOI 10.1162/TACL_A_00051]

[10]

Bolukbasi T, 2016, ADV NEUR IN, V29

← 1 2 3 4 5 6 7 8 →