Unintended bias evaluation: An analysis of hate speech detection and gender bias mitigation on social media using ensemble learning

Cited by: 14
Authors
Nascimento, Francimaria R. S. [1]
Cavalcanti, George D. C. [1]
Da Costa-Abreu, Marjory [2]
Affiliations
[1] Univ Fed Pernambuco UFPE, Ctr Informat CIn, Av Jornalista Anibal Fernandes S-N, Recife, PE, Brazil
[2] Sheffield Hallam Univ, Dept Comp, Sheffield, S Yorkshire, England
Keywords
Hate speech detection; Ensemble learning; Gender bias; Multi-features; Twitter
DOI
10.1016/j.eswa.2022.117032
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Hate speech on online social media platforms has reached a level that governments, media outlets, and scientists consider a serious concern, especially because it spreads easily, harms individuals and society, and is virtually impossible to tackle using human analysis alone. Automatic approaches using machine learning and natural language processing are helpful for detection. For such applications, among several different approaches, it is essential to investigate the systems' robustness to biases towards identity terms (for example, gender, race, and religion). In this work, we analyse gender bias in different datasets using unintended bias evaluation metrics, and propose an ensemble learning approach based on different feature spaces for hate speech detection, with the aim that the model can learn from different abstractions of the problem. We used nine different feature spaces to train the pool of classifiers and evaluated our approach on a publicly available corpus; our results demonstrate its effectiveness compared to state-of-the-art solutions.
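As a rough illustration of what an unintended bias evaluation metric can look like, the Python sketch below compares the false positive rate on texts that mention an identity term with the false positive rate on the whole dataset. The abstract does not name the metrics used in the paper, so the choice of a false-positive-rate gap, the function names, the identity terms, and the toy data are all assumptions made for illustration.

```python
# Illustrative sketch only: the metric choice, function names, and toy data
# are assumptions, not taken from the paper.

def false_positive_rate(labels, preds):
    """Share of non-hateful items (label 0) that were flagged as hateful (pred 1)."""
    negative_preds = [p for y, p in zip(labels, preds) if y == 0]
    if not negative_preds:
        return 0.0  # no non-hateful items here; this sketch treats the rate as 0
    return sum(negative_preds) / len(negative_preds)


def false_positive_rate_gaps(texts, labels, preds, identity_terms):
    """Return (sum of gaps, per-term gap) between subgroup and overall FPR."""
    overall = false_positive_rate(labels, preds)
    gaps = {}
    for term in identity_terms:
        # Subgroup: texts that contain the identity term as a whole token.
        idx = [i for i, text in enumerate(texts) if term in text.lower().split()]
        subgroup = false_positive_rate(
            [labels[i] for i in idx], [preds[i] for i in idx]
        )
        gaps[term] = abs(overall - subgroup)
    return sum(gaps.values()), gaps


# Toy data: label 1 = hateful, 0 = not hateful; preds are hypothetical model outputs.
texts = [
    "women are great",
    "she is a doctor",
    "men are great",
    "he is a doctor",
    "I hate women",
    "nice weather today",
]
labels = [0, 0, 0, 0, 1, 0]
preds = [1, 0, 0, 0, 1, 0]

total_gap, per_term_gap = false_positive_rate_gaps(
    texts, labels, preds, ["women", "men"]
)
print(total_gap, per_term_gap)
```

In this toy data the hypothetical classifier wrongly flags a harmless sentence containing "women", so that term shows a larger gap than "men". A gap near zero for every term would suggest the classifier does not over-flag texts simply because they mention an identity term.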
Pages: 14