Systematic Literature Review Of Hate Speech Detection With Text Mining

被引:7
作者
Rini [1 ]
Utami, Ema [1 ]
Hartanto, Anggit Dwi [2 ]
机构
[1] Univ Amikom Yogyakarta, Informat Engn, Yogyakarta, Indonesia
[2] Univ Amikom Yogyakarta, Fac Comp Sci, Yogyakarta, Indonesia
来源
PROCEEDINGS OF ICORIS 2020: 2020 THE 2ND INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEM (ICORIS) | 2020年
关键词
hate speech; classification; systematic literature review; text mining; TWITTER;
D O I
10.1109/ICORIS50180.2020.9320755
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Along with the increasing activity on social media, hate speech is getting out of control. Hate speech detection can be done by utilizing text mining technology. There have been many hate speech detection studies conducted. To identify and analyze research trends, data sources, methods and features used in hate speech detection, this systematic literature review was created. Until early 2020, the topics of hate speech were found, including hate speech against minorities, religion, women, the general election agenda, and politics. Sources of data that are widely used to be used as datasets come from twitter. Hate speech is not only classified into HS (hate speech) and Non-HS (non-hate speech) but can be further classified into racism, sexism, offensive, abusive, threats of violence and others. Of the 38 studies that meet inclusion and exclusion, there are 26 algorithms and 28 features that have been used to detect hate speech. However, these methods and features do not necessarily guarantee a good hate detection performance. Hate speech classification performance is also influenced by the dataset, the features chosen, the number of classes and mutually exclusive classes.
引用
收藏
页码:228 / 233
页数:6
相关论文
共 44 条
[1]  
Al-Makhadmeh Z., 2019, COMPUTING
[2]  
Alfina I, 2017, INT C ADV COMP SCI I, P233, DOI 10.1109/ICACSIS.2017.8355039
[3]   Supervised Classifiers to Identify Hate Speech on English and Spanish Tweets [J].
Almatarneh, Sattam ;
Gamallo, Pablo ;
Ribadas Pena, Francisco J. ;
Alexeev, Alexey .
DIGITAL LIBRARIES AT THE CROSSROADS OF DIGITAL INFORMATION FOR THE FUTURE, ICADL 2019, 2019, 11853 :23-30
[4]   Hate Speech Detection on Indonesian Long Text Documents Using Machine Learning Approach [J].
Aulia, Nofa ;
Budi, Indra .
ICCAI '19 - PROCEEDINGS OF THE 2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING AND ARTIFICIAL INTELLIGENCE, 2019, :164-169
[5]   Automatic Classification of Abusive Language and Personal Attacks in Various Forms of Online Communication [J].
Bourgonje, Peter ;
Moreno-Schneider, Julian ;
Srivastava, Ankit ;
Rehm, Georg .
LANGUAGE TECHNOLOGIES FOR THE CHALLENGES OF THE DIGITAL AGE, GSCL 2017, 2018, 10713 :180-191
[6]  
Davidson T., 2017, P INT AAAI C WEB SOC, P512, DOI DOI 10.1609/ICWSM.V11I1.14955
[7]  
Dhillon Jasleen, 2019, 2019 International Conference on Signal Processing and Communication (ICSC), P41
[8]   Using Visual Text Mining to Support the Study Selection Activity in Systematic Literature Reviews [J].
Felizardo, Katia R. ;
Salleh, Norsaremah ;
Martins, Rafael M. ;
Mendes, Emilia ;
MacDonell, Stephen G. ;
Maldonado, Jose C. .
2011 FIFTH INTERNATIONAL SYMPOSIUM ON EMPIRICAL SOFTWARE ENGINEERING AND MEASUREMENT (ESEM 2011), 2011, :77-86
[9]   All You Need is "Love": Evading Hate Speech Detection [J].
Grondahl, Tommi ;
Pajola, Luca ;
Juuti, Mika ;
Conti, Mauro ;
Asokan, N. .
AISEC'18: PROCEEDINGS OF THE 11TH ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, 2018, :2-12
[10]  
Hakiem M., 2019, KLASIFIKASI UJARAN K, V3, P2443