Systematic Literature Review Of Hate Speech Detection With Text Mining

被引:7
作者
Rini [1 ]
Utami, Ema [1 ]
Hartanto, Anggit Dwi [2 ]
机构
[1] Univ Amikom Yogyakarta, Informat Engn, Yogyakarta, Indonesia
[2] Univ Amikom Yogyakarta, Fac Comp Sci, Yogyakarta, Indonesia
来源
PROCEEDINGS OF ICORIS 2020: 2020 THE 2ND INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEM (ICORIS) | 2020年
关键词
hate speech; classification; systematic literature review; text mining; TWITTER;
D O I
10.1109/ICORIS50180.2020.9320755
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Along with the increasing activity on social media, hate speech is getting out of control. Hate speech detection can be done by utilizing text mining technology. There have been many hate speech detection studies conducted. To identify and analyze research trends, data sources, methods and features used in hate speech detection, this systematic literature review was created. Until early 2020, the topics of hate speech were found, including hate speech against minorities, religion, women, the general election agenda, and politics. Sources of data that are widely used to be used as datasets come from twitter. Hate speech is not only classified into HS (hate speech) and Non-HS (non-hate speech) but can be further classified into racism, sexism, offensive, abusive, threats of violence and others. Of the 38 studies that meet inclusion and exclusion, there are 26 algorithms and 28 features that have been used to detect hate speech. However, these methods and features do not necessarily guarantee a good hate detection performance. Hate speech classification performance is also influenced by the dataset, the features chosen, the number of classes and mutually exclusive classes.
引用
收藏
页码:228 / 233
页数:6
相关论文
共 44 条
[21]   Abusive Language Detection in Online User Content [J].
Nobata, Chikashi ;
Tetreault, Joel ;
Thomas, Achint ;
Mehdad, Yashar ;
Chang, Yi .
PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16), 2016, :145-153
[22]   Evaluating Machine Learning Techniques for Detecting Offensive and Hate Speech in South African Tweets [J].
Oriola, Oluwafemi ;
Kotze, Eduan .
IEEE ACCESS, 2020, 8 :21496-21509
[23]   Effective hate-speech detection in Twitter data using recurrent neural networks [J].
Pitsilis, Georgios K. ;
Ramampiaro, Heri ;
Langseth, Helge .
APPLIED INTELLIGENCE, 2018, 48 (12) :4730-4742
[24]  
Pratiwi N.I., 2019, P 2019 2 INT C DATA, P128, DOI DOI 10.1145/3352411.3352432
[25]  
Pratiwi NI, 2018, INT C ADV COMP SCI I, P447, DOI 10.1109/ICACSIS.2018.8618182
[26]  
Putra A. K. B. A., 2018, J PENGEMB TEKNOL INF, V2
[27]  
Ramdhani A., 2019, PROS SEMIN NAS TEKNO, V3, P42
[28]   Augment to Prevent: Short-Text Data Augmentation in Deep Learning for Hate-Speech Classification [J].
Rizos, Georgios ;
Hemker, Konstantin ;
Schuller, Bjoern .
PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, :991-1000
[29]   Hate Speech Detection on Twitter: Feature Engineering v.s. Feature Selection [J].
Robinson, David ;
Zhang, Ziqi ;
Tepper, Jonathan .
SEMANTIC WEB: ESWC 2018 SATELLITE EVENTS, 2018, 11155 :46-49
[30]  
Ruwandika NDT, 2018, INT CONF ADV ICT, P273, DOI 10.1109/ICTER.2018.8615517