Hate Speech Detection is Not as Easy as You May Think: A Closer Look at Model Validation

被引：90

作者：

Arango, Ayme ^{[1
]}

Perez, Jorge ^{[1
]}

Poblete, Barbara ^{[1
]}

机构：

[1] Univ Chile, IMFD, Dept Comp Sci, Santiago, Chile

来源：

PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19) | 2019年

关键词：

hate speech classification; experimental evaluation; social media; deep learning;

D O I：

10.1145/3331184.3331262

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Hate speech is an important problem that is seriously affecting the dynamics and usefulness of online social communities. Large scale social platforms are currently investing important resources into automatically detecting and classifying hateful content, without much success. On the other hand, the results reported by state-of-the-art systems indicate that supervised approaches achieve almost perfect performance but only within specific datasets. In this work, we analyze this apparent contradiction between existing literature and actual applications. We study closely the experimental methodology used in prior work and their generalizability to other datasets. Our findings evidence methodological issues, as well as an important dataset bias. As a consequence, performance claims of the current state-of-the-art have become significantly overestimated. The problems that we have found are mostly related to data overfitting and sampling issues. We discuss the implications for current research and re-conduct experiments to give a more accurate picture of the current state-of-the art methods.

引用

页码：45 / 53

页数：9

共 32 条

[1] Deep Learning for Detecting Cyberbullying Across Multiple Social Media Platforms
Agrawal, Sweta
Awekar, Amit
[J]. ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018), 2018, 10772 : 141 - 153
[2] [Anonymous], 2016, DEEP LEARNING
[3] [Anonymous], 2014, ARXIV
[4] [Anonymous], 2017, IEEE T AFFECTIVE COM
[5] Deep Learning for Hate Speech Detection in Tweets
Badjatiya, Pinkesh
Gupta, Shashank
Gupta, Manish
Varma, Vasudeva
[J]. WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 759 - 760
[6] Basile Valerio, SHARED TASK MULTILIN
[7] Mean Birds: Detecting Aggression and Bullying on Twitter
Chatzakou, Despoina
Kourtellis, Nicolas
Blackburn, Jeremy
De Cristofaro, Emiliano
Stringhini, Gianluca
Vakali, Athena
[J]. PROCEEDINGS OF THE 2017 ACM WEB SCIENCE CONFERENCE (WEBSCI '17), 2017, : 13 - 22
[8] Dadvar M., 2018, CORR
[9] Dadvar M, 2014, LECT NOTES COMPUT SC, V8436, P275, DOI 10.1007/978-3-319-06483-3_25
[10] Davidson T., 2017, ICWSM, P512

← 1 2 3 4 →