Hate Speech Detection is Not as Easy as You May Think: A Closer Look at Model Validation

被引:90
作者
Arango, Ayme [1 ]
Perez, Jorge [1 ]
Poblete, Barbara [1 ]
机构
[1] Univ Chile, IMFD, Dept Comp Sci, Santiago, Chile
来源
PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19) | 2019年
关键词
hate speech classification; experimental evaluation; social media; deep learning;
D O I
10.1145/3331184.3331262
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hate speech is an important problem that is seriously affecting the dynamics and usefulness of online social communities. Large scale social platforms are currently investing important resources into automatically detecting and classifying hateful content, without much success. On the other hand, the results reported by state-of-the-art systems indicate that supervised approaches achieve almost perfect performance but only within specific datasets. In this work, we analyze this apparent contradiction between existing literature and actual applications. We study closely the experimental methodology used in prior work and their generalizability to other datasets. Our findings evidence methodological issues, as well as an important dataset bias. As a consequence, performance claims of the current state-of-the-art have become significantly overestimated. The problems that we have found are mostly related to data overfitting and sampling issues. We discuss the implications for current research and re-conduct experiments to give a more accurate picture of the current state-of-the art methods.
引用
收藏
页码:45 / 53
页数:9
相关论文
共 32 条
  • [1] Deep Learning for Detecting Cyberbullying Across Multiple Social Media Platforms
    Agrawal, Sweta
    Awekar, Amit
    [J]. ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018), 2018, 10772 : 141 - 153
  • [2] [Anonymous], 2016, DEEP LEARNING
  • [3] [Anonymous], 2014, ARXIV
  • [4] [Anonymous], 2017, IEEE T AFFECTIVE COM
  • [5] Deep Learning for Hate Speech Detection in Tweets
    Badjatiya, Pinkesh
    Gupta, Shashank
    Gupta, Manish
    Varma, Vasudeva
    [J]. WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 759 - 760
  • [6] Basile Valerio, SHARED TASK MULTILIN
  • [7] Mean Birds: Detecting Aggression and Bullying on Twitter
    Chatzakou, Despoina
    Kourtellis, Nicolas
    Blackburn, Jeremy
    De Cristofaro, Emiliano
    Stringhini, Gianluca
    Vakali, Athena
    [J]. PROCEEDINGS OF THE 2017 ACM WEB SCIENCE CONFERENCE (WEBSCI '17), 2017, : 13 - 22
  • [8] Dadvar M., 2018, CORR
  • [9] Dadvar M, 2014, LECT NOTES COMPUT SC, V8436, P275, DOI 10.1007/978-3-319-06483-3_25
  • [10] Davidson T., 2017, ICWSM, P512