COVID-HateBERT: a Pre-trained Language Model for COVID-19 related Hate Speech Detection

被引:9
作者
Li, Mingqi [1 ]
Liao, Song [1 ]
Okpala, Ebuka [1 ]
Tong, Max [1 ,4 ]
Costello, Matthew [2 ]
Cheng, Long [1 ]
Hu, Hongxin [3 ]
Luo, Feng [1 ]
机构
[1] Clemson Univ, Sch Comp, Clemson, SC 29631 USA
[2] Clemson Univ, Dept Sociol, Clemson, SC 29631 USA
[3] Univ Buffalo, Dept Comp Sci & Engn, Buffalo, NY USA
[4] Christ Church Episcopal Sch, Greenville, SC USA
来源
20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021) | 2021年
关键词
hate speech detection; language model; COVID-19; BERT;
D O I
10.1109/ICMLA52953.2021.00043
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the dramatic growth of hate speech on social media during the COVID-19 pandemic, there is an urgent need to detect various hate speech effectively. Existing methods only achieve high performance when the training and testing data come from the same data distribution. The models trained on the traditional hateful dataset cannot fit well on COVID-19 related dataset. Meanwhile, manually annotating the hate speech dataset for supervised learning is time-consuming. Here, we propose COVID-HateBERT, a pre-trained language model to detect hate speech on English Tweets to address this problem. We collect 200M English tweets based on COVID-19 related hateful keywords and hashtags. Then, we use a classifier to extract the 1.27M potential hateful tweets to re-train BERT-base. We evaluate our COVID-HateBERT on four benchmark datasets. The COVID-HateBERT achieves a 14.8%-23.8% higher macro average F1 score on traditional hate speech detection comparing to baseline methods and a 2.6%-6.73% higher macro average F1 score on COVID-19 related hate speech detection comparing to classifiers using BERT and BERTweet, which shows that COIVD-HateBERT can generalize well on different datasets.
引用
收藏
页码:233 / 238
页数:6
相关论文
共 37 条
  • [1] Deep Learning for Detecting Cyberbullying Across Multiple Social Media Platforms
    Agrawal, Sweta
    Awekar, Amit
    [J]. ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018), 2018, 10772 : 141 - 153
  • [2] Detection of Hate Speech in COVID-19-Related Tweets in the Arab Region: Deep Learning and Topic Modeling Approach
    Alshalan, Raghad
    Al-Khalifa, Hend
    Alsaeed, Duaa
    Al-Baity, Heyam
    Alshalan, Shahad
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (12)
  • [3] Hate Speech Detection is Not as Easy as You May Think: A Closer Look at Model Validation
    Arango, Ayme
    Perez, Jorge
    Poblete, Barbara
    [J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 45 - 53
  • [4] Deep Learning for Hate Speech Detection in Tweets
    Badjatiya, Pinkesh
    Gupta, Shashank
    Gupta, Manish
    Varma, Vasudeva
    [J]. WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 759 - 760
  • [5] Basile Valerio, 2019, P 13 INT WORKSH SEM, P54, DOI [10.18653, DOI 10.18653/V1/S19-2007]
  • [6] Beltagy I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3615
  • [7] Caselli T., 2020, ARXIV PREPRINT ARXIV
  • [8] Caselli T, 2020, PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), P6193
  • [9] Cer D, 2018, CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, P169
  • [10] Conneau A., 2019, Unsupervised cross-lingual representation learning at scale