COVID-HateBERT: a Pre-trained Language Model for COVID-19 related Hate Speech Detection

被引：9

作者：

Li, Mingqi ^{[1
]}

Liao, Song ^{[1
]}

Okpala, Ebuka ^{[1
]}

Tong, Max ^{[1
,4
]}

Costello, Matthew ^{[2
]}

Cheng, Long ^{[1
]}

Hu, Hongxin ^{[3
]}

Luo, Feng ^{[1
]}

机构：

[1] Clemson Univ, Sch Comp, Clemson, SC 29631 USA

[2] Clemson Univ, Dept Sociol, Clemson, SC 29631 USA

[3] Univ Buffalo, Dept Comp Sci & Engn, Buffalo, NY USA

[4] Christ Church Episcopal Sch, Greenville, SC USA

来源：

20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021) | 2021年

关键词：

hate speech detection; language model; COVID-19; BERT;

D O I：

10.1109/ICMLA52953.2021.00043

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the dramatic growth of hate speech on social media during the COVID-19 pandemic, there is an urgent need to detect various hate speech effectively. Existing methods only achieve high performance when the training and testing data come from the same data distribution. The models trained on the traditional hateful dataset cannot fit well on COVID-19 related dataset. Meanwhile, manually annotating the hate speech dataset for supervised learning is time-consuming. Here, we propose COVID-HateBERT, a pre-trained language model to detect hate speech on English Tweets to address this problem. We collect 200M English tweets based on COVID-19 related hateful keywords and hashtags. Then, we use a classifier to extract the 1.27M potential hateful tweets to re-train BERT-base. We evaluate our COVID-HateBERT on four benchmark datasets. The COVID-HateBERT achieves a 14.8%-23.8% higher macro average F1 score on traditional hate speech detection comparing to baseline methods and a 2.6%-6.73% higher macro average F1 score on COVID-19 related hate speech detection comparing to classifiers using BERT and BERTweet, which shows that COIVD-HateBERT can generalize well on different datasets.

引用

页码：233 / 238

页数：6

共 37 条

[1] Deep Learning for Detecting Cyberbullying Across Multiple Social Media Platforms
Agrawal, Sweta
Awekar, Amit
[J]. ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018), 2018, 10772 : 141 - 153
[2] Detection of Hate Speech in COVID-19-Related Tweets in the Arab Region: Deep Learning and Topic Modeling Approach
Alshalan, Raghad
Al-Khalifa, Hend
Alsaeed, Duaa
Al-Baity, Heyam
Alshalan, Shahad
[J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (12)
[3] Hate Speech Detection is Not as Easy as You May Think: A Closer Look at Model Validation
Arango, Ayme
Perez, Jorge
Poblete, Barbara
[J]. PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, : 45 - 53
[4] Deep Learning for Hate Speech Detection in Tweets
Badjatiya, Pinkesh
Gupta, Shashank
Gupta, Manish
Varma, Vasudeva
[J]. WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 759 - 760
[5] Basile Valerio, 2019, P 13 INT WORKSH SEM, P54, DOI [10.18653, DOI 10.18653/V1/S19-2007]
[6] Beltagy I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3615
[7] Caselli T., 2020, ARXIV PREPRINT ARXIV
[8] Caselli T, 2020, PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), P6193
[9] Cer D, 2018, CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018): PROCEEDINGS OF SYSTEM DEMONSTRATIONS, P169
[10] Conneau A., 2019, Unsupervised cross-lingual representation learning at scale

← 1 2 3 4 →