CyberBERT: BERT for cyberbullying identification BERT for cyberbullying identification

被引:58
作者
Paul, Sayanta [1 ]
Saha, Sriparna [1 ]
机构
[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Bihta, Bihar, India
关键词
Cyberbullying; Language model; Deep learning; BERT;
D O I
10.1007/s00530-020-00710-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cyberbullying can be delineated as a purposive and recurrent act, which is aggressive in nature, done via different social media platforms such as Facebook, Twitter, Instagram, and others. A state-of-the-art pre-training language model, BERT (Bidirectional Encoder Representations from Transformers), has achieved remarkable results in many language understanding tasks. In this paper, we present a novel application of BERT for cyberbullying identification. A straightforward classification model using BERT is able to achieve the state-of-the-art results across three real-world corpora: Formspring (similar to 12k posts), Twitter (similar to 16k posts), and Wikipedia (similar to 100k posts). Experimental results demonstrate that our proposed model achieves significant improvements over existing works, in comparison with the slot-gated or attention-based deep neural network models.
引用
收藏
页码:1897 / 1904
页数:8
相关论文
共 21 条
[1]   Classification of Fake News by Fine-tuning Deep Bidirectional Transformers based Language Model [J].
Aggarwal, Akshay ;
Chauhan, Aniruddha ;
Kumar, Deepika ;
Mittal, Mamta ;
Verma, Sharad .
EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2020, 7 (27) :1-12
[2]   Deep Learning for Detecting Cyberbullying Across Multiple Social Media Platforms [J].
Agrawal, Sweta ;
Awekar, Amit .
ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018), 2018, 10772 :141-153
[3]   Deep Learning for Hate Speech Detection in Tweets [J].
Badjatiya, Pinkesh ;
Gupta, Shashank ;
Gupta, Manish ;
Varma, Vasudeva .
WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, :759-760
[4]   Improving cyberbullying detection using Twitter users' psychological features and machine learning [J].
Balakrishnan, Vimala ;
Khan, Shahzaib ;
Arabnia, Hamid R. .
COMPUTERS & SECURITY, 2020, 90
[5]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[6]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[7]   Approximate statistical tests for comparing supervised classification learning algorithms [J].
Dietterich, TG .
NEURAL COMPUTATION, 1998, 10 (07) :1895-1923
[8]  
Dinakar K., 2011, P INT C WEBL SOC MED
[9]   Hate Speech Detection with Comment Embeddings [J].
Djuric, Nemanja ;
Zhou, Jing ;
Morris, Robin ;
Grbovic, Mihajlo ;
Radosavljevic, Vladan ;
Bhamidipati, Narayan .
WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, :29-30
[10]  
Hinton G., 2015, ARXIV150302531