Hateful people or hateful bots? Detection and characterization of bots spreading religious hatred in Arabic social media

被引:15
作者
Albadi N. [1 ,2 ]
Kurdi M. [2 ,3 ]
Mishra S. [2 ]
机构
[1] Taibah University, Department of Computer Science, Medina
[2] University of Colorado Boulder, Department of Computer Science, 1111 Engineering Dr, Boulder, 80309, CO
[3] Taif University, Department of Computer Science, Taif
来源
Proceedings of the ACM on Human-Computer Interaction | 2019年 / 3卷 / CSCW期
关键词
Arabic bots; Arabic NLP; Detection; Hate speech; Machine learning; Twitter;
D O I
10.1145/3359163
中图分类号
学科分类号
摘要
Arabic Twitter space is crawling with bots that fuel political feuds, spread misinformation, and proliferate sectarian rhetoric. While efforts have long existed to analyze and detect English bots, Arabic bot detection and characterization remains largely understudied. In this work, we contribute new insights into the role of bots in spreading religious hatred on Arabic Twitter and introduce a novel regression model that can accurately identify Arabic language bots. Our assessment shows that existing tools that are highly accurate in detecting English bots don’t perform as well on Arabic bots. We identify the possible reasons for this poor performance, perform a thorough analysis of linguistic, content, behavioral and network features, and report on the most informative features that distinguish Arabic bots from humans as well as the differences between Arabic and English bots. Our results mark an important step toward understanding the behavior of malicious bots on Arabic Twitter and pave the way for a more effective Arabic bot detection tools. © 2019 Copyright held by the owner/author(s). Publication rights licensed to ACM.
引用
收藏
相关论文
共 60 条
[21]  
Djuric N., Zhou J., Morris R., Grbovic M., Radosavljevic V., Bhamidipati N., Hate speech detection with comment embeddings, Proceedings of the 24th International Conference on World Wide Web, pp. 29-30, (2015)
[22]  
El-Mawass N., Alaboodi S., Detecting Arabic spammers and content polluters on Twitter, 2016 Sixth International Conference on Digital Information Processing and Communications (ICDIPC), pp. 53-58, (2016)
[23]  
Ernala S.K., Rizvi A.F., Birnbaum M.L., Kane J.M., De Choudhury M., Linguistic markers indicating therapeutic outcomes of social media disclosures of schizophrenia, Proceedings of the ACM on Human-Computer Interaction, 1, (2017)
[24]  
Farghaly A., Shaalan K., Arabic natural language processing: Challenges and solutions, ACM Transactions on Asian Language Information Processing (TALIP), 8, 4, (2009)
[25]  
Ferrara E., Varol O., Davis C., Menczer F., Flammini A., The rise of social bots, Commun. ACM, 59, 7, pp. 96-104, (2016)
[26]  
Golberg G., When It Comes to Dealing with Fake/Bot Accounts, Twitter Is (Still) Failing, (2018)
[27]  
Hall A., Terveen L., Halfaker A., Bot detection in wikidata using behavioral and other informal cues, Proceedings of the ACM on Human-Computer Interaction, 2, (2018)
[28]  
Kendall M.G., A new measure of rank correlation, Biometrika, 30, 1-2, pp. 81-93, (1938)
[29]  
Long J.V.K., Sutton S., Brooker P., Feltwell T., Kirman B., Barnett J., Lawson S., Could you define that in bot terms?: Requesting, creating and using bots on reddit, Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, pp. 3488-3500, (2017)
[30]  
Botcheck.Me: Detect & Track Twitter Bots, (2017)