Multimodal Hate Speech Detection in Memes Using Contrastive Language-Image Pre-Training

被引:4
作者
Arya, Greeshma [1 ]
Hasan, Mohammad Kamrul [2 ]
Bagwari, Ashish [3 ]
Safie, Nurhizam [2 ]
Islam, Shayla [4 ]
Ahmed, Fatima Rayan Awad [5 ]
De, Aaishani [1 ]
Khan, Muhammad Attique [6 ,7 ]
Ghazal, Taher M. [8 ,9 ,10 ]
机构
[1] Indira Gandhi Delhi Tech Univ Women, Dept Elect & Commun Engn, New Delhi 110006, India
[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Bangi 43600, Selangor, Malaysia
[3] Uttarakhand Tech Univ, Dept Elect & Commun Engn, Dehra Dun 248007, India
[4] UCSI Univ Malaysia, Inst Comp Sci & Digital Innovat, Kuala Lumpur 56000, Malaysia
[5] Prince Sattam Bin Abdulaziz Univ, Coll Comp Engn & Sci, Comp Sci Dept, Al Kharj 16273, Saudi Arabia
[6] HITEC Univ, Dept Comp Sci, Taxila 47080, Pakistan
[7] Lebanese Amer Univ, Dept CS & Math, Beirut 11022801, Lebanon
[8] Khalifa Univ, Ctr Cyber Phys Syst, Comp Sci Dept, Abu Dhabi, U Arab Emirates
[9] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Ctr Cyber Secur, Bangi 43600, Selangor, Malaysia
[10] Appl Sci Private Univ, Appl Sci Res Ctr, Amman 11937, Jordan
关键词
CLIP; facebook hateful meme dataset; multimodal; contrastive learning; zero-shot prediction; InfoNCE contrastive loss; prompt engineering; cosine similarity matrix;
D O I
10.1109/ACCESS.2024.3361322
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In contemporary society, the proliferation of online hateful messages has emerged as a pressing concern, inflicting deleterious consequences on both societal fabric and individual well-being. The automatic detection of such malevolent content online using models designed to recognize it, holds promise in mitigating its harmful impact. However, the advent of "Hateful Memes" poses fresh challenges to the detection paradigm, particularly within the realm of deep learning models. These memes, constituting of a textual element associated with an image are individually innocuous but their combination causes a detrimental effect. Consequently, entities responsible for disseminating information via web browsers are compelled to institute mechanisms that regulate and automatically filter out such injurious content. Effectively identifying hateful memes demands algorithms and models endowed with robust vision and language fusion capabilities, capable of reasoning across diverse modalities. This research introduces a novel approach by leveraging the multimodal Contrastive Language-Image Pre-Training (CLIP) model, fine-tuned through the incorporation of prompt engineering. This innovative methodology achieves a commendable accuracy of 87.42%. Comprehensive metrics such as loss, AUROC, and f1 score are also meticulously computed, corroborating the efficacy of the proposed strategy. Our findings suggest that this approach presents an efficient means to regulate the dissemination of hate speech in the form of viral meme content across social networking platforms, thereby contributing to a safer online environment.
引用
收藏
页码:22359 / 22375
页数:17
相关论文
共 34 条
[1]   Predicting the Future Popularity of Academic Publications Using Deep Learning by Considering It as Temporal Citation Networks [J].
Abbas, Khushnood ;
Hasan, Mohammad Kamrul ;
Abbasi, Alireza ;
Mokhtar, Umi Asma ;
Khan, Asif ;
Abdullah, Siti Norul Huda Sheikh ;
Dong, Shi ;
Islam, Shayla ;
Alboaneen, Dabiah ;
Ahmed, Fatima Rayan Awad .
IEEE ACCESS, 2023, 11 :83052-83068
[2]   Co-Evolving Popularity Prediction in Temporal Bipartite Networks: A Heuristics Based Model [J].
Abbas, Khushnood ;
Hasan, Mohammad Kamrul ;
Abbasi, Alireza ;
Dong, Shi ;
Ghazal, Taher M. ;
Abdullah, Siti Norul Huda Sheikh ;
Khan, Asif ;
Alboaneen, Dabiah ;
Ahmed, Fatima Rayan Awad ;
Ahmed, Thowiba E. ;
Islam, Shayla .
IEEE ACCESS, 2023, 11 :37546-37559
[3]   Arabic Text Detection Using Rough Set Theory: Designing a Novel Approach [J].
Ahmed, Amjed Abbas ;
Hasan, Mohammad Kamrul ;
Jaber, Mustafa Musa ;
Al-Ghuribi, Sumaia Mohammed ;
Abd, Dhafar Hamed ;
Khan, Wasiq ;
Sadiq, Ahmed Tareq ;
Hussain, Abir .
IEEE ACCESS, 2023, 11 :68428-68438
[4]  
Ahmed Md. R., 2021, INT C COMPUTCOMMUN G, P1
[5]   Hateful Memes Classification using Machine Learning [J].
Badour, Jafar ;
Brown, Joseph Alexander .
2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021), 2021,
[6]  
Bhandari A, 2023, Int. J. Comput. Sci. Eng., V10, P9
[7]   A Deep Learning Approach to Online Social Network Account Compromisation [J].
Boahen, Edward Kwadwo ;
Bouya-Moko, Brunel Elvire ;
Qamar, Faizan ;
Wang, Changda .
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (06) :3204-3216
[8]  
Cao R, 2023, Arxiv, DOI arXiv:2302.04156
[9]  
Chen YS., 2022, PLOS ONE, V17
[10]   Multimodal detection of hateful memes by applying a vision-language pre-training model [J].
Chen, Yuyang ;
Pan, Feng .
PLOS ONE, 2022, 17 (09)