Categorizing Crises From Social Media Feeds via Multimodal Channel Attention

被引:5
作者
Rezk, Mariham [1 ,2 ]
Elmadany, Noureldin [1 ,2 ]
Hamad, Radwa K. [1 ]
Badran, Ehab F. [1 ]
机构
[1] Arab Acad Sci, Dept Elect & Commun Engn Technol & Maritime Transp, Alexandria 21937, Egypt
[2] Arab Acad Sci Technol & Maritime Transport, Intelligent Syst Lab, Alexandria 21937, Egypt
关键词
Multimodal deep learning; social media; natural disasters; crisis response; attention; fusion; NATURAL DISASTERS; EVENT DETECTION;
D O I
10.1109/ACCESS.2023.3294474
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the era of advanced computer vision and natural language processing, the use of social media as a source of information has become even more valuable in directing aid and rescuing victims. Consequently, millions of texts and images can be processed in real-time, allowing emergency responders to efficiently assess evolving crises and appropriately allocate resources. The majority of the previous detection studies are text-only or image-only based, overlooking the potential benefits of integrating both modalities. In this paper, we propose Multimodal Channel Attention (MCA) block, which employs an adaptive attention mechanism, learning to assign varying importance to each modality. We then propose a novel Deep Multimodal Crisis Categorization (DMCC) framework, which employs a two-level fusion strategy for better integration of textual and visual information. The DMCC framework consists of feature-level fusion, which is accomplished through the MCA block, and score-level fusion, whereby the decisions made by the individual modalities are integrated with those of the MCA model. Extensive experiments on publicly available datasets demonstrate the effectiveness of the proposed framework. Through a comprehensive evaluation, it was found that the proposed framework achieves a performance enhancement compared to unimodal methods. Furthermore, it outperforms the current state-of-the-art methods on crisis-related categorization tasks. The code is available at https://github.com/MarihamR/Categorizing-Crises-from-Social-Media-Feeds-Via-Multimodal-Channel-Attention.
引用
收藏
页码:72037 / 72049
页数:13
相关论文
共 60 条
[1]   Multimodal Categorization of Crisis Events in Social Media [J].
Abavisani, Mahdi ;
Wu, Liwei ;
Hu, Shengli ;
Tetreault, Joel ;
Jaimes, Alejandro .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :14667-14677
[2]  
Alam Firoj, 2017, 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), P601, DOI 10.1145/3110025.3110164
[3]  
Alam F., 2018, PROC 12 INT AAAI C W, P1
[4]   Robust Training of Social Media Image Classification Models [J].
Alam, Firoj ;
Alam, Tanvirul ;
Ofli, Ferda ;
Imran, Muhammad .
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) :546-565
[5]   Deep Learning Benchmarks and Datasets for Social Media Image Classification for Disaster Response [J].
Alam, Firoj ;
Ofli, Ferda ;
Imran, Muhammad ;
Alam, Tanvirul ;
Qazi, Umair .
2020 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2020, :151-158
[6]  
Alharbi A., 2019, P 3 WORKSHOP ARABIC, P72
[7]  
[Anonymous], 2010, WWW'10: Proceedings of the 19th international conference on World wide web, DOI DOI 10.1145/1772690.1772777
[8]  
Beigi G, 2016, STUD COMPUT INTELL, V639, P313, DOI 10.1007/978-3-319-30319-2_13
[9]   Visual Representations of Disaster [J].
Bica, Melissa ;
Palen, Leysia ;
Bopp, Chris .
CSCW'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, 2017, :1262-1276
[10]  
Burel G., 2017, On semantics and deep learning for event detection in crisis situations