Survey of Multimodal Federated Learning: Exploring Data Integration, Challenges, and Future Directions

被引:0
作者
Adam, Mumin [1 ,2 ]
Albaseer, Abdullatif [3 ]
Baroudi, Uthman [1 ,2 ]
Abdallah, Mohamed [3 ]
机构
[1] King Fahd Univ Petr & Minerals, Dept Comp Engn, Dhahran 31261, Saudi Arabia
[2] King Fahd Univ Petr & Minerals, IRC ISS, Dhahran 31261, Saudi Arabia
[3] Hamad Bin Khalifa Univ, Coll Sci & Engn, Div Informat & Comp Technol, Doha, Qatar
来源
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY | 2025年 / 6卷
关键词
Data models; Surveys; Transformers; Data privacy; Computational modeling; Internet of Things; Accuracy; Scalability; Federated learning; Distributed databases; Multimodal FL; data fusion; cross-modal; multimodal federated transformer learning; multimodal FL communication intelligent IoT applications; PRIVACY; DIAGNOSIS; SECURITY;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The rapidly expanding demand for intelligent wireless applications and the Internet of Things (IoT) requires advanced system designs to handle multimodal data effectively while ensuring user privacy and data security. Traditional machine learning (ML) models rely on centralized architectures, which, while powerful, often present significant privacy risks due to the centralization of sensitive data. Federated Learning (FL) is a promising decentralized alternative for addressing these issues. However, FL predominantly handles unimodal data, which limits its applicability in environments where devices collect and process various data types such as text, images, and sensor output. To address this limitation, Multimodal FL (MMFL) integrates multiple data modalities, enabling a richer and more holistic understanding of data. In this survey, we explore the challenges and advancements in MMFL, including data representation, fusion techniques, and cross-modal learning strategies. We present a comprehensive taxonomy of MMFL, outlining critical challenges such as modality imbalance, fusion complexity, and security concerns. Additionally, we highlight the role of transformers in MMFL by leveraging their powerful attention mechanisms to process multimodal data in a federated setting. Finally, we discuss various applications of MMFL, including healthcare, human activity recognition, and emotion recognition, and propose future research directions for improving the scalability and robustness of MMFL systems in real-world scenarios.
引用
收藏
页码:2510 / 2538
页数:29
相关论文
共 211 条
[1]  
Adam M., 2024, P IEEE 100 VEH TECHN, P1
[2]   Federated Learning for IoT: Applications, Trends, Taxonomy, Challenges, Current Solutions, and Future Directions [J].
Adam, Mumin ;
Baroudi, Uthman .
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2024, 5 :7842-7877
[3]   A Survey on Security, Privacy, Trust, and Architectural Challenges in IoT Systems [J].
Adam, Mumin ;
Hammoudeh, Mohammad ;
Alrawashdeh, Rana ;
Alsulaimy, Basil .
IEEE ACCESS, 2024, 12 :57128-57149
[4]  
Agbley B.L.Y., 2021, 2021 18 INT COMP C W, P238, DOI [DOI 10.1109/ICCWAMTIP53232.2021.9674116, 10.1109/ICCWAMTIP53232.2021.9674116]
[5]  
Alam F., 2018, P INT AAAI C WEB SOC
[6]   Client Selection Approach in Support of Clustered Federated Learning over Wireless Edge Networks [J].
Albaseer, Abdullatif ;
Abdallah, Mohamed ;
Al-Fuqaha, Ala ;
Erbad, Aiman .
2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
[7]   Fine-Grained Data Selection for Improved Energy Efficiency of Federated Edge Learning [J].
Albaseer, Abdullatif ;
Abdallah, Mohamed ;
Al-Fuqaha, Ala ;
Erbad, Aiman .
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (05) :3258-3271
[8]   Fair Selection of Edge Nodes to Participate in Clustered Federated Multitask Learning [J].
Albaseer, Abdullatif Mohammed ;
Abdallah, Mohamed ;
Al-Fuqaha, Ala ;
Seid, Abegaz Mohammed ;
Erbad, Aiman ;
Dobre, Octavia A. .
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (02) :1502-1516
[9]   A Multimodal Federated Learning Framework for Modality Incomplete Scenarios in Healthcare [J].
An, Ying ;
Bai, Yaqi ;
Liu, Yuan ;
Guo, Lin ;
Chen, Xianlai .
BIOINFORMATICS RESEARCH AND APPLICATIONS, PT II, ISBRA 2024, 2024, 14955 :245-256
[10]   Multimodal Federated Learning in AIoT Systems: Existing Solutions, Applications, and Challenges [J].
Anagnostopoulos, Christos ;
Gkillas, Alexandros ;
Mavrokefalidis, Christos ;
Pikoulis, Erion-Vasilis ;
Piperigkos, Nikos ;
Lalos, Aris S. .
IEEE ACCESS, 2024, 12 :180864-180902