Multi-Modal Federated Learning Over Cell-Free Massive MIMO Systems for Activity Recognition

被引：0

作者：

Sheikholeslami, Seyed Mohammad ^{[1
]}

Ng, Pai Chet ^{[1
]}

Abouei, Jamshid ^{[2
]}

Plataniotis, Konstantinos N. ^{[1
]}

机构：

[1] Univ Toronto, Edward S Rogers Sr Dept Elect & Comp Engn, Toronto, ON M5S 3G4, Canada

[2] Yazd Univ, Dept Elect Engn, Yazd 89195741, Iran

来源：

IEEE ACCESS | 2025年 / 13卷

关键词：

Training; Internet of Things; Resource management; Optimization; Delays; Servers; Federated learning; Data models; Distributed databases; Wireless networks; Multi-modal federated learning; cell-free networks; human activity recognition; device-modality selection;

D O I：

10.1109/ACCESS.2025.3548001

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper addresses the problem of Multi-modal Federated Learning (MFL) over resource-limited Cell-Free massive MIMO (CF-mMIMO) networks for the application of Human Activity Recognition (HAR). MFL leverages diverse data modalities across various clients, while the CF-mMIMO network ensures consistent service quality, crucial for collaborative training. The primary challenges of MFL are data heterogeneity, which includes statistical and modality heterogeneity that complicate data fusion, client collaboration, and inference with missing data, and system heterogeneity, where devices with dissimilar modalities experience varied processing and communication delays, increasing overall training latency. To tackle these issues, we propose a late-fusion model architecture that allows flexible client participation with any combination of data modalities, and formulate an optimization problem to jointly minimize latency and global loss in MFL. We propose a prioritized device-modality selection scheme that allows flexible participation of devices. Additionally, we employ a modified Particle Swarm Optimization (PSO) algorithm for efficient resource allocation. Extensive experiments validate our framework, demonstrating substantial reductions in training time and significant improvements in model performance, particularly an average improvement of 15% and 23% in test accuracy compared to the other fusion models when missing one and two modalities in the inference phase.

引用

页码：40844 / 40858

页数：15