End-to-End Learning for Task-Oriented Semantic Communications Over MIMO Channels: An Information-Theoretic Framework

被引:1
作者
Cai, Chang [1 ]
Yuan, Xiaojun [2 ]
Zhang, Ying-Jun Angela [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Informat Engn, Hong Kong, Peoples R China
[2] Univ Elect Sci & Technol China, Natl Key Lab Wireless Commun, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金;
关键词
Precoding; Accuracy; Wireless communication; Training; Feature extraction; Transceivers; Servers; Semantic communication; Performance evaluation; Noise; Task-oriented semantic communication; transceiver design; deep unfolding; multi-device edge inference; maximal coding rate reduction (MCR2); DEEP; NETWORK;
D O I
10.1109/JSAC.2025.3531575
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper addresses the problem of end-to-end (E2E) design of learning and communication in a task-oriented semantic communication system. In particular, we consider a multi-device cooperative edge inference system over a wireless multiple-input multiple-output (MIMO) multiple access channel, where multiple devices transmit extracted features to a server to perform a classification task. We formulate the E2E design of feature encoding, MIMO precoding, and classification as a conditional mutual information maximization problem. However, it is notoriously difficult to design and train an E2E network that can be adaptive to both the task dataset and different channel realizations. Regarding network training, we propose a decoupled pretraining framework that separately trains the feature encoder and the MIMO precoder, with a maximum a posteriori (MAP) classifier employed at the server to generate the inference result. The feature encoder is pretrained exclusively using the task dataset, while the MIMO precoder is pretrained solely based on the channel and noise distributions. Nevertheless, we manage to align the pretraining objectives of each individual component with the E2E learning objective, so as to approach the performance bound of E2E learning. By leveraging the decoupled pretraining results for initialization, the E2E learning can be conducted with minimal training overhead. Regarding network architecture design, we develop two deep unfolded precoding networks that effectively incorporate the domain knowledge of the solution to the decoupled precoding problem. Simulation results on both the CIFAR-10 and ModelNet10 datasets verify that the proposed method achieves significantly higher classification accuracy compared to various baselines.
引用
收藏
页码:1292 / 1307
页数:16
相关论文
共 41 条
[1]   Efficient Maximal Coding Rate Reduction by Variational Forms [J].
Baek, Christina ;
Wu, Ziyang ;
Chan, Kwan Ho Ryan ;
Ding, Tianjiao ;
Ma, Yi ;
Haeffele, Benjamin D. .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :490-498
[2]   Semantic Information Recovery in Wireless Networks [J].
Beck, Edgar ;
Bockelmann, Carsten ;
Dekorsy, Armin .
SENSORS, 2023, 23 (14)
[3]  
Cai C., 2024, P 58 AS C SIGN SYST, P1
[4]   Multi-Device Task-Oriented Communication via Maximal Coding Rate Reduction [J].
Cai, Chang ;
Yuan, Xiaojun ;
Zhang, Ying-Jun Angela .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (12) :18096-18110
[5]  
Chan KHR, 2022, J MACH LEARN RES, V23, P1
[6]  
Cover T. M., 1999, Elements of information theory, DOI DOI 10.1002/047174882X
[7]   Nonlinear Transform Source-Channel Coding for Semantic Communications [J].
Dai, Jincheng ;
Wang, Sixian ;
Tan, Kailin ;
Si, Zhongwei ;
Qin, Xiaoqi ;
Niu, Kai ;
Zhang, Ping .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2022, 40 (08) :2300-2316
[8]  
Dosovitskiy, 2020, ARXIV
[9]  
Garren KennethR., 1968, Bounds for the Eigenvalues of a Matrix
[10]   Beyond Transmitting Bits: Context, Semantics, and Task-Oriented Communications [J].
Gunduz, Deniz ;
Qin, Zhijin ;
Aguerri, Inaki Estella ;
Dhillon, Harpreet S. ;
Yang, Zhaohui ;
Yener, Aylin ;
Wong, Kai Kit ;
Chae, Chan-Byoung .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (01) :5-41