Wireless Semantic Communications for Video Conferencing

被引:74
作者
Jiang, Peiwen [1 ]
Wen, Chao-Kai [2 ]
Jin, Shi [1 ]
Li, Geoffrey Ye [3 ]
机构
[1] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 210096, Peoples R China
[2] Natl Sun Yat Sen Univ, Inst Commun Engn, Kaohsiung 80424, Taiwan
[3] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England
基金
中国国家自然科学基金;
关键词
IR-HARQ; CSI feedback; semantic communication; video conferencing; facial keypoints; DEEP; TRANSMISSION; INTERNET;
D O I
10.1109/JSAC.2022.3221968
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video conferencing has become a popular mode of meeting despite consuming considerable communication resources. Conventional video compression causes resolution reduction under a limited bandwidth. Semantic video conferencing (SVC) maintains a high resolution by transmitting some keypoints to represent the motions because the background is almost static, and the speakers do not change often. However, the study on the influence of transmission errors on keypoints is limited. In this paper, an SVC network based on keypoint transmission is established, which dramatically reduces transmission resources while only losing detailed expressions. Transmission errors in SVC only lead to a changed expression, whereas those in the conventional methods directly destroy pixels. However, the conventional error detector, such as cyclic redundancy check, cannot reflect the degree of expression changes. To overcome this issue, an incremental redundancy hybrid automatic repeat-request framework for varying channels (SVC-HARQ) incorporating a novel semantic error detector is developed. SVC-HARQ has flexibility in bit consumption and achieves a good performance. In addition, SVC-channel state information (CSI) is designed for CSI feedback to allocate the keypoint transmission and enhance the performance dramatically. Simulation shows that the proposed wireless semantic communication system can remarkably improve transmission efficiency.
引用
收藏
页码:230 / 244
页数:15
相关论文
共 41 条
[1]   A link adaptation scheme optimized for wireless JPEG 2000 transmission over realistic MIMO systems [J].
Abot, Julien ;
Olivier, Christian ;
Perrine, Clency ;
Pousset, Yannis .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2012, 27 (10) :1066-1078
[2]   Deep Joint Source-Channel Coding for Wireless Image Transmission [J].
Bourtsoulatze, Eirina ;
Kurka, David Burth ;
Gunduz, Deniz .
IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2019, 5 (03) :567-579
[3]  
Bulazel A., 2017, P 1 REVERSING OFFENS, P2, DOI DOI 10.1145/3150376.3150378
[4]   Joint Source-Channel Coding for Deep-Space Image Transmission using Rateless Codes [J].
Bursalioglu, O. Y. ;
Caire, G. ;
Divsalar, D. .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2013, 61 (08) :3448-3461
[5]   Advances in Video Compression System Using Deep Neural Network: A Review and Case Studies [J].
Ding, Dandan ;
Ma, Zhan ;
Chen, Di ;
Chen, Qingshuang ;
Liu, Zoe ;
Zhu, Fengqing .
PROCEEDINGS OF THE IEEE, 2021, 109 (09) :1494-1520
[6]  
Farsad N, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P2326, DOI 10.1109/ICASSP.2018.8461983
[7]   Model-Driven Deep Learning for Physical Layer Communications [J].
He, Hengtao ;
Jin, Shi ;
Wen, Chao-Kai ;
Gao, Feifei ;
Li, Geoffrey Ye ;
Xu, Zongben .
IEEE WIRELESS COMMUNICATIONS, 2019, 26 (05) :77-83
[8]  
Hore Alain, 2010, Proceedings of the 2010 20th International Conference on Pattern Recognition (ICPR 2010), P2366, DOI 10.1109/ICPR.2010.579
[9]  
Jie Bao, 2011, 2011 IEEE First International Network Science Workshop (NSW 2011), P110, DOI 10.1109/NSW.2011.6004632
[10]   Perceptual Losses for Real-Time Style Transfer and Super-Resolution [J].
Johnson, Justin ;
Alahi, Alexandre ;
Li Fei-Fei .
COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :694-711