LARGE AI MODEL-BASED SEMANTIC COMMUNICATIONS

被引：19

作者：

Jiang, Feibo ^{[1
]}

Peng, Yubo ^{[2
]}

Dong, Li ^{[2
]}

Wang, Kezhi ^{[3
]}

Yang, Kun ^{[4
]}

Pan, Cunhua ^{[5
]}

You, Xiaohu ^{[6
]}

机构：

[1] Hunan Normal Univ, Hunan Prov Key Lab Intelligent Comp & Language In, Changsha, Peoples R China

[2] Hunan Normal Univ, Changsha, Peoples R China

[3] Brunel Univ London, Dept Comp Sci, London, England

[4] Nanjing Univ, Sch Intelligent Software & Engn, Nanjing, Peoples R China

[5] Southeast Univ, Nanjing, Peoples R China

[6] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing, Peoples R China

来源：

IEEE WIRELESS COMMUNICATIONS | 2024年 / 31卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Image segmentation - Internet of Everything - Mixed reality - Semantics;

D O I：

10.1109/MWC.001.2300346

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Semantic communication (SC) is an emerging intelligent paradigm, offering solutions for various future applications like metaverse, mixed reality, and the Internet of everything. However, in current SC systems, the construction of the knowledge base (KB) faces several issues, including limited knowledge representation, frequent knowledge updates, and insecure knowledge sharing. Fortunately, the development of the large AI model (LAM) provides new solutions to overcome the above issues. Here, we propose a LAM-based SC framework (LAM-SC) specifically designed for image data, where we first apply the segment anything model (SAM)-based KB (SKB) that can split the original image into different semantic segments by universal semantic knowledge. Then, we present an attention-based semantic integration (ASI) to weigh the semantic segments generated by SKB without human participation and integrate them as the semantic-aware image. Additionally, we propose an adaptive semantic compression (ASC) encoding to remove redundant information in semantic features, thereby reducing communication overhead. Finally, through simulations, we demonstrate the effectiveness of the LAM-SC framework and the possibility of applying the LAM-based KB in future SC paradigms.

引用

页码：68 / 75

页数：8

共 15 条

[1] WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing [J].

Chen, Sanyuan ;

Wang, Chengyi ;

Chen, Zhengyang ;

Wu, Yu ;

Liu, Shujie ;

Chen, Zhuo ;

Li, Jinyu ;

Kanda, Naoyuki ;

Yoshioka, Takuya ;

Xiao, Xiong ;

Wu, Jian ;

Zhou, Long ;

Ren, Shuo ;

Qian, Yanmin ;

Qian, Yao ;

Zeng, Michael ;

Yu, Xiangzhan ;

Wei, Furu .

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) :1505-1518

[2]

Chen ZR, 2024, Arxiv, DOI arXiv:2308.06250

[3]

Chowdhury J.R., 2022, Proc. AAAI Conf. Artif. Intell., V36

[4] Self-Supervised Representation Learning: Introduction, advances, and challenges [J].

Ericsson, Linus ;

Gouk, Henry ;

Loy, Chen Change ;

Hospedales, Timothy M. .

IEEE SIGNAL PROCESSING MAGAZINE, 2022, 39 (03) :42-62

[5]

Jiang FB, 2024, Arxiv, DOI [arXiv:2309.01249, DOI 10.1109/MCOM.001.2300575]

[6]

Kirillov A, 2023, Arxiv, DOI arXiv:2304.02643

[7] Cross-Modal Semantic Communications [J].

Li, Ang ;

Wei, Xin ;

Wu, Dan ;

Zhou, Liang .

IEEE WIRELESS COMMUNICATIONS, 2022, 29 (06) :144-151

[8] Semantic Communications: Overview, Open Issues, and Future Research Directions [J].

Luo, Xuewen ;

Chen, Hsiao-Hwa ;

Guo, Qing .

IEEE WIRELESS COMMUNICATIONS, 2022, 29 (01) :210-219

[9] Deep Learning Enabled Semantic Communications With Speech Recognition and Synthesis [J].

Weng, Zhenzi ;

Qin, Zhijin ;

Tao, Xiaoming ;

Pan, Chengkang ;

Liu, Guangyi ;

Li, Geoffrey Ye .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (09) :6227-6240

[10] Deep Learning Enabled Semantic Communication Systems [J].

Xie, Huiqiang ;

Qin, Zhijin ;

Li, Geoffrey Ye ;

Juang, Biing-Hwang .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 :2663-2675

← 1 2 →