Self-Supervised Radio-Visual Representation Learning for 6G Sensing

Cited by: 5
Authors
Alloulah, Mohammed [1]
Singh, Akash Deep [1, 2]
Arnold, Maximilian [1]
Affiliations
[1] Bell Labs, Holmdel, NJ 07974 USA
[2] UCLA, Los Angeles, CA USA
Source
IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022) | 2022
Keywords
radio-visual learning; self-supervised learning; deep learning; sensing; perception; 6G
DOI
10.1109/ICC45855.2022.9838844
Chinese Library Classification (CLC) number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract
In future 6G cellular networks, a joint communication and sensing protocol will allow the network to perceive the environment, opening the door for many new applications atop a unified communication-perception infrastructure. However, interpreting the sparse radio representation of sensing scenes is challenging, which hinders the potential of these emergent systems. We propose to combine radio and vision to automatically learn a radio-only sensing model with minimal human intervention. We want to build a radio sensing model that can feed on millions of uncurated data points. To this end, we leverage recent advances in self-supervised learning and formulate a new label-free radio-visual co-learning scheme, whereby vision trains radio via cross-modal mutual information. We implement and evaluate our scheme according to the common linear classification benchmark, and report qualitative and quantitative performance metrics. In our evaluation, the representation learnt by radio-visual self-supervision works well for a downstream sensing demonstrator, and outperforms its fully-supervised counterpart when less labelled data is used. This indicates that self-supervised learning could be an important enabler for future scalable radio sensing systems.
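The abstract describes vision training radio through cross-modal mutual information but gives no formula. As a rough illustration only, the sketch below uses an InfoNCE-style contrastive loss, a common lower-bound surrogate for mutual information. It is an assumption for illustration and not necessarily the objective used by the authors. The function names (`l2_normalize`, `info_nce`), the temperature value, the embedding sizes and the synthetic data are all hypothetical.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    # Scale each row to unit length so dot products become cosine similarities.
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)


def info_nce(radio_emb, vision_emb, temperature=0.1):
    # Row i of radio_emb and row i of vision_emb are assumed to come from the
    # same scene (a positive pair); every other row acts as a negative.
    r = l2_normalize(radio_emb)
    v = l2_normalize(vision_emb)
    logits = r @ v.T / temperature                       # shape (N, N)
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Average negative log-probability assigned to the matching pairs.
    return -np.mean(np.diag(log_prob))


# Synthetic stand-ins for encoder outputs (hypothetical data, not from the paper).
rng = np.random.default_rng(0)
vision = rng.normal(size=(8, 16))
aligned_radio = vision + 0.01 * rng.normal(size=(8, 16))  # radio agrees with vision
random_radio = rng.normal(size=(8, 16))                   # radio unrelated to vision

loss_aligned = info_nce(aligned_radio, vision)
loss_random = info_nce(random_radio, vision)
print(loss_aligned, loss_random)
```

In this toy setting the loss is lower when the radio embeddings agree with their paired vision embeddings, which is the signal a label-free co-learning scheme of this kind would minimise.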
Pages: 1955-1961
Number of pages: 7