Multi-modal fusion for sensing-aided beam tracking in mmWave communications

被引:0
|
作者
Bian, Yijie [1 ]
Yang, Jie [2 ,3 ]
Dai, Lingyun [1 ]
Lin, Xi [4 ]
Cheng, Xinyao [4 ]
Que, Hang [4 ,5 ]
Liang, Le [3 ,4 ,5 ]
Jin, Shi [3 ,4 ,5 ]
机构
[1] Southeast Univ, CHIEN SHIUNG WU Coll, Nanjing, Peoples R China
[2] Key Lab Measurement & Control Complex Syst Engn, Nanjing, Peoples R China
[3] Frontiers Sci Ctr Mobile Informat Commun & Secur, Nanjing, Peoples R China
[4] Sch Informat Sci & Technol, Nanjing, Peoples R China
[5] Natl Mobile Commun Res Lab, Nanjing, Peoples R China
关键词
mmWave communications; Deep learning; Beam training and tracking; Multi-modal data; Decision-level fusion; MILLIMETER-WAVE;
D O I
10.1016/j.phycom.2024.102514
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Millimeter wave (mmWave) communication has attracted extensive attention and research due to its wide bandwidth and abundant spectrum resources. Effective and fast beam tracking is a critical challenge for the practical deployment of mmWave communications. Existing studies demonstrate the potential of sensing- aided beam tracking. However, most studies are focus on single-modal data assistance without considering multi-modal calibration or the impact of inference latency of different sub-modules. Thus, in this study, we design a decision-level multi-modal (mmWave received signal power vector, RGB image and GPS data) fusion for sensing-aided beam tracking (DMBT) method. The proposed DMBT method includes three designed mechanisms, namely normal prediction process, beam misalignment alert and beam tracking correction. The normal prediction process conducts partial beam training instead of exhaustive beam training, which largely reduces large beam training overhead. It also comprehensively selects prediction results from multi-modal data to enhance the DMBT method robustness to noise. The beam misalignment alert based on RGB image and GPS data detects whether there exists beam misalignment and also predict the optimal beam. The beam tracking correction is designed to capture the optimal beam if misalignment happens by reusing certain blocks in normal prediction process and possibly outdated prediction results. Finally, we evaluate the proposed DMBT method in the vehicle-to-infrastructure scenario based on a real-world dataset. The results show that the method is capable of self-correction and mitigating the negative effect of the relative inference latency. Moreover, 75%-93% beam training overhead can be saved to maintain reliable communication even when faced with considerable noise in measurement data.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] MULTI-TARGET TRACKING USING MULTI-MODAL SENSING WITH WAVEFORM CONFIGURATION
    Zhang, Jun Jason
    Papandreou-Suppappola, Antonia
    Rangaswamy, Muralidhar
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 3890 - 3893
  • [22] Text-Guided Multi-Modal Fusion for Underwater Visual Tracking
    Michael, Yonathan
    Alansari, Mohamad
    Javed, Sajid
    2024 IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, AVSS 2024, 2024,
  • [23] Multi-Modal Object Tracking and Image Fusion With Unsupervised Deep Learning
    LaHaye, Nicholas
    Ott, Jordan
    Garay, Michael J.
    El-Askary, Hesham Mohamed
    Linstead, Erik
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2019, 12 (08) : 3056 - 3066
  • [24] Multi-modal retinal imaging with active retinal tracking and wavefront sensing
    Vienola, Kari V.
    Jonnal, Ravi Sankar
    Migacz, Justin V.
    Gorczynska, Iwona
    Zawadzki, Robert
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2020, 61 (07)
  • [25] Split Learning for Sensing-Aided Single and Multi-Level Beam Selection in Multi-Vendor RAN
    Dantas, Ycaro
    Iturria-Rivera, Pedro Enrique
    Zhou, Hao
    Ozcan, Yigit
    Bavand, Majid
    Elsayed, Medhat
    Gaigalas, Raimundas
    Erol-Kantarci, Melike
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6652 - 6657
  • [26] Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing
    Siebert, Tim
    Clasen, Kai Norman
    Ravanbakhsh, Mahdyar
    Demir, Beguem
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XXVIII, 2022, 12267
  • [27] Environment Sensing-aided Beam Prediction with Transfer Learning for Smart Factory
    Zhao, Chuanbing
    Feng, Yuan
    Gao, Feifei
    Zhang, Yong
    Ma, Shaodan
    Poor, H. Vincent
    2024 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA, ICCC, 2024,
  • [28] Environment Sensing-Aided Beam Prediction With Transfer Learning for Smart Factory
    Feng, Yuan
    Zhao, Chuanbin
    Gao, Feifei
    Zhang, Yong
    Ma, Shaodan
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2025, 24 (01) : 676 - 690
  • [29] A Multi-Modal Gaze Tracking Algorithm
    Hou, Zhenjie
    Chao, Xin
    Liang, Jiuzhen
    Yang, Tianjin
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2021, 30 (07)
  • [30] Visual Prompt Multi-Modal Tracking
    Zhu, Jiawen
    Lai, Simiao
    Chen, Xin
    Wang, Dong
    Lu, Huchuan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 9516 - 9526