Multi-Modal Sensing-aided Beam Prediction using Poolformer for UAV Communications

被引:1
作者
Yeo, Yerin [1 ]
Kim, Junghyun [2 ]
机构
[1] Sejong Univ, Dept Artificial Intelligence, Seoul, South Korea
[2] Sejong Univ, Dept Artificial Intelligence & Data Sci, Seoul, South Korea
来源
2024 FIFTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS, ICUFN 2024 | 2024年
基金
新加坡国家研究基金会;
关键词
mmWave communications; beam prediction; deep learning; multi-modal sensing; unmanned aerial vehicle;
D O I
10.1109/ICUFN61752.2024.10625314
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a deep learning technique that uses both camera image data and GPS data to predict the optimal beam for efficient beamforming in UAV communication systems. Previous research has proposed single-modal beam prediction models that use either camera image data or GPS data individually. However, such methods have limitations due to their sensitivity to the measurement environment and outliers. To overcome this, we suggest a new technique based on Poolformer, a derivative model of the transformer, which combines these two types of data. Experimental results show that the proposed Poolformer-based model outperforms the existing model in terms of Top-1, 2, 3 accuracy for both 32 and 64 beams. Notably, the Top-3 accuracy of the proposed model approached nearly 100% accuracy in both experiments.
引用
收藏
页码:202 / 204
页数:3
相关论文
共 50 条
[31]   Artificial Intelligence based Multi-modal sensing for flash flood investigation [J].
Khan, Talha Ahmed ;
Alam, Muhammad ;
Shahid, Zeeshan ;
Ahmed, Syed Faiz ;
Mazliham, M. S. .
2018 5TH IEEE INTERNATIONAL CONFERENCE ON ENGINEERING TECHNOLOGIES AND APPLIED SCIENCES (IEEE ICETAS), 2018,
[32]   Metglas Based Multi-Modal Sensing Employing Magnetostrictive and Triboelectric Properties [J].
Mukherjee, Dibyajyoti ;
Naval, Sourav ;
Beigh, Nadeem Tariq ;
Mallick, Dhiman .
2023 IEEE SENSORS, 2023,
[33]   Multi-Modal Fusion Transformer for Visual Question Answering in Remote Sensing [J].
Siebert, Tim ;
Clasen, Kai Norman ;
Ravanbakhsh, Mahdyar ;
Demir, Beguem .
IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING XXVIII, 2022, 12267
[34]   Energy autonomous hybrid electronic skin with multi-modal sensing capabilities [J].
Zhu, Miaomiao ;
Lou, Mengna ;
Yu, Jianyong ;
Li, Zhaoling ;
Ding, Bin .
NANO ENERGY, 2020, 78
[35]   Evaluating multi-modal image matching algorithms for remote sensing applications [J].
Liu, Zhanquan ;
Han, Yilong ;
Fu, Mingsheng ;
Song, Baiwan ;
Gong, Danchao ;
Ming, Yanfang ;
Sun, Lin .
SIXTH INTERNATIONAL CONFERENCE ON GEOSCIENCE AND REMOTE SENSING MAPPING, GRSM 2024, PT 1, 2025, 13506
[36]   Position Prediction Based Fast Beam Tracking Scheme for Multi-User UAV-mmWave Communications [J].
Ke, Yongning ;
Gao, Hui ;
Xu, Wenjun ;
Li, Lixin ;
Guo, Li ;
Feng, Zhiyong .
ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
[37]   Prediction of Lymph Node Metastasis in Colorectal Cancer Using Intraoperative Fluorescence Multi-Modal Imaging [J].
Zhu, Xiaobo ;
Sun, He ;
Wang, Yuhan ;
Hu, Gang ;
Shao, Lizhi ;
Zhang, Song ;
Liu, Fucheng ;
Chi, Chongwei ;
He, Kunshan ;
Tang, Jianqiang ;
An, Yu ;
Tian, Jie ;
Liu, Zhenyu .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2025, 44 (03) :1568-1580
[38]   Multi-modal deep learning for credit rating prediction using text and numerical data streams [J].
Tavakoli, Mahsa ;
Chandra, Rohitash ;
Tian, Fengrui ;
Bravo, Cristian .
APPLIED SOFT COMPUTING, 2025, 171
[39]   Multi-modal LSTM network for anomaly prediction in piston engine aircraft [J].
Khattak, Waqas Rauf ;
Salman, Ahmad ;
Ghafoor, Salman ;
Latif, Seemab .
HELIYON, 2024, 10 (03)
[40]   Heterogeneous multi-modal graph network for arterial travel time prediction [J].
Fang, Jie ;
He, Hangyu ;
Xu, Mengyun ;
Wu, Xiongwei .
APPLIED INTELLIGENCE, 2025, 55 (06)