Multi-Modal Sensing-aided Beam Prediction using Poolformer for UAV Communications

被引:1
作者
Yeo, Yerin [1 ]
Kim, Junghyun [2 ]
机构
[1] Sejong Univ, Dept Artificial Intelligence, Seoul, South Korea
[2] Sejong Univ, Dept Artificial Intelligence & Data Sci, Seoul, South Korea
来源
2024 FIFTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS, ICUFN 2024 | 2024年
基金
新加坡国家研究基金会;
关键词
mmWave communications; beam prediction; deep learning; multi-modal sensing; unmanned aerial vehicle;
D O I
10.1109/ICUFN61752.2024.10625314
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a deep learning technique that uses both camera image data and GPS data to predict the optimal beam for efficient beamforming in UAV communication systems. Previous research has proposed single-modal beam prediction models that use either camera image data or GPS data individually. However, such methods have limitations due to their sensitivity to the measurement environment and outliers. To overcome this, we suggest a new technique based on Poolformer, a derivative model of the transformer, which combines these two types of data. Experimental results show that the proposed Poolformer-based model outperforms the existing model in terms of Top-1, 2, 3 accuracy for both 32 and 64 beams. Notably, the Top-3 accuracy of the proposed model approached nearly 100% accuracy in both experiments.
引用
收藏
页码:202 / 204
页数:3
相关论文
共 50 条
[41]   MDNNSyn: A Multi-Modal Deep Learning Framework for Drug Synergy Prediction [J].
Li, Lei ;
Li, Haitao ;
Ishdorj, Tseren-Onolt ;
Zheng, Chunhou ;
Su, Yansen .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (10) :6225-6236
[42]   Monitoring In-Home Emergency Situation and Preserve Privacy using Multi-modal Sensing and Deep Learning [J].
Bordvik, David Andreas ;
Hou, Jie ;
Noori, Farzan M. ;
Uddin, Md Zia ;
Torresen, Jim .
2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2022,
[43]   Air Pollution Prediction with Multi-Modal Data and Deep Neural Networks [J].
Kalajdjieski, Jovan ;
Zdravevski, Eftim ;
Corizzo, Roberto ;
Lameski, Petre ;
Kalajdziski, Slobodan ;
Pires, Ivan Miguel ;
Garcia, Nuno M. ;
Trajkovik, Vladimir .
REMOTE SENSING, 2020, 12 (24) :1-19
[44]   MultiSenseNet: Multi-Modal Deep Learning for Machine Failure Risk Prediction [J].
Rahman, Mostafijur ;
Hossain, Md Sabbir ;
Rozario, Uland ;
Roy, Satyabrata ;
Mridha, M. F. ;
Dey, Nilanjan .
IEEE ACCESS, 2025, 13 :120404-120416
[45]   Multi-modal Vehicle Trajectory Prediction Method Based on Structured Road [J].
Hu, Jie ;
Wu, Zuo-Wei ;
Zhang, Zhi-Ling ;
Zhao, Wen-Long ;
Dai, Yi-Peng .
Zhongguo Gonglu Xuebao/China Journal of Highway and Transport, 2025, 38 (02) :286-295
[46]   Attention-Aided Generative Learning for Multi-Scale Multi-Modal Fundus Image Translation [J].
Pham, Van-Nguyen ;
Le, Duc-Tai ;
Bum, Junghyun ;
Lee, Eun Jung ;
Han, Jong Chul ;
Choo, Hyunseung .
IEEE ACCESS, 2023, 11 :51701-51711
[47]   MMEC: Multi-Modal Ensemble Classifier for Protein Secondary Structure Prediction [J].
de Oliveira, Gabriel Bianchin ;
Pedrini, Helio ;
Dias, Zanoni .
COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2021, PT 1, 2021, 13052 :175-184
[48]   ADFusion: Multi-modal adaptive deep fusion for cancer subtype prediction [J].
Zhang, Ziye ;
Huang, Weixian ;
Wang, Shijin ;
Tan, Kaiwen ;
Zheng, Xiaorou ;
Dong, Shoubin .
INFORMATION FUSION, 2025, 121
[49]   Deep-Learning for Change Detection Using Multi-Modal Fusion of Remote Sensing Images: A Review [J].
Saidi, Souad ;
Idbraim, Soufiane ;
Karmoude, Younes ;
Masse, Antoine ;
Arbelo, Manuel .
REMOTE SENSING, 2024, 16 (20)
[50]   Integrated Sensing and Communications Toward Proactive Beamforming in mmWave V2I via Multi-Modal Feature Fusion (MMFF) [J].
Zhang, Haotian ;
Gao, Shijian ;
Cheng, Xiang ;
Yang, Liuqing .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (11) :15721-15735