Multi-Modal Sensing-aided Beam Prediction using Poolformer for UAV Communications

Cited by: 1
Authors
Yeo, Yerin [1 ]
Kim, Junghyun [2 ]
Affiliations
[1] Sejong Univ, Dept Artificial Intelligence, Seoul, South Korea
[2] Sejong Univ, Dept Artificial Intelligence & Data Sci, Seoul, South Korea
Source
2024 FIFTEENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS, ICUFN 2024 | 2024
Funding
National Research Foundation of Singapore;
Keywords
mmWave communications; beam prediction; deep learning; multi-modal sensing; unmanned aerial vehicle;
DOI
10.1109/ICUFN61752.2024.10625314
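Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes
0808 ; 0809 ;
Abstract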
In this paper, we propose a deep learning technique that uses both camera image data and GPS data to predict the optimal beam for efficient beamforming in UAV communication systems. Previous research has proposed single-modal beam prediction models that use either camera image data or GPS data alone. However, such methods are limited by their sensitivity to the measurement environment and to outliers. To overcome this, we propose a new technique based on Poolformer, a model derived from the transformer, that combines these two types of data. Experimental results show that the proposed Poolformer-based model outperforms the existing model in terms of Top-1, Top-2, and Top-3 accuracy for both 32 and 64 beams. Notably, the Top-3 accuracy of the proposed model approached 100% in both experiments.
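The abstract reports results as Top-1, Top-2, and Top-3 accuracy over a codebook of 32 or 64 beams. As a minimal sketch of how such a metric is commonly computed (this is not the authors' code; the function name `top_k_accuracy`, the toy scores, and the 4-beam codebook are assumptions made for illustration), a prediction counts as correct when the true beam index is among the k beams with the highest predicted scores:

```python
from typing import Sequence


def top_k_accuracy(
    scores: Sequence[Sequence[float]], labels: Sequence[int], k: int
) -> float:
    """Fraction of samples whose true beam index is among the k highest scores."""
    if len(scores) != len(labels) or not labels:
        raise ValueError("scores and labels must be non-empty and equal in length")
    hits = 0
    for row, label in zip(scores, labels):
        # Beam indices sorted from highest to lowest predicted score.
        ranked = sorted(range(len(row)), key=lambda i: row[i], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return hits / len(labels)


# Hypothetical toy data: 3 samples, 4 candidate beams (the paper uses 32 and 64).
scores = [
    [0.10, 0.60, 0.20, 0.10],  # ranking: 1, 2, 0, 3
    [0.40, 0.30, 0.20, 0.10],  # ranking: 0, 1, 2, 3
    [0.05, 0.15, 0.30, 0.50],  # ranking: 3, 2, 1, 0
]
labels = [1, 1, 1]  # assumed ground-truth optimal beam per sample

for k in (1, 2, 3):
    print(f"Top-{k} accuracy: {top_k_accuracy(scores, labels, k):.3f}")
```

Top-k accuracy can only stay the same or increase as k grows, which is consistent with the Top-3 figure being the highest of the three reported values.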
Pages: 202-204
Page count: 3