UAV Object Detection Based on Joint YOLO and Transformer

被引:0
作者
Gao, Yifan [1 ]
Ding, Rui [1 ]
Zhou, Fuhui [1 ]
Wu, Qihui [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Nanjing, Peoples R China
来源
2024 INTERNATIONAL CONFERENCE ON UBIQUITOUS COMMUNICATION, UCOM 2024 | 2024年
关键词
Object detection; UAV; Transformer; You Only Look Once;
D O I
10.1109/UCOM62433.2024.10695939
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the gradual expansion of computer vision application fields, the demand for object detection based on unmanned aerial vehicle (UAV) aerial images continues to grow. Traditional methods have limitations in handling scale changes, motion blur, and complex backgrounds. We propose a novel approach that combines the model You Only Look Once version 5 based on convolutional neural network with the sequence modeling technology Transformer to better capture long-range dependencies and contextual information, thereby improving detection performance. Experimental results on the VisDrone dataset show that the proposed method has comparable performance to existing methods, demonstrating its effectiveness in UAV object detection.
引用
收藏
页码:202 / 206
页数:5
相关论文
共 25 条
[1]   Object Detection Methods for Improving UAV Autonomy and Remote Sensing Applications [J].
Aposporis, Panagiotis .
2020 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2020, :845-853
[2]  
Bharati Puja, 2020, Computational Intelligence in Pattern Recognition. Proceedings of CIPR 2019. Advances in Intelligent Systems and Computing (AISC 999), P657, DOI 10.1007/978-981-13-9042-5_56
[3]   Cascade R-CNN: Delving into High Quality Object Detection [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162
[4]   Performance Enhancement of INS/GNSS/Refreshed-SLAM Integration for Acceptable Lane-Level Navigation Accuracy [J].
Chiang, Kai-Wei ;
Tsai, Guang-Je ;
Chu, Hone-Jay ;
El-Sheimy, Naser .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (03) :2463-2476
[5]   An Efficient Multiple Object Detection and Tracking Framework for Automatic Counting and Video Surveillance Applications [J].
del-Blanco, Carlos R. ;
Jaureguizar, Fernando ;
Garcia, Narciso .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (03) :857-862
[6]   Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges [J].
Ding, Jian ;
Xue, Nan ;
Xia, Gui-Song ;
Bai, Xiang ;
Yang, Wen ;
Yang, Michael Ying ;
Belongie, Serge ;
Luo, Jiebo ;
Datcu, Mihai ;
Pelillo, Marcello ;
Zhang, Liangpei .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) :7778-7796
[7]   Object detection using YOLO: challenges, architectural successors, datasets and applications [J].
Diwan, Tausif ;
Anirudh, G. ;
Tembhurne, Jitendra, V .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (06) :9243-9275
[8]  
Ge Z, 2021, Arxiv, DOI arXiv:2107.08430
[9]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[10]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]