Large-Scale High-Altitude UAV-Based Vehicle Detection via Pyramid Dual Pooling Attention Path Aggregation Network

被引:5
|
作者
Ying, Zilu [1 ]
Zhou, Jianhong [1 ]
Zhai, Yikui [1 ]
Quan, Hao [2 ]
Li, Wenba [1 ]
Genovese, Angelo [3 ,4 ]
Piuri, Vincenzo [3 ,4 ]
Scotti, Fabio [3 ,4 ]
机构
[1] Wuyi Univ, Sch Elect & Informat Engn, Jiangmen 529020, Guangdong, Peoples R China
[2] Politecn Milan, Dipartimento Elettron Informaz & Bioingn, I-20133 Milan, Italy
[3] Univ Milan, Dept Comp Sci, I-20133 Milan, Italy
[4] Univ Milan, Dipartimento Informat, I-20133 Milan, Italy
关键词
UAV; high-altitude scenes; vehicle detection; attention mechanism; TRACKING;
D O I
10.1109/TITS.2024.3396915
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
UAVs can collect vehicle data in high-altitude scenes, playing a significant role in intelligent urban management due to their wide of view. Nevertheless, the current datasets for UAV-based vehicle detection are acquired at altitude below 150 meters. This contrasts with the data perspective obtained from high-altitude scenes, potentially leading to incongruities in data distribution. Consequently, it is challenging to apply these datasets effectively in high-altitude scenes, and there is an ongoing obstacle. To resolve this challenge, we developed a comprehensive vehicle dataset named LH-UAV-Vehicle, specifically collected at flight altitudes ranging from 250 to 400 meters. Collecting data at higher flight altitudes offers a broader perspective, but it concurrently introduces complexity and diversity in the background, which consequently impacts vehicle localization and recognition accuracy. In response, we proposed the pyramid dual pooling attention path aggregation network (PDPA-PAN), an innovative framework that improves detection performance in high-altitude scenes by combining spatial and semantic information. Object attention integration in both spatial and channel dimensions is aimed by the pyramid dual pooling attention module (PDPAM), which is achieved through the parallel integration of two distinct attention mechanisms. Furthermore, we have individually developed the pyramid pooling attention module (PPAM) and the dual pooling attention module (DPAM). The PPAM emphasizes channel attention, while the DPAM prioritizes spatial attention. This design aims to enhance vehicle information and suppress background interference more effectively. Extensive experiments conducted on the LH-UAV-Vehicle conclusively demonstrate the efficacy of the proposed vehicle detection method. Our code and dataset can be found at https://github.com/yikuizhai/PDPA-PAN.
引用
收藏
页码:14426 / 14444
页数:19
相关论文
共 5 条
  • [1] Doublem-net: multi-scale spatial pyramid pooling-fast and multi-path adaptive feature pyramid network for UAV detection
    Li, Zhongxu
    He, Qihan
    Zhao, Hong
    Yang, Wenyuan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (12) : 5781 - 5805
  • [2] Vehicle Detection in Congested Traffic Based on Simplified Weighted Dual-Path Feature Pyramid Network With Guided Anchoring
    Luo, Jingqing
    Fang, Husheng
    Shao, Faming
    Hu, Cong
    Meng, Fanjie
    IEEE ACCESS, 2021, 9 : 53219 - 53231
  • [3] Context-Aware Feature Extraction Network for High-Precision UAV-Based Vehicle Detection in Urban Environments
    Said, Yahia
    Alassaf, Yahya
    Saidani, Taoufik
    Ghodhbani, Refka
    Ben Rhaiem, Olfa
    Alalawi, Ali Ahmad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (03): : 4349 - 4370
  • [4] Automatic detection method for tobacco beetles combining multi-scale global residual feature pyramid network and dual-path deformable attention
    Chen, Yuling
    Li, Xiaoxia
    Lv, Nianzu
    He, Zhenxiang
    Wu, Bin
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [5] Automatic detection method for tobacco beetles combining multi-scale global residual feature pyramid network and dual-path deformable attention
    Yuling Chen
    Xiaoxia Li
    Nianzu Lv
    Zhenxiang He
    Bin Wu
    Scientific Reports, 14