DANet: Multi-scale UAV Target Detection with Dynamic Feature Perception and Scale-aware Knowledge Distillation

被引:1
作者
Fang, Houzhang [1 ]
Liao, Zikai [1 ]
Wang, Lu [1 ]
Li, Qingshan [1 ]
Chang, Yi [2 ]
Yan, Luxin [2 ]
Wang, Xuhua [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan, Peoples R China
来源
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年
基金
中国国家自然科学基金;
关键词
Unmanned aerial vehicle; multi-scale infrared target detection; attention mechanism; contrastive learning; knowledge distillation;
D O I
10.1145/3581783.3612146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-scale infrared unmanned aerial vehicle ( UAV) targets (IRUTs) detection under dynamic scenarios remains a challenging task due to weak target features, varying shapes and poses, and complex background interference. Current detection methods find it difficult to address the above issues accurately and efficiently. In this paper, we design a dynamic attentive network (DANet) incorporating a scale-adaptive feature enhancement mechanism (SaFEM) and an attention-guided cross-weighting feature aggregator (ACFA). The SaFEM adaptively adjusts the network's receptive fields at hierarchical network levels leveraging separable deformable convolution (SDC), which enhances the network's multi-scale IRUT awareness. The ACFA, modulated by two crossing attention mechanisms, strengthens structural and semantic properties on neighboring levels for the accurate representation of multi-scale IRUT features from different levels. A plug-and-play anti-distractor contrastive regularization (ADCR) is also imposed on our DANet, which enforces similarity on features of targets and distractors from a new uncompressed feature projector (UFP) to increase the network's anti-distractor ability in complex backgrounds. To further increase the multi-scale UAV detection performance of DANet while maintaining its efficiency superiority, we propose a novel scale-specific knowledge distiller (SSKD) based on a divide-and-conquer strategy. For the "divide" stage, we intendedly construct three task-oriented teachers to learn tailored knowledge for small-, medium-, and largescale IRUTs. For the "conquer" stage, we propose a novel elementwise attentive distillation module (EADM), where we employ a pixel-wise attention mechanism to highlight teacher and student IRUT features, and incorporate IRUT-associated prior knowledge for the collaborative transfer of refined multi-scale IRUT features to our DANet. Extensive experiments on real infrared UAV datasets demonstrate that our DANet is able to detect multi-scale UAVs with a satisfactory balance between accuracy and efficiency.
引用
收藏
页码:2121 / 2130
页数:10
相关论文
共 52 条
  • [1] Local Patch Network With Global Attention for Infrared Small Target Detection
    Chen, Fang
    Gao, Chenqiang
    Liu, Fangcen
    Zhao, Yue
    Zhou, Yuxi
    Meng, Deyu
    Zuo, Wangmeng
    [J]. IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2022, 58 (05) : 3979 - 3991
  • [2] Deformable Convolutional Networks
    Dai, Jifeng
    Qi, Haozhi
    Xiong, Yuwen
    Li, Yi
    Zhang, Guodong
    Hu, Han
    Wei, Yichen
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 764 - 773
  • [3] Attentional Local Contrast Networks for Infrared Small Target Detection
    Dai, Yimian
    Wu, Yiquan
    Zhou, Fei
    Barnard, Kobus
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (11): : 9813 - 9824
  • [4] Asymmetric Contextual Modulation for Infrared Small Target Detection
    Dai, Yimian
    Wu, Yiquan
    Zhou, Fei
    Barnard, Kobus
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 949 - 958
  • [5] RepVGG: Making VGG-style ConvNets Great Again
    Ding, Xiaohan
    Zhang, Xiangyu
    Ma, Ningning
    Han, Jungong
    Ding, Guiguang
    Sun, Jian
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13728 - 13737
  • [6] Differentiated Attention Guided Network Over Hierarchical and Aggregated Features for Intelligent UAV Surveillance
    Fang, Houzhang
    Liao, Zikai
    Wang, Xuhua
    Chang, Yi
    Yan, Luxin
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (09) : 9909 - 9920
  • [7] Infrared Small UAV Target Detection Based on Residual Image Prediction via Global and Local Dilated Residual Networks
    Fang, Houzhang
    Xia, Mingjiang
    Zhou, Gang
    Chang, Yi
    Yan, Luxin
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [8] 3Global Second-order Pooling Convolutional Networks
    Gao, Zilin
    Xie, Jiangtao
    Wang, Qilong
    Li, Peihua
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3019 - 3028
  • [9] Multilevel Attention-Based Sample Correlations for Knowledge Distillation
    Gou, Jianping
    Sun, Liyuan
    Yu, Baosheng
    Wan, Shaohua
    Ou, Weihua
    Yi, Zhang
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (05) : 7099 - 7109
  • [10] Detection, Tracking, and Interdiction for Amateur Drones
    Guevenc, Ismail
    Koohifar, Farshad
    Singh, Simran
    Sichitiu, Mihail L.
    Matolak, David
    [J]. IEEE COMMUNICATIONS MAGAZINE, 2018, 56 (04) : 75 - 81