DETR with Improved DeNoising Training for Multi-Scale Oriented Object Detection in Optical Remote Sensing Images (Invited)

被引:2
|
作者
Jin Ruijiao [1 ,2 ]
Wang Kun [1 ,2 ]
Liu Minhao [1 ,2 ]
Teng Xichao [1 ,2 ]
Li Zhang [1 ,2 ]
Yu Qifeng [1 ,2 ]
机构
[1] Natl Univ Def Technol, Coll Aerosp Sci & Engn, Changsha 410000, Hunan, Peoples R China
[2] Hunan Key Lab Image Measurement & Vision Nav, Changsha 410000, Hunan, Peoples R China
关键词
oriented object detection; DETR object detector; multi-scale rotated deformable attention;
D O I
10.3788/LOP240502
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Oriented object detection is one of the important tasks in remote sensing image interpretation, which faces typical problems such as arbitrary object orientation, dense arrangement of small targets, and angular periodicity caused by target representation, thus, this paper proposes a method called arbitrary- oriented object detection Transformer with improved deNoising anchor boxes (AO(2)DINO) which based on DEtection Transformer (DETR) and improved denoising training. First, a multi-scale rotated deformable attention ( MS-RDA) module is proposed. The MS-RDA module introduces the angle information in the form of rotation matrix for the calculation of attention weights, which improves the adaptability of the model to the orientated objects. Second, this paper proposes a self-adaption assigner (SAA), which uses the rotated intersection over union (IoU) and adaptive threshold to accurately separate dense targets, to improve the small targets detection under the dense arrangement scenarios. Finally, the Kalman filtering IoU (KFIoU) is introduced as the regression loss to solve the angular periodicity problem caused by the representation of orientated objects. Our proposed method is compared with the typical oriented bounding box ( OBB) methods on two public datasets, DOTAv1. 0 and DIOR-R, and the detection accuracy is the highest among the DETR-based OBB methods, and the convergence speed is faster during training, which only needs 12 training epochs to achieve comparable detection accuracy as other methods using 36 training epochs.
引用
收藏
页数:11
相关论文
共 30 条
  • [1] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [2] Anchor-Free Oriented Proposal Generator for Object Detection
    Cheng, Gong
    Wang, Jiabao
    Li, Ke
    Xie, Xingxing
    Lang, Chunbo
    Yao, Yanqing
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [3] AO2-DETR: Arbitrary-Oriented Object Detection Transformer
    Dai, Linhui
    Liu, Hong
    Tang, Hao
    Wu, Zhiwei
    Song, Pinhao
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (05) : 2342 - 2356
  • [4] Learning RoI Transformer for Oriented Object Detection in Aerial Images
    Ding, Jian
    Xue, Nan
    Long, Yang
    Xia, Gui-Song
    Lu, Qikai
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2844 - 2853
  • [5] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
  • [6] Align Deep Features for Oriented Object Detection
    Han, Jiaming
    Ding, Jian
    Li, Jie
    Xia, Gui-Song
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [7] ReDet: A Rotation-equivariant Detector for Aerial Object Detection
    Han, Jiaming
    Ding, Jian
    Xue, Nan
    Xia, Gui-Song
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2785 - 2794
  • [8] DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
    Li, Feng
    Zhang, Hao
    Liu, Shilong
    Guo, Jian
    Ni, Lionel M.
    Zhang, Lei
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13609 - 13617
  • [9] Li K, 2019, Arxiv, DOI arXiv:1909.00133
  • [10] Li Z, 2023, AAAI CONF ARTIF INTE, P1504