DN-DETR: Accelerate DETR Training by Introducing Query DeNoising

被引:366
|
作者
Li, Feng [1 ,2 ,5 ]
Zhang, Hao [1 ,2 ,5 ]
Liu, Shilong [2 ,3 ,5 ]
Guo, Jian [2 ]
Ni, Lionel M. [1 ,4 ]
Zhang, Lei [2 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Int Digital Econ Acad IDEA, Shenzhen, Peoples R China
[3] Tsinghua Univ, Beijing, Peoples R China
[4] Hong Kong Univ Sci & Technol, Guangzhou, Peoples R China
[5] IDEA, Shenzhen, Peoples R China
关键词
D O I
10.1109/CVPR52688.2022.01325
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present in this paper a novel denoising training method to speedup DETR (DEtection TRansformer) training and offer a deepened understanding of the slow convergence issue of DETR-like methods. We show that the slow convergence results from the instability of bipartite graph matching which causes inconsistent optimization goals in early training stages. To address this issue, except for the Hungarian loss, our method additionally feeds ground-truth bounding boxes with noises into Transformer decoder and trains the model to reconstruct the original boxes, which effectively reduces the bipartite graph matching difficulty and leads to a faster convergence. Our method is universal and can be easily plugged into any DETR-like methods by adding dozens of lines of code to achieve a remarkable improvement. As a result, our DN-DETR results in a remarkable improvement (+1.9AP) under the same setting and achieves the best result (AP 43.4 and 48.6 with 12 and 50 epochs of training respectively) among DETR-like methods with ResNet-50 backbone. Compared with the baseline under the same setting, DN-DETR achieves comparable performance with 50% training epochs. Code is available at https://github.com/FengLi-ust/DN-DETR.
引用
收藏
页码:13609 / 13617
页数:9
相关论文
共 28 条
  • [21] Towards Hard-Positive Query Mining for DETR-Based Human-Object Interaction Detection
    Zhong, Xubin
    Ding, Changxing
    Li, Zijian
    Huang, Shaoli
    COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 444 - 460
  • [22] ONDA-DETR: ONLINE DOMAIN ADAPTATION FOR DETECTION TRANSFORMERS WITH SELF-TRAINING FRAMEWORK
    Suzuki, Satoshi
    Yamane, Taiga
    Makishima, Naoki
    Suzuki, Keita
    Ando, Atsushi
    Masumura, Ryo
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1780 - 1784
  • [23] FSDN-DETR: Enhancing Fuzzy Systems Adapter with DeNoising Anchor Boxes for Transfer Learning in Small Object Detection
    Li, Zhijie
    Zhang, Jiahui
    Zhang, Yingjie
    Yan, Dawei
    Zhang, Xing
    Wozniak, Marcin
    Dong, Wei
    MATHEMATICS, 2025, 13 (02)
  • [24] DETR and YOLOv5: Exploring Performance and Self-Training for Diabetic Foot Ulcer Detection
    Bruengel, Raphael
    Friedrich, Christoph M.
    2021 IEEE 34TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2021, : 148 - 153
  • [25] FS-DETR: Few-Shot DEtection TRansformer with prompting and without re-training
    Bulat, Adrian
    Guerrero, Ricardo
    Martinez, Brais
    Tzimiropoulos, Georgios
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 11759 - 11768
  • [26] Power-DETR: end-to-end power line defect components detection based on contrastive denoising and hybrid label assignment
    Xie, Zhiyuan
    Dong, Chao
    Zhang, Ke
    Wang, Jiacun
    Xiao, Yangjie
    Guo, Xiwang
    Zhao, Zhenbing
    Shi, Chaojun
    Zhao, Wei
    IET GENERATION TRANSMISSION & DISTRIBUTION, 2024, 18 (20) : 3264 - 3277
  • [27] CCDN-DETR: A Detection Transformer Based on Constrained Contrast Denoising for Multi-Class Synthetic Aperture Radar Object Detection
    Zhang, Lei
    Zheng, Jiachun
    Li, Chaopeng
    Xu, Zhiping
    Yang, Jiawen
    Wei, Qiuxin
    Wu, Xinyi
    SENSORS, 2024, 24 (06)
  • [28] Adltformer Team-Training with Detr: Enhancing Cattle Detection in Non-Ideal Lighting Conditions Through Adaptive Image Enhancement
    Zheng, Zhiqiang
    Wang, Mengbo
    Zhao, Xiaoyu
    Weng, Zhi
    ANIMALS, 2024, 14 (24):