DN-DETR: Accelerate DETR Training by Introducing Query DeNoising

被引:366
|
作者
Li, Feng [1 ,2 ,5 ]
Zhang, Hao [1 ,2 ,5 ]
Liu, Shilong [2 ,3 ,5 ]
Guo, Jian [2 ]
Ni, Lionel M. [1 ,4 ]
Zhang, Lei [2 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Int Digital Econ Acad IDEA, Shenzhen, Peoples R China
[3] Tsinghua Univ, Beijing, Peoples R China
[4] Hong Kong Univ Sci & Technol, Guangzhou, Peoples R China
[5] IDEA, Shenzhen, Peoples R China
关键词
D O I
10.1109/CVPR52688.2022.01325
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present in this paper a novel denoising training method to speedup DETR (DEtection TRansformer) training and offer a deepened understanding of the slow convergence issue of DETR-like methods. We show that the slow convergence results from the instability of bipartite graph matching which causes inconsistent optimization goals in early training stages. To address this issue, except for the Hungarian loss, our method additionally feeds ground-truth bounding boxes with noises into Transformer decoder and trains the model to reconstruct the original boxes, which effectively reduces the bipartite graph matching difficulty and leads to a faster convergence. Our method is universal and can be easily plugged into any DETR-like methods by adding dozens of lines of code to achieve a remarkable improvement. As a result, our DN-DETR results in a remarkable improvement (+1.9AP) under the same setting and achieves the best result (AP 43.4 and 48.6 with 12 and 50 epochs of training respectively) among DETR-like methods with ResNet-50 backbone. Compared with the baseline under the same setting, DN-DETR achieves comparable performance with 50% training epochs. Code is available at https://github.com/FengLi-ust/DN-DETR.
引用
收藏
页码:13609 / 13617
页数:9
相关论文
共 28 条
  • [1] DN-DETR: Accelerate DETR Training by Introducing Query DeNoising
    Li, Feng
    Zhang, Hao
    Liu, Shilong
    Guo, Jian
    Ni, Lionel M.
    Zhang, Lei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (04) : 2239 - 2251
  • [2] AParC-DETR: Accelerate DETR training by introducing Adaptive Position-aware Circular Convolution
    Guan, Ya'nan
    Liao, Shujiao
    Yang, Wenyuan
    VISUAL COMPUTER, 2025, 41 (02): : 1319 - 1333
  • [3] Do-DETR: enhancing DETR training convergence with integrated denoising and RoI mechanism
    Liang, Hong
    Li, Yu
    Zhang, Qian
    Shao, Mingwen
    MULTIMEDIA SYSTEMS, 2025, 31 (02)
  • [4] Do-DETR: enhancing DETR training convergence with integrated denoising and RoI mechanismDO-DETR: enhancing DETR training convergence with integrated denoising and RoI mechanismH. Liang et al.
    Hong Liang
    Yu Li
    Qian Zhang
    Mingwen Shao
    Multimedia Systems, 2025, 31 (3)
  • [5] DQ-DETR: DETR with Dynamic Query for Tiny Object Detection
    Huang, Yi-Xin
    Liu, Hou-, I
    Shuai, Hong-Han
    Cheng, Wen-Huang
    COMPUTER VISION - ECCV 2024, PT LXXVI, 2025, 15134 : 290 - 305
  • [6] Teach-DETR: Better Training DETR With Teachers
    Huang, Linjiang
    Lu, Kaixin
    Song, Guanglu
    Wang, Liang
    Liu, Si
    Liu, Yu
    Li, Hongsheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 15759 - 15771
  • [7] CLS-DETR: A DETR-series object detection network using classification information to accelerate convergence
    Li, Shibao
    Jia, Zekun
    Liu, Yixuan
    Cui, Xuerong
    Liu, Jianhang
    Huang, Tingpei
    Xu, Jiuyun
    PATTERN RECOGNITION LETTERS, 2023, 165 : 168 - 175
  • [8] Conditional DETR for Fast Training Convergence
    Meng, Depu
    Chen, Xiaokang
    Fan, Zejia
    Zeng, Gang
    Li, Houqiang
    Yuan, Yuhui
    Sun, Lei
    Wang, Jingdong
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3631 - 3640
  • [9] Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
    Chen, Qiang
    Chen, Xiaokang
    Wang, Jian
    Zhang, Shan
    Yao, Kun
    Feng, Haocheng
    Han, Junyu
    Ding, Errui
    Zeng, Gang
    Wang, Jingdong
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6610 - 6619
  • [10] DETR-ORD: An Improved DETR Detector for Oriented Remote Sensing Object Detection with Feature Reconstruction and Dynamic Query
    He, Xiaohai
    Liang, Kaiwen
    Zhang, Weimin
    Li, Fangxing
    Jiang, Zhou
    Zuo, Zhengqing
    Tan, Xinyan
    REMOTE SENSING, 2024, 16 (18)