Control Distance IoU and Control Distance IoU Loss for Better Bounding Box Regression

被引:33
作者
Dong, Chen [1 ]
Miao, Duoqian [1 ]
机构
[1] Tongji Univ, 4800,Caoan Highway, Shanghai, Peoples R China
关键词
Computer vision; Object detection; IoU; Loss function; NETWORKS; OBJECT;
D O I
10.1016/j.patcog.2022.109256
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Numerous improvements in feedback mechanisms have contributed to the great progress in object detec-tion. In this paper, we first present an evaluation-feedback module, which consists of an evaluation sys-tem and feedback mechanism. Then we analyze and summarize traditional evaluation-feedback modules. We focus on both the evaluation system and the feedback mechanism, and propose Control Distance IoU and Control Distance IoU loss function (CDIoU and CDIoU loss) without increasing parameters in mod-els, which make significant enhancements on several classical and emerging models. Finally, we propose Automatic Ground Truth Clustering (AGTC) and Floating Learning Rate Decay (FLRD) for faster regression in object detection. Experiments show that a coordinated evaluation-feedback module can effectively im-prove model performance. Both CNN and transformer-based detectors with CDIoU + CDIoU loss, AGTC, and FLRD achieve excellent performances. There are a maximum AP improvement of 2.9%, an average AP of 1.1% improvement on MS COCO, a maximum AP improvement of 8.2%, and an average AP improvement of 3.7% on Visdrone dataset.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 38 条
[1]  
[Anonymous], 2016, 4 INT C LEARN REPR S
[2]  
[Anonymous], 2013, arXiv
[3]   Cascade R-CNN: Delving into High Quality Object Detection [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162
[4]   D2Det: Towards High Quality Object Detection and Instance Segmentation [J].
Cao, Jiale ;
Cholakkal, Hisham ;
Anwer, Rao Muhammad ;
Khan, Fahad Shahbaz ;
Pang, Yanwei ;
Shao, Ling .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11482-11491
[5]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[6]  
Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
[7]  
Chen Y., 2022, SCALE AWARE AUTOMATI
[8]   Corner Proposal Network for Anchor-Free, Two-Stage Object Detection [J].
Duan, Kaiwen ;
Xie, Lingxi ;
Qi, Honggang ;
Bai, Song ;
Huang, Qingming ;
Tian, Qi .
COMPUTER VISION - ECCV 2020, PT III, 2020, 12348 :399-416
[9]  
Gao P., 2021, P IEEE CVF C COMP VI, P3621
[10]  
Gao Z., 2022, P IEEECVF C COMPUTER, P5364