Dynamic multi-scale loss optimization for object detection

被引:0
作者
Yihao Luo
Xiang Cao
Juntao Zhang
Peng Cheng
Tianjiang Wang
Qi Feng
机构
[1] Huazhong University of Science and Technology,School of Computer Science and Technology
[2] Coolanyp Limited Liability Company,undefined
来源
Multimedia Tools and Applications | 2023年 / 82卷
关键词
Object detection; Multi-scale imbalance; Reinforcement learning; Multi-task;
D O I
暂无
中图分类号
学科分类号
摘要
With the continuous improvement of deep object detectors via advanced model architectures, imbalance problems in the training process have received more attention. It is a common paradigm in object detection frameworks to perform multi-scale detection. However, each scale is treated equally during training. In this paper, we carefully study the objective imbalance of multi-scale detector training. We argue that the loss in each scale level is neither equally important nor independent. Different from the existing solutions of setting multi-task weights, we dynamically optimize the loss weight of each scale level in the training process. Specifically, we propose an Adaptive Variance Weighting (AVW) to balance multi-scale loss according to the statistical variance. Then we develop a novel Reinforcement Learning Optimization (RLO) to decide the weighting scheme probabilistically during training. It makes better utilization of multi-scale training loss without extra computational complexity and learnable parameters for backpropagation. Without bells and whistles, the proposed method improves ATSS by 0.9 AP on the MS COCO benchmark. And it achieves 82.1 mAP on Pascal VOC 2007 test set, which outperforms other reinforcement-learning-based methods.
引用
收藏
页码:2349 / 2367
页数:18
相关论文
共 50 条
[11]   Multi-scale structural kernel representation for object detection [J].
Wang, Hao ;
Wang, Qilong ;
Li, Peihua ;
Zuo, Wangmeng .
PATTERN RECOGNITION, 2021, 110
[12]   AUTONOMOUS MULTI-SCALE OBJECT DETECTION WITH HOUGH FORESTS [J].
Scalzo, Maria ;
Velipasalar, Senem .
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, :1643-1647
[13]   StairsNet: Mixed Multi-scale Network for Object Detection [J].
Gao, Weiyi ;
Cao, Wenlong ;
Zhai, Jian ;
Rui, Jianwu .
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 :303-314
[14]   Multi-scale HOG Feature Used in Object Detection [J].
Li, Jin ;
Zhang, Hong ;
Zhang, Lei ;
Li, Yawei ;
Kang, Qiaochu ;
Luo, Zhaohui ;
Wu, Yujie .
TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
[15]   MGFPN: Enhancing multi-scale feature for object detection [J].
He, Weiming ;
Wu, You ;
Xiao, Jing ;
Cao, Yang .
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (06) :11171-11181
[16]   Multi-scale redistribution feature pyramid for object detection [J].
Qian, Huifang ;
Guo, Jiahao ;
Zhou, Xuan .
AI COMMUNICATIONS, 2022, 35 (01) :15-30
[17]   Object Detection Using Multi-Scale Balanced Sampling [J].
Yu, Hang ;
Gong, Jiulu ;
Chen, Derong .
APPLIED SCIENCES-BASEL, 2020, 10 (17)
[18]   Multi-scale coupled attention for visual object detection [J].
Li, Fei ;
Yan, Hongping ;
Shi, Linsu .
SCIENTIFIC REPORTS, 2024, 14 (01)
[19]   Multi-Scale Feature Attention-DEtection TRansformer: Multi-Scale Feature Attention for security check object detection [J].
Sima, Haifeng ;
Chen, Bailiang ;
Tang, Chaosheng ;
Zhang, Yudong ;
Sun, Junding .
IET COMPUTER VISION, 2024, 18 (05) :613-625
[20]   An efficient algorithm for multi-scale maritime object detection and recognition [J].
Liu, Yang ;
Yi, Ran ;
Ma, Ding ;
Wang, Yongfu .
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2024, 46 (03) :7259-7271