Dynamic multi-scale loss optimization for object detection

被引:0
作者
Yihao Luo
Xiang Cao
Juntao Zhang
Peng Cheng
Tianjiang Wang
Qi Feng
机构
[1] Huazhong University of Science and Technology,School of Computer Science and Technology
[2] Coolanyp Limited Liability Company,undefined
来源
Multimedia Tools and Applications | 2023年 / 82卷
关键词
Object detection; Multi-scale imbalance; Reinforcement learning; Multi-task;
D O I
暂无
中图分类号
学科分类号
摘要
With the continuous improvement of deep object detectors via advanced model architectures, imbalance problems in the training process have received more attention. It is a common paradigm in object detection frameworks to perform multi-scale detection. However, each scale is treated equally during training. In this paper, we carefully study the objective imbalance of multi-scale detector training. We argue that the loss in each scale level is neither equally important nor independent. Different from the existing solutions of setting multi-task weights, we dynamically optimize the loss weight of each scale level in the training process. Specifically, we propose an Adaptive Variance Weighting (AVW) to balance multi-scale loss according to the statistical variance. Then we develop a novel Reinforcement Learning Optimization (RLO) to decide the weighting scheme probabilistically during training. It makes better utilization of multi-scale training loss without extra computational complexity and learnable parameters for backpropagation. Without bells and whistles, the proposed method improves ATSS by 0.9 AP on the MS COCO benchmark. And it achieves 82.1 mAP on Pascal VOC 2007 test set, which outperforms other reinforcement-learning-based methods.
引用
收藏
页码:2349 / 2367
页数:18
相关论文
共 50 条
[31]   Multi-Scale Feature Similarity and Object Detection for Small Printing Defects Detection [J].
Lou, Haojie ;
Zheng, Yuanlin ;
Chen, Wenqian ;
Liu, Haiwen .
IEEE ACCESS, 2024, 12 :196403-196412
[32]   YOLO-MFD: Remote Sensing Image Object Detection with Multi-Scale Fusion Dynamic Head [J].
Zhang, Zhongyuan ;
Zhu, Wenqiu .
CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (02) :2547-2563
[33]   A multi-scale feature representation and interaction network for underwater object detection [J].
Yuan, Jiaojiao ;
Hu, Yongli ;
Sun, Yanfeng ;
Yin, Baocai .
IET COMPUTER VISION, 2023, 17 (03) :265-281
[34]   Enhanced Multi-Scale Object Detection Algorithm for Foggy Traffic Scenarios [J].
Wang, Honglin ;
Shi, Zitong ;
Zhu, Cheng .
CMC-COMPUTERS MATERIALS & CONTINUA, 2025, 82 (02) :2451-2474
[35]   High-Level Semantic Networks for Multi-Scale Object Detection [J].
Cao, Jiale ;
Pang, Yanwei ;
Zhao, Shengjie ;
Li, Xuelong .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (10) :3372-3386
[36]   Gated CNN: Integrating multi-scale feature layers for object detection [J].
Yuan, Jin ;
Xiong, Heng-Chang ;
Xiao, Yi ;
Guan, Weili ;
Wang, Meng ;
Hong, Richang ;
Li, Zhi-Yong .
PATTERN RECOGNITION, 2020, 105
[37]   Enhanced SSD with interactive multi-scale attention features for object detection [J].
Zhou, Shuren ;
Qiu, Jia .
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (08) :11539-11556
[38]   Multi-Scale Feature Fusion Based Adaptive Object Detection for UAV [J].
Liu Fang ;
Wu Zhiwei ;
Yang Anzhe ;
Han Xiao .
ACTA OPTICA SINICA, 2020, 40 (10)
[39]   MULTI-SCALE SAMPLE SELECTION BASED ON STATISTICAL CHARACTERISTICS FOR OBJECT DETECTION [J].
Li, Zhiguo ;
Yuan, Yuan ;
Ma, Dandan .
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :1485-1489
[40]   Pyramid attention object detection network with multi-scale feature fusion [J].
Chen, Xiu ;
Li, Yujie ;
Nakatoh, Yoshihisa .
COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104