Dynamic multi-scale loss optimization for object detection

Cited by: 0
Authors
Yihao Luo
Xiang Cao
Juntao Zhang
Peng Cheng
Tianjiang Wang
Qi Feng
Affiliations
[1] Huazhong University of Science and Technology, School of Computer Science and Technology
[2] Coolanyp Limited Liability Company
Source
Multimedia Tools and Applications | 2023 / Vol. 82
Keywords
Object detection; Multi-scale imbalance; Reinforcement learning; Multi-task
DOI
Not available
Abstract
As deep object detectors continue to improve through advanced model architectures, imbalance problems in the training process have received more attention. Multi-scale detection is a common paradigm in object detection frameworks, yet each scale is treated equally during training. In this paper, we carefully study the objective imbalance of multi-scale detector training. We argue that the losses at different scale levels are neither equally important nor independent. Unlike existing solutions that set multi-task weights, we dynamically optimize the loss weight of each scale level during training. Specifically, we propose Adaptive Variance Weighting (AVW) to balance the multi-scale loss according to its statistical variance. We then develop a novel Reinforcement Learning Optimization (RLO) to decide the weighting scheme probabilistically during training. The method makes better use of the multi-scale training loss without adding computational complexity or learnable parameters for backpropagation. Without bells and whistles, the proposed method improves ATSS by 0.9 AP on the MS COCO benchmark. It also achieves 82.1 mAP on the Pascal VOC 2007 test set, outperforming other reinforcement-learning-based methods.
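The abstract does not give the AVW formula, so the following is only a minimal sketch of the general idea of weighting per-scale losses by their statistical variance. The function names, the `alpha` blending parameter, and the choice to give higher-variance levels larger weights are all assumptions made for illustration and are not taken from the paper.

```python
from statistics import pvariance


def variance_weights(loss_history, alpha=0.5):
    """Return one loss weight per scale level (hypothetical rule, not the paper's).

    loss_history[i] is a list of recent loss values for scale level i.
    Weights are a blend of uniform weighting and variance-proportional
    weighting, scaled so that they sum to the number of levels.
    """
    n = len(loss_history)
    variances = [pvariance(history) for history in loss_history]
    total = sum(variances)
    if total == 0:
        # No level fluctuates: fall back to equal weighting.
        return [1.0] * n
    return [(1 - alpha) + alpha * n * v / total for v in variances]


def weighted_loss(level_losses, weights):
    """Combine the current per-level losses with the given weights."""
    return sum(w * l for w, l in zip(weights, level_losses))


# Toy loss histories for three scale levels (made-up numbers).
history = [
    [1.0, 1.0, 1.0, 1.0],  # constant loss, zero variance
    [1.0, 3.0, 1.0, 3.0],  # moderate fluctuation
    [0.0, 6.0, 0.0, 6.0],  # strong fluctuation
]
weights = variance_weights(history)
total = weighted_loss([1.0, 1.0, 1.0], weights)
```

In this sketch the weights are computed from loss statistics alone, so they add no learnable parameters to backpropagation, which matches the property the abstract claims for the method. How the paper selects among weighting schemes with RLO is not modeled here.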
Pages: 2349-2367
Page count: 18