A dual-balanced network for long-tail distribution object detection

被引:2
作者
Gong, Huiyun [1 ]
Li, Yeguang [2 ]
Dong, Jian [1 ,3 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[2] Management Changchun Univ Technol, Sch Econ, Jilin, Peoples R China
[3] China Elect Standardizat Inst, Beijing, Peoples R China
关键词
computer vision; learning (artificial intelligence); object detection; SMOTE;
D O I
10.1049/cvi2.12182
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection on datasets with imbalanced distributions (i.e. long-tail distributions) dataset is a significantly challenging task. Some re-balancing solutions, such as re-weighting and re-sampling have two main disadvantages. First, re-balancing strategies only utilise a coarse-grained global threshold to suppress some of the most influential categories, while overlooking locally influential categories. Second, very few studies have specifically designed algorithms for object detection tasks under long-tail distribution. To address these two issues, a dual-balanced network for fine-grained re-balancing object detection is proposed. Our re-balancing strategies are both in proposal and classification logic, corresponding to two sub-networks, the Balance Region Proposal Network (BRPN) and the Balance Classification Network (BCN). The BRPN sub-network equalises the number of proposals in the background and foreground by reducing the sampling probability of simple backgrounds, and the BCN sub-network equalises the logic between head and tail categories by globally suppressing negative gradients and locally fixing the over-suppressed negative gradients. In addition, the authors advise a balance binary cross entropy loss to jointly re-balance the entire network. This design can be generalised to different two-stage object detection frameworks. The experimental mAP result of 26.40% on this LVIS-v0.5 dataset outperforms most SOTA methods.
引用
收藏
页码:565 / 575
页数:11
相关论文
共 38 条
[11]  
Hsieh T.I., 2021, AAAI, V3, P15
[12]   Regressive Domain Adaptation for Unsupervised Keypoint Detection [J].
Jiang, Junguang ;
Ji, Yifei ;
Wang, Ximei ;
Liu, Yufeng ;
Wang, Jianmin ;
Long, Mingsheng .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :6776-6785
[13]   Towards Better Explanations of Class Activation Mapping [J].
Jung, Hyungsik ;
Oh, Youngrock .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :1316-1324
[14]  
Kang B., 2020, 8 INT C LEARN REPR I
[15]   Overcoming Classifier Imbalance for Long-tail Object Detection with Balanced Group Softmax [J].
Li, Yu ;
Wang, Tao ;
Kang, Bingyi ;
Tang, Sheng ;
Wang, Chunfeng ;
Li, Jintao ;
Feng, Jiashi .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :10988-10997
[16]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[17]   Exploratory Undersampling for Class-Imbalance Learning [J].
Liu, Xu-Ying ;
Wu, Jianxin ;
Zhou, Zhi-Hua .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2009, 39 (02) :539-550
[18]   Libra R-CNN: Towards Balanced Learning for Object Detection [J].
Pang, Jiangmiao ;
Chen, Kai ;
Shi, Jianping ;
Feng, Huajun ;
Ouyang, Wanli ;
Lin, Dahua .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :821-830
[19]   Class-Incremental Learning for Action Recognition in Videos [J].
Park, Jaeyoo ;
Kang, Minsoo ;
Han, Bohyung .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :13678-13687
[20]   Influence-Balanced Loss for Imbalanced Visual Classification [J].
Park, Seulki ;
Lim, Jongin ;
Jeon, Younghan ;
Choi, Jin Young .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :715-724