A dual-balanced network for long-tail distribution object detection

被引:0
|
作者
Gong, Huiyun [1 ]
Li, Yeguang [2 ]
Dong, Jian [1 ,3 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[2] Management Changchun Univ Technol, Sch Econ, Jilin, Peoples R China
[3] China Elect Standardizat Inst, Beijing, Peoples R China
关键词
computer vision; learning (artificial intelligence); object detection; SMOTE;
D O I
10.1049/cvi2.12182
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection on datasets with imbalanced distributions (i.e. long-tail distributions) dataset is a significantly challenging task. Some re-balancing solutions, such as re-weighting and re-sampling have two main disadvantages. First, re-balancing strategies only utilise a coarse-grained global threshold to suppress some of the most influential categories, while overlooking locally influential categories. Second, very few studies have specifically designed algorithms for object detection tasks under long-tail distribution. To address these two issues, a dual-balanced network for fine-grained re-balancing object detection is proposed. Our re-balancing strategies are both in proposal and classification logic, corresponding to two sub-networks, the Balance Region Proposal Network (BRPN) and the Balance Classification Network (BCN). The BRPN sub-network equalises the number of proposals in the background and foreground by reducing the sampling probability of simple backgrounds, and the BCN sub-network equalises the logic between head and tail categories by globally suppressing negative gradients and locally fixing the over-suppressed negative gradients. In addition, the authors advise a balance binary cross entropy loss to jointly re-balance the entire network. This design can be generalised to different two-stage object detection frameworks. The experimental mAP result of 26.40% on this LVIS-v0.5 dataset outperforms most SOTA methods.
引用
收藏
页码:565 / 575
页数:11
相关论文
共 41 条
  • [1] Logit Normalization for Long-Tail Object Detection
    Zhao, Liang
    Teng, Yao
    Wang, Limin
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (06) : 2114 - 2134
  • [2] Distance metric-based learning for long-tail object detection
    Shao, Mingwen
    Peng, Zilu
    IMAGE AND VISION COMPUTING, 2024, 142
  • [3] Model-Independent Approach For Long-Tail Object Detection In Aerial Imagery
    Haleem, Halar
    Bisio, Igor
    Garibotto, Chiara
    Lavagetto, Fabio
    Sciarrone, Andrea
    2024 IEEE ANNUAL CONGRESS ON ARTIFICIAL INTELLIGENCE OF THING, AIOT 2024, 2024, : 78 - 80
  • [4] YOLO-Parallel: Positive Gradient Modeling for Long-Tail Remote Sensing Object Detection
    Gao, Xiangyi
    Zhao, Danpei
    Yuan, Zhichao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [5] Long-tail Detection with Effective Class-Margins
    Cho, Jang Hyun
    Krahenbuhl, Philipp
    COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 698 - 714
  • [6] Improving the Performance of Object Detection by Preserving Balanced Class Distribution
    Lee, Heewon
    Ahn, Sangtae
    MATHEMATICS, 2023, 11 (21)
  • [7] Incorporating long-tail data in complex backgrounds for visual surface defect detection in PCBs
    Zhu, Liying
    Wang, Sen
    Chen, Mingfang
    Shen, Aiping
    Li, Xuangang
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (06) : 7591 - 7604
  • [8] Distribution-balanced augmentation for rough data driven object detection
    Wang Z.
    Tian L.
    Du Q.
    Sun Z.
    An Y.
    Liao W.
    Multimedia Tools and Applications, 2024, 83 (18) : 56103 - 56125
  • [9] Balanced Classification: A Unified Framework for Long-Tailed Object Detection
    Qi, Tianhao
    Xie, Hongtao
    Li, Pandeng
    Ge, Jiannan
    Zhang, Yongdong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3088 - 3101
  • [10] A dual neural network for object detection in UAV images
    Tian, Gangyi
    Liu, Jianran
    Yang, Wenyuan
    NEUROCOMPUTING, 2021, 443 : 292 - 301