A dual-balanced network for long-tail distribution object detection

被引:0
|
作者
Gong, Huiyun [1 ]
Li, Yeguang [2 ]
Dong, Jian [1 ,3 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[2] Management Changchun Univ Technol, Sch Econ, Jilin, Peoples R China
[3] China Elect Standardizat Inst, Beijing, Peoples R China
关键词
computer vision; learning (artificial intelligence); object detection; SMOTE;
D O I
10.1049/cvi2.12182
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection on datasets with imbalanced distributions (i.e. long-tail distributions) dataset is a significantly challenging task. Some re-balancing solutions, such as re-weighting and re-sampling have two main disadvantages. First, re-balancing strategies only utilise a coarse-grained global threshold to suppress some of the most influential categories, while overlooking locally influential categories. Second, very few studies have specifically designed algorithms for object detection tasks under long-tail distribution. To address these two issues, a dual-balanced network for fine-grained re-balancing object detection is proposed. Our re-balancing strategies are both in proposal and classification logic, corresponding to two sub-networks, the Balance Region Proposal Network (BRPN) and the Balance Classification Network (BCN). The BRPN sub-network equalises the number of proposals in the background and foreground by reducing the sampling probability of simple backgrounds, and the BCN sub-network equalises the logic between head and tail categories by globally suppressing negative gradients and locally fixing the over-suppressed negative gradients. In addition, the authors advise a balance binary cross entropy loss to jointly re-balance the entire network. This design can be generalised to different two-stage object detection frameworks. The experimental mAP result of 26.40% on this LVIS-v0.5 dataset outperforms most SOTA methods.
引用
收藏
页码:565 / 575
页数:11
相关论文
共 41 条
  • [31] Solo-to-Collaborative Dual-Attention Network for One-Shot Object Detection in Remote Sensing Images
    Li, Lingjun
    Yao, Xiwen
    Cheng, Gong
    Xu, Mingliang
    Han, Jungong
    Han, Junwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [32] LRAF-Net: Long-Range Attention Fusion Network for Visible-Infrared Object Detection
    Fu, Haolong
    Wang, Shixun
    Duan, Puhong
    Xiao, Changyan
    Dian, Renwei
    Li, Shutao
    Li, Zhiyong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (10) : 13232 - 13245
  • [33] SpermDet: Structure-Aware Network With Local Context Enhancement and Dual-Path Fusion for Object Detection in Sperm Images
    Zhang, Hongyu
    Hu, Zhujun
    Huang, Huaying
    Liu, Shuang
    Rao, Yunbo
    Wang, Qifei
    Ahmad, Naveed
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [34] Object drift determination network based on dual-template joint decision-making in long-term visual tracking
    Hou, Zhiqiang
    Zhao, Jiaxin
    Wang, Zhuo
    Ma, Sugang
    Yu, Wangsheng
    Fan, Jiulun
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 97
  • [35] Evaluation of a Dual Convolutional Neural Network Architecture for Object-wise Anomaly Detection in Cluttered X-ray Security Imagery
    Gauss, Yona Falinie A.
    Bhowmik, Neelanjan
    Akcay, Samet
    Guillen-Garcia, Paolo M.
    Barker, Jack W.
    Breckon, Toby P.
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [36] Dual Network Structure With Interweaved Global-Local Feature Hierarchy for Transformer-Based Object Detection in Remote Sensing Image
    Xue, Jingqian
    He, Da
    Liu, Mengwei
    Shi, Qian
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 6856 - 6866
  • [37] Multi-Modal Object Detection Method Based on Dual-Branch Asymmetric Attention Backbone and Feature Fusion Pyramid Network
    Wang, Jinpeng
    Su, Nan
    Zhao, Chunhui
    Yan, Yiming
    Feng, Shou
    REMOTE SENSING, 2024, 16 (20)
  • [38] MFDAFF-Net: Multiscale Frequency-Aware and Dual Attention-Guided Feature Fusion Network for UAV Imagery Object Detection
    Tian, Shu
    Zhang, Bingxi
    Cao, Lin
    Kang, Lihong
    Tian, Jing
    Xing, Xiangwei
    Shen, Bo
    Fan, Chunzhuo
    Du, Kangning
    Fu, Chong
    Zhang, Ye
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 10640 - 10656
  • [39] Active Object Detection Based on a Novel Deep Q-Learning Network and Long-Term Learning Strategy for the Service Robot
    Liu, Shaopeng
    Tian, Guohui
    Zhang, Ying
    Zhang, Mengyang
    Liu, Shuo
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2022, 69 (06) : 5984 - 5993
  • [40] DREB-Net: Dual-Stream Restoration Embedding Blur-Feature Fusion Network for High-Mobility UAV Object Detection
    Li, Qingpeng
    Zhang, Yuxin
    Fang, Leyuan
    Kang, Yuhan
    Li, Shutao
    Xiang Zhu, Xiao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63