Distance metric-based learning for long-tail object detection

Times cited: 0
Authors
Shao, Mingwen [1, 2]
Peng, Zilu [2 ]
Affiliations
[1] Quanzhou Vocat & Tech Univ, Natl Sci Digital Ind Coll, Jinjiang 362000, Peoples R China
[2] China Univ Petr, Sch Comp Sci & Technol, Qingdao 266580, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Deep convolutional neural network; Object detection; Long-tail distribution; Metric learning; Feature extraction; SMOTE
DOI
10.1016/j.imavis.2023.104888
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Despite the recent success of general object detection, almost all models perform unsatisfactorily on long-tail datasets. The main cause of this performance degradation is the imbalance in the number of positive samples between categories. Traditional approaches can distort the classification feature space, which in turn seriously affects the classification ability of the network. To address these issues, we propose a novel distance metric-based learning approach for long-tail object detection (LTDL). Specifically, we use the feature space directly as the optimization target, which yields clearer decision boundaries between classes. To optimize the decision boundary, we adjust the intra-class and inter-class distances with a Margin Module (MAM). To further exploit the information provided by the dataset, we introduce label supervision for distance weighting through a Semantic Module (SEM). In addition, to protect the learning of tail samples and optimize the classifier, we propose a Distance-based Equilibrium Loss (DEL). Extensive experiments on the LVIS benchmark demonstrate the strength of the proposed approach: our method improves the baseline by 2.9% AP, and our best model outperforms almost all other representative methods.
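The abstract gives no formulas, so the following is only a generic sketch of the idea of adjusting intra-class and inter-class distances with a margin. It is not the paper's MAM, SEM, or DEL. The function name `margin_distance_loss`, the use of class prototypes, the squared Euclidean distance, and the additive margin are all assumptions made for illustration.

```python
import math


def margin_distance_loss(features, prototypes, labels, margin=0.5):
    """Cross-entropy over negative squared distances to class prototypes.

    Hypothetical formulation (not taken from the paper): the margin is added
    to the true-class distance, so a sample must sit closer to its own
    prototype than to the others by a gap before the loss becomes small.
    """
    total = 0.0
    for feature, label in zip(features, labels):
        # Squared Euclidean distance from this feature to every prototype.
        dists = [
            sum((a - b) ** 2 for a, b in zip(feature, proto))
            for proto in prototypes
        ]
        # Handicap the true class by the margin.
        dists[label] += margin
        # A closer prototype gives a higher score.
        logits = [-d for d in dists]
        # Numerically stable log-sum-exp.
        peak = max(logits)
        log_z = peak + math.log(sum(math.exp(v - peak) for v in logits))
        # Negative log-probability of the true class.
        total += log_z - logits[label]
    return total / len(features)


if __name__ == "__main__":
    feats = [[0.0, 0.0], [1.0, 1.0]]
    protos = [[0.0, 0.0], [1.0, 1.0]]
    labels = [0, 1]
    print(margin_distance_loss(feats, protos, labels, margin=0.0))
    print(margin_distance_loss(feats, protos, labels, margin=0.5))
```

With features placed exactly on their own prototypes, the second call returns a larger loss than the first, which shows how the margin keeps pushing same-class features closer and other-class prototypes further away even when the plain distance classifier is already correct.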
Pages: 9
Related papers
50 in total
  • [21] BDR-Net: Bhattacharyya Distance-Based Distribution Metric Modeling for Rotating Object Detection in Remote Sensing
    Wang, Haining
    Liao, Yurong
    Li, Yang
    Fang, Yuqiang
    Ni, Shuyan
    Luo, Yalun
    Jiang, Bitao
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [22] SEMI-SUPERVISED DISTANCE METRIC LEARNING FOR VISUAL OBJECT CLASSIFICATION
    Cevikalp, Hakan
    Paredes, Roberto
    VISAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 1, 2009, : 315 - +
  • [23] Meta-learning Advisor Networks for Long-tail and Noisy Labels in Social Image Classification
    Ricci, Simone
    Uricchio, Tiberio
    Del Bimbo, Alberto
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (05)
  • [24] Metric learning based object recognition and retrieval
    Yang, Jianyu
    Xu, Haoran
    NEUROCOMPUTING, 2016, 190 : 70 - 81
  • [25] Feature Contrastive Transfer Learning for Few-Shot Long-Tail Sonar Image Classification
    Bai, Zhongyu
    Xu, Hongli
    Ding, Qichuan
    Zhang, Xiangyue
    IEEE COMMUNICATIONS LETTERS, 2025, 29 (03) : 562 - 566
  • [26] A Deeply Supervised Attention Metric-Based Network and an Open Aerial Image Dataset for Remote Sensing Change Detection
    Shi, Qian
    Liu, Mengxi
    Li, Shengchen
    Liu, Xiaoping
    Wang, Fei
    Zhang, Liangpei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [27] Survey of Object Detection Based on Deep Learning
    Luo H.-L.
    Chen H.-K.
    1600, Chinese Institute of Electronics (48): : 1230 - 1239
  • [28] Dynamic Subclass-Balancing Contrastive Learning for Long-Tail Pedestrian Trajectory Prediction With Progressive Refinement
    Yang, Biao
    Yan, Kai
    Hu, Chuan
    Hu, Hongyu
    Yu, Zhitao
    Ni, Rongrong
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 8645 - 8658
  • [29] Metric-Based Learning for Nearest-Neighbor Few-Shot Image Classification
    Lee, Min Jun
    So, Jungmin
    35TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2021), 2021, : 460 - 464
  • [30] Sampling-invariant fully metric learning for few-shot object detection
    Leng, Jiaxu
    Chen, Taiyue
    Gao, Xinbo
    Mo, Mengjingcheng
    Yu, Yongtao
    Zhang, Yan
    NEUROCOMPUTING, 2022, 511 : 54 - 66