Distance metric-based learning for long-tail object detection

被引:0
|
作者
Shao, Mingwen [1 ,2 ]
Peng, Zilu [2 ]
机构
[1] Quanzhou Vocat & Tech Univ, Natl Sci Digital Ind Coll, Jinjiang 362000, Peoples R China
[2] China Univ Petr, Sch Comp Sci & Technol, Qingdao 266580, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep convolutional neural network; Object detection; Long -tail distribution; Metric learning; Feature extraction; SMOTE;
D O I
10.1016/j.imavis.2023.104888
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the recent success of general object detection, almost all models perform unsatisfactorily on long-tail datasets. The main cause of performance degradation is the imbalance in the number of positive samples between categories. The traditional approaches can lead to distortion of the classification feature space, which in turn can seriously affect the classification ability of the network. To address the above issues, we propose a novel distance metric-based learning approach for long-tail object detection (LTDL) in this paper. Specifically, we directly use the feature space as the optimization target, thus allowing clearer decision boundaries between classes. In order to optimize the decision boundary, we adjust the intra-class and inter-class distances by Margin Module (MAM). Meanwhile, to further exploit the information provided by the dataset, we introduce supervised information of labels for distance weighting using the Semantic Module (SEM). In addition, to protect the learning of tail samples and optimize the classifier, we propose a Distance-based Equilibrium Loss (DEL). Extensive experiments conducted on the LVIS benchmark have demonstrated the strength of our proposed approach. The experimental results show that our method improves the baseline by 2.9% AP. And our best model can outperform almost all other representative methods.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Logit Normalization for Long-Tail Object Detection
    Zhao, Liang
    Teng, Yao
    Wang, Limin
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (06) : 2114 - 2134
  • [2] A dual-balanced network for long-tail distribution object detection
    Gong, Huiyun
    Li, Yeguang
    Dong, Jian
    IET COMPUTER VISION, 2023, 17 (05) : 565 - 575
  • [3] Model-Independent Approach For Long-Tail Object Detection In Aerial Imagery
    Haleem, Halar
    Bisio, Igor
    Garibotto, Chiara
    Lavagetto, Fabio
    Sciarrone, Andrea
    2024 IEEE ANNUAL CONGRESS ON ARTIFICIAL INTELLIGENCE OF THING, AIOT 2024, 2024, : 78 - 80
  • [4] YOLO-Parallel: Positive Gradient Modeling for Long-Tail Remote Sensing Object Detection
    Gao, Xiangyi
    Zhao, Danpei
    Yuan, Zhichao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [5] Long-tail Detection with Effective Class-Margins
    Cho, Jang Hyun
    Krahenbuhl, Philipp
    COMPUTER VISION, ECCV 2022, PT VIII, 2022, 13668 : 698 - 714
  • [6] Gaussian Combined Distance: A Generic Metric for Object Detection
    Guan, Ziqian
    Fu, Xieyi
    Huang, Pengjun
    Zhang, Hengyuan
    Du, Hubin
    Liu, Yongtao
    Wang, Yinglin
    Ma, Qang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [7] Metric-Based Few-Shot Transfer Learning Approach for Voice Pathology Detection
    Won, Jong-Ho
    Kim, Deok-Hwan
    IEEE ACCESS, 2024, 12 : 159226 - 159238
  • [8] Distorted Vehicle Detection and Distance Estimation by Metric Learning-Based SSD
    Zhang, Fanghui
    Jin, Yi
    Kan, Shichao
    Zhang, Linna
    Cen, Yigang
    Jin, Wen
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 1426 - 1437
  • [9] Fast Supervised Selection of Prototypes for Metric-Based Learning
    Belanche, Lluis A.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT II, 2018, 11140 : 577 - 586
  • [10] Local Metric Learning for Exemplar-Based Object Detection
    You, Xinge
    Li, Qiang
    Tao, Dacheng
    Ou, Weihua
    Gong, Mingming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2014, 24 (08) : 1265 - 1276