Gaussian Combined Distance: A Generic Metric for Object Detection

被引:0
|
作者
Guan, Ziqian [1 ]
Fu, Xieyi [1 ]
Huang, Pengjun [1 ]
Zhang, Hengyuan [1 ]
Du, Hubin [1 ]
Liu, Yongtao [1 ]
Wang, Yinglin [2 ]
Ma, Qang [2 ]
机构
[1] North China Inst Sci & Technol, Key Lab Special Robots Safety Prod & Emergency Dis, Langfang 065201, Peoples R China
[2] Hegang Ind Technol Serv Co Ltd, Langfang 065008, Peoples R China
关键词
Measurement; Object detection; Feature extraction; Optimization; Detectors; Geoscience and remote sensing; Accuracy; Training; Sensitivity; Convergence; Generic metric; tiny object detection;
D O I
10.1109/LGRS.2025.3531970
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In object detection, a well-defined similarity metric can significantly enhance the model performance. Currently, the intersection over union (IoU)-based similarity metric is the most commonly preferred choice for detectors. However, detectors using IoU as a similarity metric often perform poorly when detecting small objects because of their sensitivity to minor positional deviations. To address this issue, recent studies have proposed the Wasserstein distance (WD) as an alternative to IoU for measuring the similarity of Gaussian-distributed bounding boxes. However, we have observed that the WD lacks scale invariance, which negatively impacts the model's generalization capability. In addition, when used as a loss function, its independent optimization of the center attributes leads to slow model convergence and unsatisfactory detection precision. To address these challenges, we introduce the Gaussian Combined Distance (GCD). Through analytical examination of GCD and its gradient, we demonstrate that GCD not only possesses scale invariance but also facilitates joint optimization, which enhances model localization performance. Extensive experiments on the AI-TOD-v2 dataset for tiny object detection show that GCD, as a bounding box regression loss function and label assignment metric, achieves state-of-the-art (SOTA) performance across various detectors. We further validated the generalizability of GCD on the MS-COCO-2017 and Visdrone-2019 datasets, where it outperforms the WD across diverse scales of datasets. The code is available at: https://github.com/MArKkwanGuan/mmdet-GCD.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Dense Information Learning Based Semi-Supervised Object Detection
    Yang, Xi
    Li, Penghui
    Zhou, Qiubai
    Wang, Nannan
    Gao, Xinbo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 1022 - 1035
  • [42] Occlusion Handling in Generic Object Detection: A Review
    Saleh, Kaziwa
    Szenasi, Sandor
    Vamossy, Zoltan
    2021 IEEE 19TH WORLD SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI 2021), 2021, : 477 - 484
  • [43] Neighborhood sampling confidence metric for object detection
    Christophe Gouguenheim
    Ahmad Berjaoui
    AI and Ethics, 2024, 4 (1): : 57 - 64
  • [44] Object Detection and Distance Measurement in Teleoperation
    Zhang, Ailing
    Chu, Meng
    Chen, Zixin
    Zhou, Fuqiang
    Gao, Shuo
    MACHINES, 2022, 10 (05)
  • [45] Focal Loss for Dense Object Detection
    Lin, Tsung-Yi
    Goyal, Priya
    Girshick, Ross
    He, Kaiming
    Dollar, Piotr
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) : 318 - 327
  • [46] Knowledge Amalgamation for Object Detection With Transformers
    Zhang, Haofei
    Mao, Feng
    Xue, Mengqi
    Fang, Gongfan
    Feng, Zunlei
    Song, Jie
    Song, Mingli
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 2093 - 2106
  • [47] Object Detection: Training From Scratch
    Zhao, Kai
    Zhou, Yan
    Chen, Xin
    IEEE ACCESS, 2020, 8 : 157520 - 157529
  • [48] Save the Tiny, Save the All: Hierarchical Activation Network for Tiny Object Detection
    Guo, Guangqian
    Chen, Pengfei
    Yu, Xuehui
    Han, Zhenjun
    Ye, Qixiang
    Gao, Shan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 221 - 234
  • [49] DHLA: Dynamic Hybrid Label Assignment for End-to-End Object Detection
    Hu, Zhiliang
    Chen, Si
    Hua, Yang
    Wang, Da-Han
    Zhu, Shunzhi
    Yan, Yan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (02) : 1055 - 1069
  • [50] An Efficient Intersection Over Union Algorithm for 3D Object Detection
    Mohammed, Sazan Ali Kamal
    Razak, Mohd Zulhakimi Ab
    Abd Rahman, Abdul Hadi
    Abu Bakar, Maria
    IEEE ACCESS, 2024, 12 : 169768 - 169786