Gaussian Combined Distance: A Generic Metric for Object Detection

Authors
Guan, Ziqian [1]
Fu, Xieyi [1]
Huang, Pengjun [1]
Zhang, Hengyuan [1]
Du, Hubin [1]
Liu, Yongtao [1]
Wang, Yinglin [2]
Ma, Qang [2]
Affiliations
[1] North China Inst Sci & Technol, Key Lab Special Robots Safety Prod & Emergency Dis, Langfang 065201, Peoples R China
[2] Hegang Ind Technol Serv Co Ltd, Langfang 065008, Peoples R China
Keywords
Measurement; Object detection; Feature extraction; Optimization; Detectors; Geoscience and remote sensing; Accuracy; Training; Sensitivity; Convergence; Generic metric; Tiny object detection
DOI
10.1109/LGRS.2025.3531970
Chinese Library Classification (CLC)
P3 [Geophysics]; P59 [Geochemistry]
Discipline classification codes
0708; 070902
Abstract
In object detection, a well-defined similarity metric can significantly enhance model performance. Currently, intersection over union (IoU) is the most commonly used similarity metric for detectors. However, detectors that use IoU as a similarity metric often perform poorly on small objects because, for small boxes, IoU is highly sensitive to minor positional deviations. To address this issue, recent studies have proposed the Wasserstein distance (WD) as an alternative to IoU for measuring the similarity of bounding boxes modeled as Gaussian distributions. However, we have observed that the WD lacks scale invariance, which negatively impacts the model's generalization capability. In addition, when used as a loss function, its independent optimization of the center attributes leads to slow model convergence and unsatisfactory detection precision. To address these challenges, we introduce the Gaussian Combined Distance (GCD). Through analytical examination of GCD and its gradient, we demonstrate that GCD not only possesses scale invariance but also facilitates joint optimization, which enhances model localization performance. Extensive experiments on the AI-TOD-v2 dataset for tiny object detection show that GCD, as a bounding box regression loss function and label assignment metric, achieves state-of-the-art (SOTA) performance across various detectors. We further validated the generalizability of GCD on the MS-COCO-2017 and VisDrone-2019 datasets, where it outperforms the WD across datasets of diverse scales. The code is available at: https://github.com/MArKkwanGuan/mmdet-GCD.
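The two observations the abstract starts from (IoU is sensitive to small shifts on tiny boxes, and WD is not scale invariant) can be checked numerically. The sketch below is illustrative only and is not taken from the authors' repository. It assumes boxes are given as (cx, cy, w, h) and modeled as Gaussians with mean (cx, cy) and covariance diag(w^2/4, h^2/4), a convention common in WD-based detection work. The function names `iou`, `wasserstein2_sq`, and `scale_box` are hypothetical. It does not implement GCD, whose formula is not given in this record.

```python
def iou(a, b):
    """IoU of two axis-aligned boxes given as (cx, cy, w, h)."""
    ax1, ay1 = a[0] - a[2] / 2, a[1] - a[3] / 2
    ax2, ay2 = a[0] + a[2] / 2, a[1] + a[3] / 2
    bx1, by1 = b[0] - b[2] / 2, b[1] - b[3] / 2
    bx2, by2 = b[0] + b[2] / 2, b[1] + b[3] / 2
    # Overlap extent along each axis, clamped at zero for disjoint boxes.
    iw = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    ih = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = iw * ih
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union


def wasserstein2_sq(a, b):
    """Squared 2-Wasserstein distance between the Gaussians
    N((cx, cy), diag(w^2/4, h^2/4)) assumed for boxes a and b."""
    return (
        (a[0] - b[0]) ** 2
        + (a[1] - b[1]) ** 2
        + ((a[2] - b[2]) / 2) ** 2
        + ((a[3] - b[3]) / 2) ** 2
    )


def scale_box(box, s):
    """Scale a box (and its position) by a factor s, as if the image were resized."""
    return tuple(s * v for v in box)


# The same 2-pixel shift applied to a tiny box and to a large box.
tiny_a, tiny_b = (10.0, 10.0, 4.0, 4.0), (12.0, 10.0, 4.0, 4.0)
big_a, big_b = (100.0, 100.0, 64.0, 64.0), (102.0, 100.0, 64.0, 64.0)
print("IoU, tiny boxes:", iou(tiny_a, tiny_b))
print("IoU, large boxes:", iou(big_a, big_b))

# Resizing both boxes by s leaves IoU unchanged but multiplies W2^2 by s^2.
s = 4.0
print("IoU after rescale:", iou(scale_box(tiny_a, s), scale_box(tiny_b, s)))
print("W2^2 before / after rescale:",
      wasserstein2_sq(tiny_a, tiny_b),
      wasserstein2_sq(scale_box(tiny_a, s), scale_box(tiny_b, s)))
```

Under these assumptions, the 2-pixel shift costs the 4x4 box most of its overlap while barely affecting the 64x64 box, and the WD value changes with image scale even though the relative geometry of the two boxes is identical.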
Pages: 5