DILA: Dynamic Gaussian Distribution Fitting and Imitation Learning-Based Label Assignment for tiny object detection

被引:0
作者
Chen, Penglei [1 ,2 ]
Wang, Jiangtao [1 ,2 ]
Zhang, Zhiwei [1 ]
He, Cheng [1 ]
机构
[1] Huaibei Normal Univ, Sch Phys & Elect Informat, Huaibei 235000, Peoples R China
[2] Huaibei Normal Univ, Anhui Prov Key Lab Intelligent Comp & Applicat, Huaibei 235000, Peoples R China
关键词
Label assignment; Tiny object detection; Effective receptive field; Receptive field distance; NET;
D O I
10.1016/j.asoc.2024.111980
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid advancement of deep learning technology, significant achievements have been made in the field of general object detection. However, challenges still remain in the detection of tiny objects. There are two main drawbacks: (1) static prior information makes the localization of tiny objects relatively fixed; (2) Intersection over Union (IoU) is highly sensitive to deviations in tiny objects. To this end, we propose a Dynamic Gaussian Distribution Fitting and Imitation Learning-Based Label Assignment (DILA) strategy for Tiny Object Detection. Specifically, to address the positional deviation of effective receptive fields at different network layers during Gaussian modeling, DILA first designs an Adaptive Dynamic Calculation Strategy (ADCS) to compute estimation factors for the effective receptive field in different feature spaces, dynamically modeling prior information using Gaussian distribution. Then, DILA introduces a new Balance of Gaussian Scaling-aware Metric (BGSM) to measure the similarity between tiny bounding boxes and predefined anchors, instead of using IoU, which is highly sensitive to tiny pixel shifts, for sample assignment, thereby providing a more accurate basis for label assignment. Finally, a Detail Information Imitation Compensation Module (DIM) is presented to improve and compensate for the detailed information of tiny objects that troubles label assignment, achieving balanced learning for tiny objects. The proposed DILA strategy can be seamlessly integrated into various anchor-based detectors. Extensive experiments were conducted on three publicly available datasets for tiny object detection. The results indicate that when DILA is embedded into Faster RCNN, it outperforms other stateof-the-art methods in terms of detection performance for tiny objects, achieving an improvement in average precision of 10.3%, 1.5%, and 4.2% on AI-TOD, SODA-D, and VisDrone2019, respectively. The source codes and results are available at: https://github.com/chnu-cpl/DILA.
引用
收藏
页数:14
相关论文
共 51 条
  • [1] Cascade R-CNN: Delving into High Quality Object Detection
    Cai, Zhaowei
    Vasconcelos, Nuno
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6154 - 6162
  • [2] Towards Large-Scale Small Object Detection: Survey and Benchmarks
    Cheng, Gong
    Yuan, Xiang
    Yao, Xiwen
    Yan, Kebing
    Zeng, Qinghua
    Xie, Xingxing
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13467 - 13488
  • [3] RRNet: Relational Reasoning Network With Parallel Multiscale Attention for Salient Object Detection in Optical Remote Sensing Images
    Cong, Runmin
    Zhang, Yumo
    Fang, Leyuan
    Li, Jun
    Zhao, Yao
    Kwong, Sam
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [4] Small Object Detection in Remote Sensing Images Based on Super-Resolution with Auxiliary Generative Adversarial Networks
    Courtrai, Luc
    Minh-Tan Pham
    Lefevre, Sebastien
    [J]. REMOTE SENSING, 2020, 12 (19) : 1 - 19
  • [5] Dynamic Head: Unifying Object Detection Heads with Attentions
    Dai, Xiyang
    Chen, Yinpeng
    Xiao, Bin
    Chen, Dongdong
    Liu, Mengchen
    Yuan, Lu
    Zhang, Lei
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7369 - 7378
  • [6] VisDrone-DET2019: The Vision Meets Drone Object Detection in Image Challenge Results
    Du, Dawei
    Zhu, Pengfei
    Wen, Longyin
    Bian, Xiao
    Ling, Haibin
    Hu, Qinghua
    Peng, Tao
    Zheng, Jiayu
    Wang, Xinyao
    Zhang, Yue
    Bo, Liefeng
    Shi, Hailin
    Zhu, Rui
    Kumar, Aashish
    Li, Aijin
    Zinollayev, Almaz
    Askergaliyev, Anuar
    Schumann, Arne
    Mao, Binjie
    Lee, Byeongwon
    Liu, Chang
    Chen, Changrui
    Pan, Chunhong
    Huo, Chunlei
    Yu, Da
    Cong, Dechun
    Zeng, Dening
    Pailla, Dheeraj Reddy
    Li, Di
    Wang, Dong
    Cho, Donghyeon
    Zhang, Dongyu
    Bai, Furui
    Jose, George
    Gao, Guangyu
    Liu, Guizhong
    Xiong, Haitao
    Qi, Hao
    Wang, Haoran
    Qiu, Heqian
    Li, Hongliang
    Lu, Huchuan
    Kim, Ildoo
    Kim, Jaekyum
    Shen, Jane
    Lee, Jihoon
    Ge, Jing
    Xu, Jingjing
    Zhou, Jingkai
    Meier, Jonas
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 213 - 226
  • [7] Coarse-grained Density Map Guided Object Detection in Aerial Images
    Duan, Chengzhen
    Wei, Zhiwei
    Zhang, Chi
    Qu, Siying
    Wang, Hongpeng
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 2789 - 2798
  • [8] CenterNet: Keypoint Triplets for Object Detection
    Duan, Kaiwen
    Bai, Song
    Xie, Lingxi
    Qi, Honggang
    Huang, Qingming
    Tian, Qi
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6568 - 6577
  • [9] Gaussian similarity-based adaptive dynamic label assignment for tiny object detection
    Fu, Ronghao
    Chen, Chengcheng
    Yan, Shuang
    Heidari, Ali Asghar
    Wang, Xianchang
    Escorcia-Gutierrez, Jose
    Mansour, Romany F.
    Chene, Huiling
    [J]. NEUROCOMPUTING, 2023, 543
  • [10] Ge Z, 2021, Arxiv, DOI [arXiv:2107.08430, DOI 10.48550/ARXIV.2107.08430]