DILA: Dynamic Gaussian Distribution Fitting and Imitation Learning-Based Label Assignment for tiny object detection

被引：0

作者：

Chen, Penglei ^{[1
,2
]}

Wang, Jiangtao ^{[1
,2
]}

Zhang, Zhiwei ^{[1
]}

He, Cheng ^{[1
]}

机构：

[1] Huaibei Normal Univ, Sch Phys & Elect Informat, Huaibei 235000, Peoples R China

[2] Huaibei Normal Univ, Anhui Prov Key Lab Intelligent Comp & Applicat, Huaibei 235000, Peoples R China

来源：

APPLIED SOFT COMPUTING | 2024年 / 164卷

关键词：

Label assignment; Tiny object detection; Effective receptive field; Receptive field distance; NET;

D O I：

10.1016/j.asoc.2024.111980

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the rapid advancement of deep learning technology, significant achievements have been made in the field of general object detection. However, challenges still remain in the detection of tiny objects. There are two main drawbacks: (1) static prior information makes the localization of tiny objects relatively fixed; (2) Intersection over Union (IoU) is highly sensitive to deviations in tiny objects. To this end, we propose a Dynamic Gaussian Distribution Fitting and Imitation Learning-Based Label Assignment (DILA) strategy for Tiny Object Detection. Specifically, to address the positional deviation of effective receptive fields at different network layers during Gaussian modeling, DILA first designs an Adaptive Dynamic Calculation Strategy (ADCS) to compute estimation factors for the effective receptive field in different feature spaces, dynamically modeling prior information using Gaussian distribution. Then, DILA introduces a new Balance of Gaussian Scaling-aware Metric (BGSM) to measure the similarity between tiny bounding boxes and predefined anchors, instead of using IoU, which is highly sensitive to tiny pixel shifts, for sample assignment, thereby providing a more accurate basis for label assignment. Finally, a Detail Information Imitation Compensation Module (DIM) is presented to improve and compensate for the detailed information of tiny objects that troubles label assignment, achieving balanced learning for tiny objects. The proposed DILA strategy can be seamlessly integrated into various anchor-based detectors. Extensive experiments were conducted on three publicly available datasets for tiny object detection. The results indicate that when DILA is embedded into Faster RCNN, it outperforms other stateof-the-art methods in terms of detection performance for tiny objects, achieving an improvement in average precision of 10.3%, 1.5%, and 4.2% on AI-TOD, SODA-D, and VisDrone2019, respectively. The source codes and results are available at: https://github.com/chnu-cpl/DILA.

引用

页数：14

共 51 条

[1] Cascade R-CNN: Delving into High Quality Object Detection
Cai, Zhaowei
Vasconcelos, Nuno
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6154 - 6162
[2] Towards Large-Scale Small Object Detection: Survey and Benchmarks
Cheng, Gong
Yuan, Xiang
Yao, Xiwen
Yan, Kebing
Zeng, Qinghua
Xie, Xingxing
Han, Junwei
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13467 - 13488
[3] RRNet: Relational Reasoning Network With Parallel Multiscale Attention for Salient Object Detection in Optical Remote Sensing Images
Cong, Runmin
Zhang, Yumo
Fang, Leyuan
Li, Jun
Zhao, Yao
Kwong, Sam
[J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[4] Small Object Detection in Remote Sensing Images Based on Super-Resolution with Auxiliary Generative Adversarial Networks
Courtrai, Luc
Minh-Tan Pham
Lefevre, Sebastien
[J]. REMOTE SENSING, 2020, 12 (19) : 1 - 19
[5] Dynamic Head: Unifying Object Detection Heads with Attentions
Dai, Xiyang
Chen, Yinpeng
Xiao, Bin
Chen, Dongdong
Liu, Mengchen
Yuan, Lu
Zhang, Lei
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7369 - 7378
[6] VisDrone-DET2019: The Vision Meets Drone Object Detection in Image Challenge Results
Du, Dawei
Zhu, Pengfei
Wen, Longyin
Bian, Xiao
Ling, Haibin
Hu, Qinghua
Peng, Tao
Zheng, Jiayu
Wang, Xinyao
Zhang, Yue
Bo, Liefeng
Shi, Hailin
Zhu, Rui
Kumar, Aashish
Li, Aijin
Zinollayev, Almaz
Askergaliyev, Anuar
Schumann, Arne
Mao, Binjie
Lee, Byeongwon
Liu, Chang
Chen, Changrui
Pan, Chunhong
Huo, Chunlei
Yu, Da
Cong, Dechun
Zeng, Dening
Pailla, Dheeraj Reddy
Li, Di
Wang, Dong
Cho, Donghyeon
Zhang, Dongyu
Bai, Furui
Jose, George
Gao, Guangyu
Liu, Guizhong
Xiong, Haitao
Qi, Hao
Wang, Haoran
Qiu, Heqian
Li, Hongliang
Lu, Huchuan
Kim, Ildoo
Kim, Jaekyum
Shen, Jane
Lee, Jihoon
Ge, Jing
Xu, Jingjing
Zhou, Jingkai
Meier, Jonas
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 213 - 226
[7] Coarse-grained Density Map Guided Object Detection in Aerial Images
Duan, Chengzhen
Wei, Zhiwei
Zhang, Chi
Qu, Siying
Wang, Hongpeng
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 2789 - 2798
[8] CenterNet: Keypoint Triplets for Object Detection
Duan, Kaiwen
Bai, Song
Xie, Lingxi
Qi, Honggang
Huang, Qingming
Tian, Qi
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6568 - 6577
[9] Gaussian similarity-based adaptive dynamic label assignment for tiny object detection
Fu, Ronghao
Chen, Chengcheng
Yan, Shuang
Heidari, Ali Asghar
Wang, Xianchang
Escorcia-Gutierrez, Jose
Mansour, Romany F.
Chene, Huiling
[J]. NEUROCOMPUTING, 2023, 543
[10] Ge Z, 2021, Arxiv, DOI [arXiv:2107.08430, DOI 10.48550/ARXIV.2107.08430]

← 1 2 3 4 5 6 →