SSRDet: Small Object Detection Based on Feature Pyramid Network

被引:5
作者
Zhang, Lijuan [1 ,2 ]
Wang, Minhui [2 ]
Jiang, Yutong [3 ]
Li, Dongming [1 ]
Zhou, Yue [3 ]
机构
[1] Wuxi Univ, Coll Internet Things Engn, Wuxi 214105, Jiangsu, Peoples R China
[2] Changchun Univ Technol, Sch Comp Sci & Engn, Changchun 130012, Jilin, Peoples R China
[3] China North Vehicle Res Inst, Beijing 100072, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Feature extraction; Detectors; Training; Semantics; Data augmentation; Gaussian distribution; Labeling; Small object detection; attention module; feature pyramid network; label assignment;
D O I
10.1109/ACCESS.2023.3306242
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the increasing presence of small objects in videos or images from practical applications, small object identification is currently an extremely popular topic in the field of machine vision. Additionally, small object detection is still a difficult process because to small objects' issues with fuzzy appearance, limited information, occlusion, and noise present. Most existing methods mainly use feature pyramid networks to enrich shallow features using contextual features. However, due to the inconsistency of gradients between different layers of the feature pyramid network, the shallow features cannot be fully utilized resulting in the slow improvement of small object detection accuracy. To effectively improve the small object detection algorithm, we propose a new feature pyramid network-based small object detection algorithm, SSRDet. To effectively assign positive and negative sample labels and address the issue of sample scale imbalance, we first present RFLA. Then, to overcome the gradient inconsistency between various layers and enable the full utilization of the shallow features, we extend the feature pyramid network by including a scale enhancement module (SEM) and a scale selection module (SSM). Finally, we introduced the attention module (SPAM) to filter out the background noise in the shallow feature extraction to better extract small object features. We validated our method on VisDrone2019 and AI-TOD, and our method outperformed the state-of-the-art detectors.
引用
收藏
页码:96743 / 96752
页数:10
相关论文
共 36 条
[11]  
Kisantal M, 2019, Arxiv, DOI arXiv:1902.07296
[12]   Perceptual Generative Adversarial Networks for Small Object Detection [J].
Li, Jianan ;
Liang, Xiaodan ;
Wei, Yunchao ;
Xu, Tingfa ;
Feng, Jiashi ;
Yan, Shuicheng .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1951-1959
[13]   Contextual Transformer Networks for Visual Recognition [J].
Li, Yehao ;
Yao, Ting ;
Pan, Yingwei ;
Mei, Tao .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) :1489-1500
[14]   Focal Loss for Dense Object Detection [J].
Lin, Tsung-Yi ;
Goyal, Priya ;
Girshick, Ross ;
He, Kaiming ;
Dollar, Piotr .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2999-3007
[15]   Feature Pyramid Networks for Object Detection [J].
Lin, Tsung-Yi ;
Dollar, Piotr ;
Girshick, Ross ;
He, Kaiming ;
Hariharan, Bharath ;
Belongie, Serge .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :936-944
[16]  
Liu H., 2021, arXiv
[17]   Path Aggregation Network for Instance Segmentation [J].
Liu, Shu ;
Qi, Lu ;
Qin, Haifang ;
Shi, Jianping ;
Jia, Jiaya .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8759-8768
[18]   SSD: Single Shot MultiBox Detector [J].
Liu, Wei ;
Anguelov, Dragomir ;
Erhan, Dumitru ;
Szegedy, Christian ;
Reed, Scott ;
Fu, Cheng-Yang ;
Berg, Alexander C. .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :21-37
[19]   IPG-Net: Image Pyramid Guidance Network for Small Object Detection [J].
Liu, Ziming ;
Gao, Guangyu ;
Sun, Lin ;
Fang, Li .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :4422-4430
[20]   Grid R-CNN [J].
Lu, Xin ;
Li, Buyu ;
Yue, Yuxin ;
Li, Quanquan ;
Yan, Junjie .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7355-7364