Small object detection using deep feature learning and feature fusion network

被引:16
作者
Tong, Kang [1 ]
Wu, Yiquan [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Elect & Informat Engn, Nanjing 211106, Peoples R China
关键词
Small object detection; SeaDefine; Deep feature learning; Feature fusion; Multi-scale strategy; SEGMENTATION;
D O I
10.1016/j.engappai.2024.107931
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Small object detection is a fundamental and challenging issue in computer vision. We believe that there are two factors that affect the performance of small object detection: small object dataset and small object itself. In terms of datasets, we introduce a dataset named SeaDefine, which opens up a new direction for small object detection in maritime environment. For the small object itself, we utilize deep feature learning and feature fusion network (DFLFFN) to help detect objects. Concretely, the designed deep feature learning module (DFLM) at the singlelayer level can describe objects for a variety of scenarios through activating multi-scale receptive fields over a wider scope. Meanwhile, to intensify classification capacity of small objects, the shallow features with rich details will be integrated with the deep features generated from the DFLM by introducing feature fusion block (FFB). In addition, we analyze the multi-scale strategy from a mathematical perspective to a certain extent. A large number of results in the experiments show that proposed DFLFFN achieves the leading detection performance on the MS-COCO and SeaDefine datasets. In particular, our DFLFFN surpasses the baseline by 4.1 points on APrs score for SeaDefine dataset, and 7.8 points on APS score for MS-COCO dataset.
引用
收藏
页数:13
相关论文
共 78 条
[1]   Skeleton-based action recognition using sparse spatio-temporal GCN with edge effective resistance [J].
Ahmad, Tasweer ;
Jin, Lianwen ;
Lin, Luojun ;
Tang, GuoZhi .
NEUROCOMPUTING, 2021, 423 :389-398
[2]   Skin lesion segmentation by using object detection networks, DeepLab3+, and active contours [J].
Bagheri, Fatemeh ;
Tarokh, Mohammad Jafar ;
Ziaratban, Majid .
TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2022, 30 (07) :2489-2507
[3]   Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks [J].
Bell, Sean ;
Zitnick, C. Lawrence ;
Bala, Kavita ;
Girshick, Ross .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2874-2883
[4]  
Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934, DOI 10.48550/ARXIV.2004.10934]
[5]   Anchor pruning for object detection [J].
Bonnaerens, Maxim ;
Freiberger, Matthias ;
Dambre, Joni .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 221
[6]   STDnet: Exploiting high resolution feature maps for small object detection [J].
Bosquet, Brais ;
Mucientes, Manuel ;
Brea, Victor M. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 91
[7]   Feature-Fused SSD: Fast Detection for Small Objects [J].
Cao, Guimei ;
Xie, Xuemei ;
Yang, Wenzhe ;
Liao, Quan ;
Shi, Guangming ;
Wu, Jinjian .
NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
[8]   A deep learning based four-fold approach to classify brain MRI: BTSCNet [J].
Chaki, Jyotismita ;
Wozniak, Marcin .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 85
[9]   R-CNN for Small Object Detection [J].
Chen, Chenyi ;
Liu, Ming-Yu ;
Tuzel, Oncel ;
Xiao, Jianxiong .
COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 :214-230
[10]   A Survey of the Four Pillars for Small Object Detection: Multiscale Representation, Contextual Information, Super-Resolution, and Region Proposal [J].
Chen, Guang ;
Wang, Haitao ;
Chen, Kai ;
Li, Zhijun ;
Song, Zida ;
Liu, Yinlong ;
Chen, Wenkai ;
Knoll, Alois .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (02) :936-953