Tiny object detection with context enhancement and feature purification

被引：84

作者：

Xiao, Jinsheng ^{[1
]}

Guo, Haowen ^{[1
]}

Zhou, Jian ^{[2
]}

Zhao, Tao ^{[1
]}

Yu, Qiuze ^{[1
]}

Chen, Yunhua ^{[3
]}

Wang, Zhongyuan ^{[4
]}

机构：

[1] Wuhan Univ, Sch Elect Informat, Wuhan, Peoples R China

[2] Wuhan Univ, State Key Lab Informat Engn Surveying, Mapping & Remote Sensing, Wuhan, Peoples R China

[3] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou, Peoples R China

[4] Wuhan Univ, Sch Comp, Wuhan, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2023年 / 211卷

基金：

中国国家自然科学基金;

关键词：

Tiny-object detection; Context enhancement; Feature purification; Dilated convolution;

D O I：

10.1016/j.eswa.2022.118665

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Tiny object detection is one of the challenges in the field of object detection, which can be applied in a variety of fields. Thanks to the advances in deep learning, significant improvement has been made in image object detection. However, the performance of tiny object detection still needs to be considerably enhanced. In this paper, we proposed a novel feature pyramid composite neural network structure comprising two modules: the context enhancement module (CEM) and feature purification module (FPM). The top-to-bottom input of the feature pyramid network into the multi-scale dilated convolution features in the CEM can augment the context information. When fusing multi-scale features in the FPM, the feature purification procedures for channel and space dimensions are employed to eliminate conflicting information, and tiny objects are more noticeable in contradictory information. Furthermore, a new data-enhancement strategy is introduced to increase the contribution of tiny objects in the loss function, which is named copy-reduce-paste and improves the balance of training samples. Overall, the experiments on the VOC dataset show that the APs score can reach 16.9% and the IOU is 0.5:0.95 for the suggested method. The APs score is more than 3.9% that of YOLOV4, more than 7.7% that of CenterNet, and more than 5.3% that of RefineDet. On TinyPerson dataset, our APs score is more than 0.8% that of YOLOV5, providing a new alternative solution for the tiny-object detection research community.

引用

页数：10

共 45 条

[1]

[Anonymous], 2010, International journal of computer vision, DOI DOI 10.1007/s11263-009-0275-4

[2]

Chen Yinbo, 2020, arXiv

[3]

Choi Jiwoong., 2021, Active learning for deep object detection via probabilistic modeling

[4] 3D-CSTM: A 3D continuous spatio-temporal mapping method [J].

Cong, Yangzi ;

Chen, Chi ;

Yang, Bisheng ;

Li, Jianping ;

Wu, Weitong ;

Li, Yuhao ;

Yang, Yandi .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 186 :232-245

[5] Extended Feature Pyramid Network for Small Object Detection [J].

Deng, Chunfang ;

Wang, Mengmeng ;

Liu, Liang ;

Liu, Yong ;

Jiang, Yunliang .

IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 :1968-1979

[6] CenterNet: Keypoint Triplets for Object Detection [J].

Duan, Kaiwen ;

Bai, Song ;

Xie, Lingxi ;

Qi, Honggang ;

Huang, Qingming ;

Tian, Qi .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577

[7] NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection [J].

Ghiasi, Golnaz ;

Lin, Tsung-Yi ;

Le, Quoc V. .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7029-7038

[8]

He K., 2017, IEEE INT C COMPUT VI, P2961, DOI DOI 10.1109/ICCV.2017.322

[9]

He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]

[10] Geometric feature enhanced line segment extraction from large-scale point clouds with hierarchical topological optimization [J].

Hu, Zongtian ;

Chen, Chi ;

Yang, Bisheng ;

Wang, Zhiye ;

Ma, Ruiqi ;

Wu, Weitong ;

Sun, Wenlu .

INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 112

← 1 2 3 4 5 →