Detecting Small Objects Using a Channel-Aware Deconvolutional Network

被引：40

作者：

Duan, Kaiwen ^{[1
,2
]}

Du, Dawei ^{[3
]}

Qi, Honggang ^{[1
,2
]}

Huang, Qingming ^{[1
,2
,4
]}

机构：

[1] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 101408, Peoples R China

[2] Univ Chinese Acad Sci, Key Lab Big Data Min & Knowledge Management, Beijing 100190, Peoples R China

[3] SUNY Albany, Dept Comp Sci, Albany, NY 12222 USA

[4] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2020年 / 30卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Object detection; Feature extraction; Training; Birds; Deconvolution; Proposals; Detectors; Small object detection; channel-aware deconvolution; multi-RPN; anchor matching;

D O I：

10.1109/TCSVT.2019.2906246

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Detecting small objects is a challenging task due to their low resolution and noisy representation even using deep learning methods. In this paper, we propose a novel object detection method based on the channel-aware deconvolutional network (CADNet) for accurate small object detection. Specifically, we develop the channel-aware deconvolution (ChaDeConv) layer to exploit the correlations of feature maps in different channels across deeper layers, improving the recall rate of small objects at low additional computational costs. Following the ChaDeConv layer, the multiple region proposal sub-network (Multi-RPN) is employed to supervise and optimize multiple detection layers simultaneously to achieve better accuracy. The Multi-RPN module is only used in the training phase and does not increase the computation cost of the inference. In addition, we design a new anchor matching strategy based on the center point translation (CPTMatching) of anchors to select more extending anchors as positive samples in the training phase. The extensive experiments on the PASCAL VOC 2007/2012, MS COCO, and UAVDT datasets show that the proposed CADNet achieves state-of-the-art performance compared to the existing methods.

引用

页码：1639 / 1652

页数：14

共 70 条

[11] The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking [J].

Du, Dawei ;

Qi, Yuankai ;

Yu, Hongyang ;

Yang, Yifan ;

Duan, Kaiwen ;

Li, Guorong ;

Zhang, Weigang ;

Huang, Qingming ;

Tian, Qi .

COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 :375-391

[12]

Everingham M., 2012, PASCAL VISUAL OBJECT

[13]

Everingham M., PASCAL VISUAL OBJECT

[14]

Fu C.Y., 2017, arXiv

[15]

Gao MF, 2017, 2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), P1077, DOI 10.1109/FSKD.2017.8392913

[16]

Geiger Andreas, 2012, P IEEE C COMP VIS PA

[17] Object detection via a multi-region & semantic segmentation-aware CNN model [J].

Gidaris, Spyros ;

Komodakis, Nikos .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1134-1142

[18]

Girshick R. B., 2014, P IEEE C COMPUTER VI, P580, DOI [10.1109/CVPR.2014.81, DOI 10.1109/CVPR.2014.81]

[19] Fast R-CNN [J].

Girshick, Ross .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448

[20] CNNs-Based RGB-D Saliency Detection via Cross-View Transfer and Multiview Fusion [J].

Han, Junwei ;

Chen, Hao ;

Liu, Nian ;

Yan, Chenggang ;

Li, Xuelong .

IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (11) :3171-3183

← 1 2 3 4 5 6 7 →