Detecting Small Objects Using a Channel-Aware Deconvolutional Network

Cited by: 40

Authors:

Duan, Kaiwen [1,2]

Du, Dawei [3]

Qi, Honggang [1,2]

Huang, Qingming [1,2,4]

Affiliations:

[1] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 101408, Peoples R China

[2] Univ Chinese Acad Sci, Key Lab Big Data Min & Knowledge Management, Beijing 100190, Peoples R China

[3] SUNY Albany, Dept Comp Sci, Albany, NY 12222 USA

[4] Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2020, Vol. 30, No. 6

Funding:

National Natural Science Foundation of China;

Keywords:

Object detection; Feature extraction; Training; Birds; Deconvolution; Proposals; Detectors; Small object detection; channel-aware deconvolution; multi-RPN; anchor matching;

DOI:

10.1109/TCSVT.2019.2906246

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Detecting small objects is a challenging task due to their low resolution and noisy representation even using deep learning methods. In this paper, we propose a novel object detection method based on the channel-aware deconvolutional network (CADNet) for accurate small object detection. Specifically, we develop the channel-aware deconvolution (ChaDeConv) layer to exploit the correlations of feature maps in different channels across deeper layers, improving the recall rate of small objects at low additional computational costs. Following the ChaDeConv layer, the multiple region proposal sub-network (Multi-RPN) is employed to supervise and optimize multiple detection layers simultaneously to achieve better accuracy. The Multi-RPN module is only used in the training phase and does not increase the computation cost of the inference. In addition, we design a new anchor matching strategy based on the center point translation (CPTMatching) of anchors to select more extending anchors as positive samples in the training phase. The extensive experiments on the PASCAL VOC 2007/2012, MS COCO, and UAVDT datasets show that the proposed CADNet achieves state-of-the-art performance compared to the existing methods.


Pages: 1639-1652

Page count: 14

