Dual-Resolution Dual-Path Convolutional Neural Networks for Fast Object Detection

被引:5
作者
Pan, Jing [1 ]
Sun, Hanqing [1 ]
Song, Zhanjie [2 ]
Han, Jungong [3 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Tianjin Univ, Sch Math, Tianjin 300072, Peoples R China
[3] Univ Warwick, WMG Data Sci, Coventry CV4 7AL, England
关键词
dual-resolution; CNN; visual object detection; progressive fusion;
D O I
10.3390/s19143111
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Downsampling input images is a simple trick to speed up visual object-detection algorithms, especially on robotic vision and applied mobile vision systems. However, this trick comes with a significant decline in accuracy. In this paper, dual-resolution dual-path Convolutional Neural Networks (CNNs), named DualNets, are proposed to bump up the accuracy of those detection applications. In contrast to previous methods that simply downsample the input images, DualNets explicitly take dual inputs in different resolutions and extract complementary visual features from these using dual CNN paths. The two paths in a DualNet are a backbone path and an auxiliary path that accepts larger inputs and then rapidly downsamples them to relatively small feature maps. With the help of the carefully designed auxiliary CNN paths in DualNets, auxiliary features are extracted from the larger input with controllable computation. Auxiliary features are then fused with the backbone features using a proposed progressive residual fusion strategy to enrich feature representation.This architecture, as the feature extractor, is further integrated with the Single Shot Detector (SSD) to accomplish latency-sensitive visual object-detection tasks. We evaluate the resulting detection pipeline on Pascal VOC and MS COCO benchmarks. Results show that the proposed DualNets can raise the accuracy of those CNN detection applications that are sensitive to computation payloads.
引用
收藏
页数:16
相关论文
共 38 条
[1]  
[Anonymous], 2017, 170404861 ARXIV
[2]  
[Anonymous], 2017, P IEEE C COMP VIS PA
[3]  
[Anonymous], 2018, P IEEE C COMP VIS PA
[4]  
[Anonymous], 180707466 ARXIV
[5]  
[Anonymous], 2017, 170106659 ARXIV
[6]  
[Anonymous], 2015, ADV NEURAL INFORM PR
[7]  
[Anonymous], 2014, P ECCV
[8]  
[Anonymous], 14116550 ARXIV
[9]  
[Anonymous], 2015, 150302531 ARXIV
[10]  
[Anonymous], PROC CVPR IEEE