Learning region-guided scale-aware feature selection for object detection

被引：0

作者：

Liu Liu

Rujing Wang

Chengjun Xie

Rui Li

Fangyuan Wang

Man Zhou

Yue Teng

机构：

[1] University of Science and Technology of China,Institute of Intelligent Machines

[2] Chinese Academy of Sciences,undefined

来源：

Neural Computing and Applications | 2021年 / 33卷

关键词：

Scale variation; Object detection; RoI Pyramid; Scale-aware feature selective;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Scale variation is one of the major challenges in object detection task. Modern region-based object detection architectures often adopt Feature Pyramid Network (FPN) as feature extraction neck to achieve multi-scale feature representation in solving scale variation problem. However, due to the rough feature selection strategy in Region of Interest (RoI) feature extraction step, these methods might not perform well on object detection under strong scale variation. In this work, we are motivated by the limitations of current FPN-based two-stage object detectors and then present a novel module, namely scale-aware feature selective (SAFS) module, that flexibly and adaptively selects feature levels in two-stage object detectors. Specifically, we firstly build the RoI Pyramid in standard FPN structure to extract RoI features from various scale levels. Next, in order to achieve scale-aware mechanism for solving scale variation issue, we develop a novel weighting gate function containing one set of trainable parameters to automatically learn the fusion weight for each RoI feature level, which relieves the limitation of hard feature selection strategy guided by online instance size. Outputs from the RoI features with the learned weights are fused for classification and bounding box regression. Furthermore, we design a multi-level SAFS architecture to obtain different types of RoI feature combinations that ensures our method is more robust to various instance scales. Experimental results show that our SAFS module is very compatible with most of two-stage object detectors and could achieve state-of-the-art results with Average Precision of 48.3 on COCO test-dev and other popular object detection benchmarks. Our code will be made publicly available.

引用

页码：6389 / 6403

页数：14

共 23 条

[1] Li J(2017)Scale-aware fast R-CNN for pedestrian detection IEEE Trans Multimedia 20 985-996
[2] Liang X(2013)Selective search for object recognition Int J Comput Vis 104 154-171
[3] Shen S(1999)Genetic k-means algorithm IEEE Trans Syst Man Cybern Part B (Cybern) 29 433-439
[4] Xu T(2010)The pascal visual object classes (VOC) challenge Int J Comput Vis 88 303-338
[5] Feng J(2019)Towards new retail: a benchmark dataset for smart unmanned vending machines IEEE Trans Ind Inform 16 7722-7731
[6] Yan S(undefined)undefined undefined undefined undefined-undefined
[7] Uijlings JR(undefined)undefined undefined undefined undefined-undefined
[8] Van De Sande KE(undefined)undefined undefined undefined undefined-undefined
[9] Gevers T(undefined)undefined undefined undefined undefined-undefined
[10] Smeulders AW(undefined)undefined undefined undefined undefined-undefined

← 1 2 3 →