A Fast and High-Performance Object Proposal Method for Vision Sensors: Application to Object Detection

被引：22

作者：

Jiang, Chao ^{[1
,2
,3
]}

Wang, Zhiling ^{[1
,2
,3
]}

Liang, Huawei ^{[1
,2
,3
]}

Tan, Shuhang ^{[1
,2
,3
]}

机构：

[1] Chinese Acad Sci, Hefei Inst Phys Sci, Hefei 230031, Peoples R China

[2] Anhui Engn Lab Intelligent Driving Technol & Appl, Hefei 230031, Peoples R China

[3] Chinese Acad Sci, Innovat Res Inst Robot & Intelligent Mfg, Hefei 230031, Peoples R China

来源：

IEEE SENSORS JOURNAL | 2022年 / 22卷 / 10期

关键词：

Proposals; Computational efficiency; Object detection; Location awareness; Merging; Feature extraction; Visualization; Object proposals; object detection; enhanced frequency feature; binarization; lateral inhibition; autonomous vehicle; RECOGNITION; USERS; LIDAR;

D O I：

10.1109/JSEN.2022.3155232

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Use of the object proposal method as a preprocessing step for object detection of vision sensors has improved computational efficiency in recent years. Good object proposal methods require high object detection recall, low computational cost, good localization accuracy, and repeatability. However, existing methods cannot always achieve a good balance of performance. To solve this problem, we propose a fast and high-performance object proposal algorithm. First, we propose a construction method to enhance frequency features that are combined with a linear classifier to learn and generate a set of proposal boxes. Second, we propose a strategy of binarizing frequency features and classifiers to accelerate the calculation. Last, we propose a merging strategy to improve the localization quality of the proposal boxes. Empirically, for the VOC2007 and MSCOCO2017 datasets using the intersection over union (IOU) threshold of 0.5 and 10(4) proposals, our method achieves 99.3% object detection recall, 81.1% mean average best overlap, and 80% mean repeatability with an average time of 0.0014 seconds per image. The average time is three times faster than the current fastest method, and the mean repeatability is 11% higher than that of the region proposal network (RPN) method. We applied our method to the target detection of autonomous vehicles, and in the experiment with the Oxford RobotCar dataset, we achieved 95.6% detection precision and 91.2% detection recall. This work could provide a new way to improve real-time performance and detection accuracy in the object detection of visual sensors.

引用

页码：9543 / 9557

页数：15

共 46 条

[1] Measuring the Objectness of Image Windows
Alexe, Bogdan
Deselaers, Thomas
Ferrari, Vittorio
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) : 2189 - 2202
[2] Learning a Neural Solver for Multiple Object Tracking
Braso, Guillem
Leal-Taixe, Laura
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6246 - 6256
[3] Active Vision via Extremum Seeking for Robots in Unstructured Environments: Applications in Object Recognition and Manipulation
Calli, Berk
Caarls, Wouter
Wisse, Martijn
Jonker, Pieter P.
[J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2018, 15 (04) : 1810 - 1822
[4] Enhancing object detection for autonomous driving by optimizing anchor generation and addressing class imbalance
Carranza-Garcia, Manuel
Lara-Benitez, Pedro
Garcia-Gutierrez, Jorge
Riquelme, Jose C.
[J]. NEUROCOMPUTING, 2021, 449 : 229 - 244
[5] On the Performance of One-Stage and Two-Stage Object Detectors in Autonomous Vehicles Using Camera Data
Carranza-Garcia, Manuel
Torres-Mateo, Jesus
Lara-Benitez, Pedro
Garcia-Gutierrez, Jorge
[J]. REMOTE SENSING, 2021, 13 (01) : 1 - 23
[6] CPMC: Automatic Object Segmentation Using Constrained Parametric Min-Cuts
Carreira, Joao
Sminchisescu, Cristian
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (07) : 1312 - 1328
[7] Chen XZ, 2015, PROC CVPR IEEE, P2587, DOI 10.1109/CVPR.2015.7298874
[8] Scale-Aware Domain Adaptive Faster R-CNN
Chen, Yuhua
Wang, Haoran
Li, Wen
Sakaridis, Christos
Dai, Dengxin
Van Gool, Luc
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (07) : 2223 - 2243
[9] BING: Binarized Normed Gradients for Objectness Estimation at 300fps
Cheng, Ming-Ming
Zhang, Ziming
Lin, Wen-Yan
Torr, Philip
[J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3286 - 3293
[10] Driving Behavior Analysis of Intelligent Vehicle System for Lane Detection Using Vision-Sensor
Dewangan, Deepak Kumar
Sahu, Satya Prakash
[J]. IEEE SENSORS JOURNAL, 2021, 21 (05) : 6367 - 6375

← 1 2 3 4 5 →