A Fast and High-Performance Object Proposal Method for Vision Sensors: Application to Object Detection

被引:22
作者
Jiang, Chao [1 ,2 ,3 ]
Wang, Zhiling [1 ,2 ,3 ]
Liang, Huawei [1 ,2 ,3 ]
Tan, Shuhang [1 ,2 ,3 ]
机构
[1] Chinese Acad Sci, Hefei Inst Phys Sci, Hefei 230031, Peoples R China
[2] Anhui Engn Lab Intelligent Driving Technol & Appl, Hefei 230031, Peoples R China
[3] Chinese Acad Sci, Innovat Res Inst Robot & Intelligent Mfg, Hefei 230031, Peoples R China
关键词
Proposals; Computational efficiency; Object detection; Location awareness; Merging; Feature extraction; Visualization; Object proposals; object detection; enhanced frequency feature; binarization; lateral inhibition; autonomous vehicle; RECOGNITION; USERS; LIDAR;
D O I
10.1109/JSEN.2022.3155232
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Use of the object proposal method as a preprocessing step for object detection of vision sensors has improved computational efficiency in recent years. Good object proposal methods require high object detection recall, low computational cost, good localization accuracy, and repeatability. However, existing methods cannot always achieve a good balance of performance. To solve this problem, we propose a fast and high-performance object proposal algorithm. First, we propose a construction method to enhance frequency features that are combined with a linear classifier to learn and generate a set of proposal boxes. Second, we propose a strategy of binarizing frequency features and classifiers to accelerate the calculation. Last, we propose a merging strategy to improve the localization quality of the proposal boxes. Empirically, for the VOC2007 and MSCOCO2017 datasets using the intersection over union (IOU) threshold of 0.5 and 10(4) proposals, our method achieves 99.3% object detection recall, 81.1% mean average best overlap, and 80% mean repeatability with an average time of 0.0014 seconds per image. The average time is three times faster than the current fastest method, and the mean repeatability is 11% higher than that of the region proposal network (RPN) method. We applied our method to the target detection of autonomous vehicles, and in the experiment with the Oxford RobotCar dataset, we achieved 95.6% detection precision and 91.2% detection recall. This work could provide a new way to improve real-time performance and detection accuracy in the object detection of visual sensors.
引用
收藏
页码:9543 / 9557
页数:15
相关论文
共 46 条
  • [1] Measuring the Objectness of Image Windows
    Alexe, Bogdan
    Deselaers, Thomas
    Ferrari, Vittorio
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) : 2189 - 2202
  • [2] Learning a Neural Solver for Multiple Object Tracking
    Braso, Guillem
    Leal-Taixe, Laura
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6246 - 6256
  • [3] Active Vision via Extremum Seeking for Robots in Unstructured Environments: Applications in Object Recognition and Manipulation
    Calli, Berk
    Caarls, Wouter
    Wisse, Martijn
    Jonker, Pieter P.
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2018, 15 (04) : 1810 - 1822
  • [4] Enhancing object detection for autonomous driving by optimizing anchor generation and addressing class imbalance
    Carranza-Garcia, Manuel
    Lara-Benitez, Pedro
    Garcia-Gutierrez, Jorge
    Riquelme, Jose C.
    [J]. NEUROCOMPUTING, 2021, 449 : 229 - 244
  • [5] On the Performance of One-Stage and Two-Stage Object Detectors in Autonomous Vehicles Using Camera Data
    Carranza-Garcia, Manuel
    Torres-Mateo, Jesus
    Lara-Benitez, Pedro
    Garcia-Gutierrez, Jorge
    [J]. REMOTE SENSING, 2021, 13 (01) : 1 - 23
  • [6] CPMC: Automatic Object Segmentation Using Constrained Parametric Min-Cuts
    Carreira, Joao
    Sminchisescu, Cristian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (07) : 1312 - 1328
  • [7] Chen XZ, 2015, PROC CVPR IEEE, P2587, DOI 10.1109/CVPR.2015.7298874
  • [8] Scale-Aware Domain Adaptive Faster R-CNN
    Chen, Yuhua
    Wang, Haoran
    Li, Wen
    Sakaridis, Christos
    Dai, Dengxin
    Van Gool, Luc
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (07) : 2223 - 2243
  • [9] BING: Binarized Normed Gradients for Objectness Estimation at 300fps
    Cheng, Ming-Ming
    Zhang, Ziming
    Lin, Wen-Yan
    Torr, Philip
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3286 - 3293
  • [10] Driving Behavior Analysis of Intelligent Vehicle System for Lane Detection Using Vision-Sensor
    Dewangan, Deepak Kumar
    Sahu, Satya Prakash
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (05) : 6367 - 6375