GPU-Accelerated Real-Time Stereo Estimation With Binary Neural Network

被引:27
|
作者
Chen, Gang [1 ]
Meng, Haitao [2 ,3 ]
Liang, Yucheng [1 ]
Huang, Kai [1 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou 510275, Peoples R China
[2] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110819, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518066, Peoples R China
基金
中国国家自然科学基金;
关键词
GPU acceleration; stereo estimation; binary neural network;
D O I
10.1109/TPDS.2020.3006238
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Depth estimation from stereo images is essential to many applications such as robotics and autonomous vehicles, most of which ask for the real-time response, high energy and storage efficiency. Recent work has shown deep neural networks (DNN) perform extremely well for stereo estimation. However, these state-of-the-art DNN based algorithms are challenging to be deployed into real-world applications due to the high computational complexities of DNNs. Most of them are too slow for real-time inference and require several seconds of GPU computation to process image frames. In this article, we address the problem of fast stereo estimation and propose an efficient and light-weighted stereo matching system, called StereoBit, to produce a disparity map in a real-time manner while achieving close to state-of-the-art accuracy. To achieve this goal, we propose a binary neural network to generate weighted Hamming distance for an efficient similarity join in stereo estimation. In addition, we propose a novel approximation approach to derive StereoBit network directly from the well-trained network with the cosine similarity. Our approximation strategies enable a significant speedup while maintaining almost the same accuracy compared to the network with the cosine similarity. Furthermore, we present an optimization framework for fully exploiting the computing power of StereoBit. The framework provides a significant speedup of stereo estimation routines, and at the same time, reduces the memory usage for storing parameters. The effectiveness of StereoBit is evaluated by comprehensive experiments. StereoBit can achieve 60 frames per second on an NVIDIA TITAN Xp GPU on KITTI 2012 benchmark while achieving 3-pixel non-occluded stereo error 3.56 percent.
引用
收藏
页码:2896 / 2907
页数:12
相关论文
共 50 条
  • [21] Real-time dose computation: GPU-accelerated source modeling and superposition/convolution
    Jacques, Robert
    Wong, John
    Taylor, Russell
    McNutt, Todd
    MEDICAL PHYSICS, 2011, 38 (01) : 294 - 305
  • [22] Real-Time Multiview Human Body Tracking Using GPU-Accelerated PSO
    Rymut, Boguslaw
    Kwolek, Bogdan
    PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT I, 2014, 8384 : 458 - 468
  • [23] A GPU-accelerated real-time human voice separation framework for mobile phones
    Chen, Gang
    Zheng, Yi
    Zhou, Zhaoheng
    He, Shengyu
    Yi, Wang
    JOURNAL OF SYSTEMS ARCHITECTURE, 2023, 145
  • [24] A GPU-Accelerated Real-Time NLMeans Algorithm for Denoising Color Video Sequences
    Goossens, Bart
    Luong, Hiep
    Aelterman, Jan
    Pizurica, Aleksandra
    Philips, Wilfried
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PT II, 2010, 6475 : 46 - 57
  • [25] GPU-accelerated massive black hole binary parameter estimation with LISA
    Katz, Michael L.
    Marsat, Sylvain
    Chua, Alvin J. K.
    Babak, Stanislav
    Larson, Shane L.
    PHYSICAL REVIEW D, 2020, 102 (02)
  • [26] StereoEngine: An FPGA-Based Accelerator for Real-Time High-Quality Stereo Estimation With Binary Neural Network
    Chen, Gang
    Ling, Yehua
    He, Tao
    Meng, Haitao
    He, Shengyu
    Zhang, Yu
    Huang, Kai
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (11) : 4179 - 4190
  • [27] A GPU-ACCELERATED REAL-TIME IMPLEMENTATION OF TRINICON-BSS FOR MULTIPLE SEPARATION UNITS
    Anderson, Craig A.
    Meier, Stefan
    Kellermann, Walter
    Teal, Paul D.
    Poletti, Mark A.
    2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 102 - 106
  • [28] GPU-accelerated real-time IR smoke screen simulation and assessment of its obscuration
    Wu Xin
    Zhang Jian-qi
    Huang Xi
    Liu De-lian
    INFRARED PHYSICS & TECHNOLOGY, 2012, 55 (01) : 150 - 155
  • [29] GPU-accelerated Real-time Free-viewpoint DIBR for 3DTV
    Do, Luat
    Bravo, German
    Zinger, Svitlana
    de With, Peter H. N.
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (02) : 633 - 640
  • [30] GPU-Accelerated Real-Time Mesh Simplification Using Parallel Half Edge Collapses
    Odaker, Thomas
    Kranzlmueller, Dieter
    Volkert, Jens
    MATHEMATICAL AND ENGINEERING METHODS IN COMPUTER SCIENCE, MEMICS 2015, 2016, 9548 : 107 - 118