A 1920 x 1080 30-frames/s 2.3 TOPS/W Stereo-Depth Processor for Energy-Efficient Autonomous Navigation of Micro Aerial Vehicles

被引:19
作者
Li, Ziyun [1 ]
Dong, Qing [1 ]
Saligane, Mehdi [1 ]
Kempke, Benjamin [1 ]
Gong, Luyao [1 ]
Zhang, Zhengya [1 ]
Dreslinski, Ronald [1 ]
Sylvester, Dennis [1 ]
Blaauw, David [1 ]
Kim, Hun-Seok [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48105 USA
关键词
8T-SRAM; autonomous navigation; semi-global matching (SGM); stereo vision;
D O I
10.1109/JSSC.2017.2751501
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a single-chip, high-performance, and energy-efficient stereo vision depth-estimation processor for micro aerial vehicles (MAVs). The proposed processor implements the state-of-the-art semi-global matching (SGM) algorithm to deliver full high-definition (HD, 1920 x 1080) stereo-depth outputs with a maximum of 38 frames/s throughput. Algorithm-architecture co-optimization is conducted, introducing overlapping block-based processing that eliminates very large on-chip memory and off-chip DRAM. We exploit inherent data parallelism in the algorithm by processing 128 local disparity costs and aggregating the SGM costs along four paths for all 128 disparities in parallel. A dependence-resolving scan associated with 16-stage deep pipeline is introduced to hide the data dependence between neighboring pixels in the SGM algorithm. Moreover, we propose a customized ultra-high bandwidth dual-port SRAM that utilizes the unique memory access characteristic of SGM to achieve highly energy-efficient memory access at a very high on-chip memory bandwidth of 1.64 Tb/s. The fabricated processor produces 512 levels of depth information for each pixel at full HD resolution with 30-frames/s performance, consuming 836 mW from a 0.75-V supply in TSMC 40-nm GP CMOS. We ported the design on a quadcopter MAV to demonstrate its performance in realistic real-time flight.
引用
收藏
页码:76 / 90
页数:15
相关论文
共 35 条
[1]  
[Anonymous], 2015, P IEEE INT SOL STAT
[2]   A Database and Evaluation Methodology for Optical Flow [J].
Baker, Simon ;
Scharstein, Daniel ;
Lewis, J. P. ;
Roth, Stefan ;
Black, Michael J. ;
Szeliski, Richard .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2011, 92 (01) :1-31
[3]   CLASS OF ALGORITHMS FOR FAST DIGITAL IMAGE REGISTRATION [J].
BARNEA, DI ;
SILVERMAN, HF .
IEEE TRANSACTIONS ON COMPUTERS, 1972, C 21 (02) :179-+
[4]  
Bi Y., 2016, INT MICR AIR VEH C C
[5]   Fast approximate energy minimization via graph cuts [J].
Boykov, Y ;
Veksler, O ;
Zabih, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (11) :1222-1239
[6]   Advances in computational stereo [J].
Brown, MZ ;
Burschka, D ;
Hager, GD .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (08) :993-1008
[7]  
Chao-Chung Cheng, 2010, 2010 IEEE International Symposium on Circuits and Systems. ISCAS 2010, P4109, DOI 10.1109/ISCAS.2010.5537613
[8]  
Chen H., 2015, Biomed Res. Int, V2015, DOI DOI 10.1007/S00382-015-2660-8
[9]  
Cho H, 2014, IEEE INT CONF ROBOT, P1836, DOI 10.1109/ICRA.2014.6907100
[10]  
Davis W. R. Jr., 1996, Lincoln Laboratory Journal, V9, P197