Lossy Compression for Embedded Computer Vision Systems

被引:14
作者
Guo, Li [1 ]
Zhou, Dajiang [1 ]
Zhou, Jinjia [2 ,3 ]
Kimura, Shinji [1 ]
Goto, Satoshi [1 ]
机构
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu, Fukuoka 8080135, Japan
[2] Hosei Univ, Sch Sci & Engn, Tokyo 1848485, Japan
[3] PRESTO, JST, Tokyo 1020076, Japan
基金
日本学术振兴会;
关键词
Computer vision; feature extraction; lossy compression; memory traffic reduction; HISTOGRAMS;
D O I
10.1109/ACCESS.2018.2852809
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Computer vision applications are rapidly gaining popularity in embedded systems, which typically involve a difficult tradeoff between vision performance and energy consumption under a constraint of real-time processing throughput. Recently, hardware (FPGA and ASIC-based) implementations have emerged, which significantly improves the energy efficiency of vision computation. These implementations, however, often involve intensive memory traffic that retains a significant portion of energy consumption at the system level. To address this issue, we are the first researchers to present a lossy compression framework to exploit the tradeoff between vision performance and memory traffic for input images. To meet various requirements for memory access patterns in the vision system, a line-to-block format conversion is designed for the framework. Differential pulse-code modulation-based gradient-oriented quantization is developed as the lossy compression algorithm. We also present its hardware design that supports up to 12-scale 1080p@60fps real-time processing. For histogram of oriented gradient-based deformable part models on VOC2007, the proposed framework achieves a 49.6%-60.5% memory traffic reduction at a detection rate degradation of 0.05%-0.34%. For AlexNet on ImageNet, memory traffic reduction achieves up to 60.8% with less than 0.61% classification rate degradation. Compared with the power consumption reduction from memory traffic, the overhead involved for the proposed input image compression is less than 5%.
引用
收藏
页码:39385 / 39397
页数:13
相关论文
共 35 条
[21]   Lossless Frame Memory Compression Using Pixel-Grain Prediction and Dynamic Order Entropy Coding [J].
Lian, Xiaocong ;
Liu, Zhenyu ;
Zhou, Wei ;
Duan, Zhemin .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (01) :223-235
[22]  
Minami M., 2005, P 2 INT WORKSH NETW, P1
[23]  
Nallusamy R., 2011, Information Technology Journal, V10, P1, DOI DOI 10.3923/itj.2011.1.10
[24]  
Pawlowski J. T, 2011, 2011 IEEE HOT CHIPS, DOI DOI 10.1109/HOTCHIPS.2011.7477494
[25]  
Peemen M, 2013, 2013 IEEE 31ST INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), P13, DOI 10.1109/ICCD.2013.6657019
[26]   Histograms of Sparse Codes for Object Detection [J].
Ren, Xiaofeng ;
Ramanan, Deva .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :3246-3253
[27]   ImageNet Large Scale Visual Recognition Challenge [J].
Russakovsky, Olga ;
Deng, Jia ;
Su, Hao ;
Krause, Jonathan ;
Satheesh, Sanjeev ;
Ma, Sean ;
Huang, Zhiheng ;
Karpathy, Andrej ;
Khosla, Aditya ;
Bernstein, Michael ;
Berg, Alexander C. ;
Fei-Fei, Li .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 115 (03) :211-252
[28]  
Song Lingyun, 2010, Cold Spring Harb Protoc, V2010, DOI 10.1101/pdb.prot5384
[29]  
Suleiman A., 2016, P S VLSI TECHN CIRC, P184
[30]   An Energy-Efficient Hardware Implementation of HOG-Based Object Detection at 1080HD 60 fps with Multi-Scale Support [J].
Suleiman, Amr ;
Sze, Vivienne .
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 84 (03) :325-337