Lossy Compression for Embedded Computer Vision Systems

被引：14

作者：

Guo, Li ^{[1
]}

Zhou, Dajiang ^{[1
]}

Zhou, Jinjia ^{[2
,3
]}

Kimura, Shinji ^{[1
]}

Goto, Satoshi ^{[1
]}

机构：

[1] Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu, Fukuoka 8080135, Japan

[2] Hosei Univ, Sch Sci & Engn, Tokyo 1848485, Japan

[3] PRESTO, JST, Tokyo 1020076, Japan

来源：

IEEE ACCESS | 2018年 / 6卷

基金：

日本学术振兴会;

关键词：

Computer vision; feature extraction; lossy compression; memory traffic reduction; HISTOGRAMS;

D O I：

10.1109/ACCESS.2018.2852809

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Computer vision applications are rapidly gaining popularity in embedded systems, which typically involve a difficult tradeoff between vision performance and energy consumption under a constraint of real-time processing throughput. Recently, hardware (FPGA and ASIC-based) implementations have emerged, which significantly improves the energy efficiency of vision computation. These implementations, however, often involve intensive memory traffic that retains a significant portion of energy consumption at the system level. To address this issue, we are the first researchers to present a lossy compression framework to exploit the tradeoff between vision performance and memory traffic for input images. To meet various requirements for memory access patterns in the vision system, a line-to-block format conversion is designed for the framework. Differential pulse-code modulation-based gradient-oriented quantization is developed as the lossy compression algorithm. We also present its hardware design that supports up to 12-scale 1080p@60fps real-time processing. For histogram of oriented gradient-based deformable part models on VOC2007, the proposed framework achieves a 49.6%-60.5% memory traffic reduction at a detection rate degradation of 0.05%-0.34%. For AlexNet on ImageNet, memory traffic reduction achieves up to 60.8% with less than 0.61% classification rate degradation. Compared with the power consumption reduction from memory traffic, the overhead involved for the proposed input image compression is less than 5%.

引用

页码：39385 / 39397

页数：13

共 35 条

[21] Lossless Frame Memory Compression Using Pixel-Grain Prediction and Dynamic Order Entropy Coding [J].

Lian, Xiaocong ;

Liu, Zhenyu ;

Zhou, Wei ;

Duan, Zhemin .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (01) :223-235

[22]

Minami M., 2005, P 2 INT WORKSH NETW, P1

[23]

Nallusamy R., 2011, Information Technology Journal, V10, P1, DOI DOI 10.3923/itj.2011.1.10

[24]

Pawlowski J. T, 2011, 2011 IEEE HOT CHIPS, DOI DOI 10.1109/HOTCHIPS.2011.7477494

[25]

Peemen M, 2013, 2013 IEEE 31ST INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), P13, DOI 10.1109/ICCD.2013.6657019

[26] Histograms of Sparse Codes for Object Detection [J].

Ren, Xiaofeng ;

Ramanan, Deva .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :3246-3253

[27] ImageNet Large Scale Visual Recognition Challenge [J].

Russakovsky, Olga ;

Deng, Jia ;

Su, Hao ;

Krause, Jonathan ;

Satheesh, Sanjeev ;

Ma, Sean ;

Huang, Zhiheng ;

Karpathy, Andrej ;

Khosla, Aditya ;

Bernstein, Michael ;

Berg, Alexander C. ;

Fei-Fei, Li .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 115 (03) :211-252

[28]

Song Lingyun, 2010, Cold Spring Harb Protoc, V2010, DOI 10.1101/pdb.prot5384

[29]

Suleiman A., 2016, P S VLSI TECHN CIRC, P184

[30] An Energy-Efficient Hardware Implementation of HOG-Based Object Detection at 1080HD 60 fps with Multi-Scale Support [J].

Suleiman, Amr ;

Sze, Vivienne .

JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 84 (03) :325-337

← 1 2 3 4 →