Lossy Compression for Embedded Computer Vision Systems

被引：14

作者：

Guo, Li ^{[1
]}

Zhou, Dajiang ^{[1
]}

Zhou, Jinjia ^{[2
,3
]}

Kimura, Shinji ^{[1
]}

Goto, Satoshi ^{[1
]}

机构：

[1] Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu, Fukuoka 8080135, Japan

[2] Hosei Univ, Sch Sci & Engn, Tokyo 1848485, Japan

[3] PRESTO, JST, Tokyo 1020076, Japan

来源：

IEEE ACCESS | 2018年 / 6卷

基金：

日本学术振兴会;

关键词：

Computer vision; feature extraction; lossy compression; memory traffic reduction; HISTOGRAMS;

D O I：

10.1109/ACCESS.2018.2852809

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Computer vision applications are rapidly gaining popularity in embedded systems, which typically involve a difficult tradeoff between vision performance and energy consumption under a constraint of real-time processing throughput. Recently, hardware (FPGA and ASIC-based) implementations have emerged, which significantly improves the energy efficiency of vision computation. These implementations, however, often involve intensive memory traffic that retains a significant portion of energy consumption at the system level. To address this issue, we are the first researchers to present a lossy compression framework to exploit the tradeoff between vision performance and memory traffic for input images. To meet various requirements for memory access patterns in the vision system, a line-to-block format conversion is designed for the framework. Differential pulse-code modulation-based gradient-oriented quantization is developed as the lossy compression algorithm. We also present its hardware design that supports up to 12-scale 1080p@60fps real-time processing. For histogram of oriented gradient-based deformable part models on VOC2007, the proposed framework achieves a 49.6%-60.5% memory traffic reduction at a detection rate degradation of 0.05%-0.34%. For AlexNet on ImageNet, memory traffic reduction achieves up to 60.8% with less than 0.61% classification rate degradation. Compared with the power consumption reduction from memory traffic, the overhead involved for the proposed input image compression is less than 5%.

引用

页码：39385 / 39397

页数：13

共 35 条

[1]

[Anonymous], 2009, JESD792E JEDEC

[2]

[Anonymous], 2012, DDR3 SDRAM Standard

[3]

Chen YH, 2016, ISSCC DIG TECH PAP I, V59, P262, DOI 10.1109/ISSCC.2016.7418007

[4] DaDianNao: A Machine-Learning Supercomputer [J].

Chen, Yunji ;

Luo, Tao ;

Liu, Shaoli ;

Zhang, Shijin ;

He, Liqiang ;

Wang, Jia ;

Li, Ling ;

Chen, Tianshi ;

Xu, Zhiwei ;

Sun, Ninghui ;

Temam, Olivier .

2014 47TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2014, :609-622

[5] Histograms of oriented gradients for human detection [J].

Dalal, N ;

Triggs, B .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893

[6]

Deng J., 2012, ImageNet Large Scale Visual Recognition Competition

[7] The Pascal Visual Object Classes (VOC) Challenge [J].

Everingham, Mark ;

Van Gool, Luc ;

Williams, Christopher K. I. ;

Winn, John ;

Zisserman, Andrew .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338

[8] The PASCAL Visual Object Classes Challenge: A Retrospective [J].

Everingham, Mark ;

Eslami, S. M. Ali ;

Van Gool, Luc ;

Williams, Christopher K. I. ;

Winn, John ;

Zisserman, Andrew .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136

[9]

Farabet C, 2010, IEEE INT SYMP CIRC S, P257, DOI 10.1109/ISCAS.2010.5537908

[10]

Girshick R.B., 2012, Discriminatively trained deformable part models, release 5

← 1 2 3 4 →