Energy-Quality Scalable Memory-Frugal Feature Extraction for Always-On Deep Sub-mW Distributed Vision

被引：3

作者：

Alvarez, Anastacia ^{[1
,2
]}

Ponnusamy, Gopalakrishnan ^{[1
]}

Alioto, Massimo ^{[1
]}

机构：

[1] Natl Univ Singapore, ECE Dept, Singapore 117583, Singapore

[2] Univ Philippines Diliman, EEE Inst, Quezon City 1101, Philippines

来源：

IEEE ACCESS | 2020年 / 8卷

基金：

新加坡国家研究基金会;

关键词：

Low-power; energy-quality scaling; vision; video processing; feature extraction; Internet of Things; sensor nodes; ACCELERATOR; PROCESSOR; CIRCUITS; SYSTEMS;

D O I：

10.1109/ACCESS.2020.2968576

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this work, an energy-quality (EQ) scalable and memory-frugal architecture for video feature extraction is introduced to reduce circuit complexity, power and silicon area. Leveraging on the inherent resiliency of vision against noise and inaccuracies, the proposed approach introduces properly selected EQ tuning knobs to reduce the energy of feature extraction at graceful quality degradation. As opposed to prior art, the proposed architecture enables the adjustment of such knobs, and adapts its cycle-level timing to reduce the amount of computation per frame at lower quality targets. As further benefit, the approach adds opportunities for energy reduction via aggressive voltage scaling. The proposed architecture mitigates the traditionally dominant area/energy of the on-chip memory by reducing the number of pixels stored on chip, introducing memory access reuse and on-the-fly computation. At the same time, EQ tuning preserves the ability to conventionally operate at maximum quality, when required by the task or the visual context. A 0.55 mm(2) testchip in 40nm exhibits power down to 82 mu W at 5fps frame rate (i.e., 33X lower than prior art), while assuring successful object detection at VGA resolution. To the best of the authors' knowledge, this is the first feature extractor with sub-mW operation and sub-mm(2) area, making the proposed approach well suited for tightly power-constrained and low-cost distributed vision systems (e.g., video sensor nodes).

引用

页码：18951 / 18961

页数：11

共 33 条

[11] An Energy Efficient Full-Frame Feature Extraction Accelerator With Shift-Latch FIFO in 28 nm CMOS [J].

Jeon, Dongsuk ;

Henry, Michael B. ;

Kim, Yejoong ;

Lee, Inhee ;

Zhang, Zhengya ;

Blaauw, David ;

Sylvester, Dennis .

IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2014, 49 (05) :1271-1284

[12]

Karami E., 2017, ARXIV PREPRINT ARXIV

[13] Camera mote with a high-performance parallel processor for real-time frame-based video processing [J].

Kleihorst, Richard ;

Abbo, Anteneh ;

Schueler, Ben ;

Danilin, Alexander .

2007 IEEE CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, 2007, :69-74

[14]

Knut D. E., 1998, ART COMPUTER PROGRAM, V3

[15]

Krig S., 2014, COMPUTER VISION METR

[16]

Leuven K. U., AFFINE COVARIANT FEA

[17]

Lowe D. G., 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision, P1150, DOI 10.1109/ICCV.1999.790410

[18] Distinctive image features from scale-invariant keypoints [J].

Lowe, DG .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) :91-110

[19]

Malladi KT, 2012, CONF PROC INT SYMP C, P37, DOI 10.1109/ISCA.2012.6237004

[20]

Meinerzhagen P., 2012, ESSCIRC 2012 - 38th European Solid State Circuits Conference, P321, DOI 10.1109/ESSCIRC.2012.6341319

← 1 2 3 4 →