ImaGen: A General Framework for Generating Memory- and Power-Efficient Image Processing Accelerators

被引：6

作者：

Ujjainkar, Nisarg ^{[1
]}

Leng, Jingwen ^{[2
]}

Zhu, Yuhao ^{[1
]}

机构：

[1] Univ Rochester, Rochester, NY 14627 USA

[2] Shanghai Jiao Tong Univ, Shanghai, Peoples R China

来源：

PROCEEDINGS OF THE 2023 THE 50TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, ISCA 2023 | 2023年

关键词：

Accelerator; Line Buffer; Image Processing; Constrained Optimization; Synthesis; Compiler; LANGUAGE; COMPILER;

D O I：

10.1145/3579371.3589076

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Image processing algorithms are prime targets for hardware acceleration as they are commonly used in resource- and power-limited applications. Today's image processing accelerator designs make rigid assumptions about the algorithm structures and/or on-chip memory resources. As a result, they either have narrowapplicability or result in inefficient designs. This paper presents a compiler framework that automatically generates memory- and power-efficient image processing accelerators. We allow programmers to describe generic image processing algorithms (in a domain specific language) and specify on-chip memory structures available. Our framework then formulates a constrained optimization problem that minimizes on-chip memory usage while maintaining theoretical maximum throughput. The key challenge we address is to analytically express the throughput bottleneck, on-chip memory contention, to enable a lightweight compilation. FPGA prototyping and ASIC synthesis show that, compared to existing approaches, accelerators generated by our framework reduce the on-chip memory usage and/or power consumption by double digits. ImaGen code is available at: https://github.com/horizon-research/imagen.

引用

页码：579 / 591

页数：13

共 40 条

[1]

Bagni Daniele, 2017, Tech. note XAPP1300

[2]

Chandramoorthy N, 2015, INT S HIGH PERF COMP, P1, DOI 10.1109/HPCA.2015.7056017

[3] SODA: Stencil with Optimized Dataflow Architecture [J].

Chi, Yuze ;

Cong, Jason ;

Wei, Peng ;

Zhou, Peipei .

2018 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD) DIGEST OF TECHNICAL PAPERS, 2018,

[4]

developer.arm, Mali-C55

[5]

developers.google, Google or-tools

[6] Type-Directed Scheduling of Streaming Accelerators [J].

Durst, David ;

Feldman, Matthew ;

Huff, Dillon ;

Akeley, David ;

Daly, Ross ;

Bernstein, Gilbert Louis ;

Patrignani, Marco ;

Fatahalian, Kayvon ;

Hanrahan, Pat .

PROCEEDINGS OF THE 41ST ACM SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION (PLDI '20), 2020, :408-422

[7]

eda ncsu, FreePDK45

[8] Crescent: Taming Memory Irregularities for Accelerating Deep Point Cloud Analytics [J].

Feng, Yu ;

Hammonds, Gunnar ;

Gan, Yiming ;

Zhu, Yuhao .

PROCEEDINGS OF THE 2022 THE 49TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA '22), 2022, :962-977

[9] Mesorasi: Architecture Support for Point Cloud Analytics via Delayed-Aggregation [J].

Feng, Yu ;

Tian, Boyuan ;

Xu, Tiancheng ;

Whatmough, Paul ;

Zhu, Yuhao .

2020 53RD ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO 2020), 2020, :1037-1050

[10] ASV: Accelerated Stereo Vision System [J].

Feng, Yu ;

Whatmough, Paul ;

Zhu, Yuhao .

MICRO'52: THE 52ND ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, 2019, :643-656

← 1 2 3 4 →