X-CGRA: An Energy-Efficient Approximate Coarse-Grained Reconfigurable Architecture

被引:36
作者
Akbari, Omid [1 ,2 ]
Kamal, Mehdi [1 ]
Afzali-Kusha, Ali [1 ]
Pedram, Massoud [3 ]
Shafique, Muhammad [4 ]
机构
[1] Univ Tehran, Sch Elect & Comp Engn, Tehran 14395515, Iran
[2] Tarbiat Modares Univ, Sch Elect & Comp Engn, Tehran 14115111, Iran
[3] Univ Southern Calif, Elect Engn, Los Angeles, CA 90211 USA
[4] Vienna Univ Technol TU Wien, Inst Comp Engn, A-1040 Vienna, Austria
关键词
Energy efficiency; Reconfigurable architectures; Energy consumption; Approximate computing; Adders; Program processors; Adaptivity; approximate computing; coarsegrained reconfigurable architecture (CGRA); energy efficiency; error resilience; mapping; quality of service (QoS); PROCESSOR; DESIGN; ERROR; ADDER;
D O I
10.1109/TCAD.2019.2937738
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we present an energy-efficient approximate CGRA (X-CGRA). Instead of conventional exact arithmetic units, it employs configurable approximate adders and multipliers in the so-called quality-scalable processing elements (QSPEs). Furthermore, the structure and functionality of the other architectural components, like context memory, are modified based on the quality-scalable operating modes of the QSPEs. The quality reconfigurability of the X-CGRA makes it amenable for both error-resilient and nonresilient applications. To map the applications on the X-CGRA, a mapping technique is proposed that efficiently utilizes the QSPEs and selects appropriate approximation modes in order to lower the energy consumption while satisfying a user-defined quality constraint. We evaluate the efficacy of our X-CGRA for several benchmark applications from different domains, including image/video processing, signal processing, and scientific computations. Different sizes of X-CGRA are synthesized using a 15-nm FinFET technology. For these benchmarks, the results indicate energy consumption reduction of up to 3.21x compared to those of a typical exact CGRA, at the cost of 4% quality loss.
引用
收藏
页码:2558 / 2571
页数:14
相关论文
共 45 条
[1]   Toward Approximate Computing for Coarse-Grained Reconfigurable Architectures [J].
Akbari, Omid ;
Kamal, Mehdi ;
Afzali-Kusha, Ali ;
Pedram, Massoud ;
Shafique, Muhammad .
IEEE MICRO, 2018, 38 (06) :63-72
[2]  
Akbari O, 2018, DES AUT TEST EUROPE, P413, DOI 10.23919/DATE.2018.8342045
[3]   RAP-CLA: A Reconfigurable Approximate Carry Look-Ahead Adder [J].
Akbari, Omid ;
Kamal, Mehdi ;
Afzali-Kusha, Ali ;
Pedram, Massoud .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2018, 65 (08) :1089-1093
[4]   Dual-Quality 4:2 Compressors for Utilizing in Dynamic Accuracy Configurable Multipliers [J].
Akbari, Omid ;
Kamal, Mehdi ;
Afzali-Kusha, Ali ;
Pedram, Massoud .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2017, 25 (04) :1352-1361
[5]  
[Anonymous], 1994, Synthesis and Optimization of Digital Circuits
[6]  
[Anonymous], 2014, PROC INT C FIELD PRO
[7]   Ingredients of Adaptability: A Survey of Reconfigurable Processors [J].
Chattopadhyay, Anupam .
VLSI DESIGN, 2013,
[8]  
El-Harouni W, 2017, DES AUT TEST EUROPE, P1384, DOI 10.23919/DATE.2017.7927209
[9]  
Esmaeilzadeh H, 2012, ASPLOS XVII: SEVENTEENTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, P301
[10]  
GUAN L, 2000, MULTIMEDIA IMAGE VID