Energy-Quality Scalable Design Space Exploration of Approximate FFT Hardware Architectures

被引：11

作者：

Pereira, Pedro Taua Lopes ^{[1
]}

da Costa, Patricia Ucker Leleu ^{[1
]}

Ferreira, Guilherme da Costa ^{[1
]}

de Abreu, Brunno Alves ^{[1
]}

Paim, Guilherme ^{[2
,3
]}

da Costa, Eduardo Antonio Cesar ^{[4
]}

Bampi, Sergio ^{[1
]}

机构：

[1] Univ Fed Rio Grande Sul UFRGS, Inst Informat, PGMICRO, BR-91501970 Porto Alegre, RS, Brazil

[2] Univ Fed Rio Grande Sul UFRGS, Inst Informat, PGMICRO, BR-91501970 Porto Alegre, RS, Brazil

[3] Inst Engn Sistemas Computadores Invest & Desenvol, High Performance Comp Architectures & Syst HPCAS, P-1000029 Lisbon, Portugal

[4] Univ Catolica Pelotas UCPel, Grad Program Comput & Elect Engn, BR-96015560 Pelotas, RS, Brazil

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS | 2022年 / 69卷 / 11期

关键词：

FFT; radix-2; butterflies; approximate adders; HIGH-SPEED; CIRCUITS; MULTIPLIER; ALGORITHM; SYSTEMS; ADDER;

D O I：

10.1109/TCSI.2022.3191180

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper presents a comprehensive design space exploration for boosting energy efficiency of a fast Fourier transform (FFT) VLSI accelerator, exploiting several approximate multipliers (AxM) combined with approximate adder (AxA) circuits. The FFT hardware herein presented consists of a fixed-point sequential architecture using a radix-2 butterfly with decimation in time. We explore a set of AxMs - namely Dynamic Range Unbiased (DRUM), Rounding-based Approximate (RoBA), leading one Bit-based Approximate (LoBA), and Truncated approach - jointly with the LOA, ETA-I, Copy(A), Copy(B), Trunc(0), Trunc(1) approximate adders. The approximate arithmetic operators are used in the butterfly kernel with exploration of the approximation levels (for the L and K least-significant bits, respectively, for the AxM and AxA), aiming at discovering the most energy-efficient configuration under a design-time QoR constraint. The mean square error and peak signal-to-noise ratio metrics define which approximate levels combining L and K variations will enable the FFT to process signals to generate spectrograms without significant losses. Our results show that the LoBA multiplier with L=8 together with the LOA, Trunc(1) and Trunc(0), at different approximation levels, provide most energy savings with controllable quality degradation, presenting a minimum decrease of 20.2% in power dissipation without degrading the spectrogram generation quality.

引用

页码：4524 / 4534

页数：11

共 26 条

[1] Energy-Quality Scalable Adders Based on Nonzeroing Bit Truncation
Frustaci, Fabio
Perri, Stefania
Corsonello, Pasquale
Alioto, Massimo
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2019, 27 (04) : 964 - 968
[2] ReAdapt: A Reconfigurable Datapath for Runtime Energy-Quality Scalable Adaptive Filters
Lopes Pereira, Pedro Taua
Paim, Guilherme
Cesar da Costa, Eduardo Antonio
Melo de Almeida, Sergio Jose
Bampi, Sergio
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (01) : 327 - 339
[3] Scalable FFT Architecture vs. Multiple Pipeline FFT Architectures-Hardware Implementation and Cost
Suleiman, Adnan
Hussein, Adel
Bataineh, Khaldoun
Akopian, David
2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 3792 - 3796
[4] Design of High Hardware Efficiency Approximate Floating-Point FFT Processor
Yan, Chenggang
Zhao, Xuan
Zhang, Tingting
Ge, Jipeng
Wang, Chenghua
Liu, Weiqiang
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (11) : 4283 - 4294
[5] A Statistic-Based Scan Chain Reordering for Energy-Quality Scalable Scan Test
Seo, Sungyoul
Cho, Keewon
Lee, Young-Woo
Kang, Sungho
IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2018, 8 (03) : 391 - 403
[6] Design Space Exploration of 1-D FFT Processor
Liu, Shaohan
Liu, Dake
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 90 (11): : 1609 - 1621
[7] NSGA-II-Based Design Space Exploration for Energy and Throughput Aware Multicore Architectures
Hussain, Ishfaq
Parveen, Abida
Ahmad, Ayaz
Qadri, Muhammad Yasir
Qadri, Nadia N.
Ahmed, Jameel
CYBERNETICS AND SYSTEMS, 2017, 48 (6-7) : 536 - 550
[8] Exploration of Approximate Multipliers Design Space using Carry Propagation Free Compressors
Boroumand, Sina
Afshar, Hadi P.
Brisk, Philip
Mohammadi, Siamak
2018 23RD ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2018, : 611 - 616
[9] Energy-Quality Scalable Memory-Frugal Feature Extraction for Always-On Deep Sub-mW Distributed Vision
Alvarez, Anastacia
Ponnusamy, Gopalakrishnan
Alioto, Massimo
IEEE ACCESS, 2020, 8 : 18951 - 18961
[10] Energy-Quality Scalable Analog-to-Digital Conversion and Machine Learning Engine in a 51.9 nJ/frame Voice Activity Detector
Teo, Jinq Horng
Cheng, Shuai
Alioto, Massimo
2019 26TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (ICECS), 2019, : 174 - 177

← 1 2 3 →