Energy-Quality Scalable Design Space Exploration of Approximate FFT Hardware Architectures

被引:11
作者
Pereira, Pedro Taua Lopes [1 ]
da Costa, Patricia Ucker Leleu [1 ]
Ferreira, Guilherme da Costa [1 ]
de Abreu, Brunno Alves [1 ]
Paim, Guilherme [2 ,3 ]
da Costa, Eduardo Antonio Cesar [4 ]
Bampi, Sergio [1 ]
机构
[1] Univ Fed Rio Grande Sul UFRGS, Inst Informat, PGMICRO, BR-91501970 Porto Alegre, RS, Brazil
[2] Univ Fed Rio Grande Sul UFRGS, Inst Informat, PGMICRO, BR-91501970 Porto Alegre, RS, Brazil
[3] Inst Engn Sistemas Computadores Invest & Desenvol, High Performance Comp Architectures & Syst HPCAS, P-1000029 Lisbon, Portugal
[4] Univ Catolica Pelotas UCPel, Grad Program Comput & Elect Engn, BR-96015560 Pelotas, RS, Brazil
关键词
FFT; radix-2; butterflies; approximate adders; HIGH-SPEED; CIRCUITS; MULTIPLIER; ALGORITHM; SYSTEMS; ADDER;
D O I
10.1109/TCSI.2022.3191180
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a comprehensive design space exploration for boosting energy efficiency of a fast Fourier transform (FFT) VLSI accelerator, exploiting several approximate multipliers (AxM) combined with approximate adder (AxA) circuits. The FFT hardware herein presented consists of a fixed-point sequential architecture using a radix-2 butterfly with decimation in time. We explore a set of AxMs - namely Dynamic Range Unbiased (DRUM), Rounding-based Approximate (RoBA), leading one Bit-based Approximate (LoBA), and Truncated approach - jointly with the LOA, ETA-I, Copy(A), Copy(B), Trunc(0), Trunc(1) approximate adders. The approximate arithmetic operators are used in the butterfly kernel with exploration of the approximation levels (for the L and K least-significant bits, respectively, for the AxM and AxA), aiming at discovering the most energy-efficient configuration under a design-time QoR constraint. The mean square error and peak signal-to-noise ratio metrics define which approximate levels combining L and K variations will enable the FFT to process signals to generate spectrograms without significant losses. Our results show that the LoBA multiplier with L=8 together with the LOA, Trunc(1) and Trunc(0), at different approximation levels, provide most energy savings with controllable quality degradation, presenting a minimum decrease of 20.2% in power dissipation without degrading the spectrogram generation quality.
引用
收藏
页码:4524 / 4534
页数:11
相关论文
共 26 条
[1]   Energy-Quality Scalable Adders Based on Nonzeroing Bit Truncation [J].
Frustaci, Fabio ;
Perri, Stefania ;
Corsonello, Pasquale ;
Alioto, Massimo .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2019, 27 (04) :964-968
[2]   ReAdapt: A Reconfigurable Datapath for Runtime Energy-Quality Scalable Adaptive Filters [J].
Lopes Pereira, Pedro Taua ;
Paim, Guilherme ;
Cesar da Costa, Eduardo Antonio ;
Melo de Almeida, Sergio Jose ;
Bampi, Sergio .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (01) :327-339
[3]   Scalable FFT Architecture vs. Multiple Pipeline FFT Architectures-Hardware Implementation and Cost [J].
Suleiman, Adnan ;
Hussein, Adel ;
Bataineh, Khaldoun ;
Akopian, David .
2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, :3792-3796
[4]   Design of High Hardware Efficiency Approximate Floating-Point FFT Processor [J].
Yan, Chenggang ;
Zhao, Xuan ;
Zhang, Tingting ;
Ge, Jipeng ;
Wang, Chenghua ;
Liu, Weiqiang .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2023, 70 (11) :4283-4294
[5]   A Statistic-Based Scan Chain Reordering for Energy-Quality Scalable Scan Test [J].
Seo, Sungyoul ;
Cho, Keewon ;
Lee, Young-Woo ;
Kang, Sungho .
IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2018, 8 (03) :391-403
[6]   Design Space Exploration of 1-D FFT Processor [J].
Liu, Shaohan ;
Liu, Dake .
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 90 (11) :1609-1621
[7]   NSGA-II-Based Design Space Exploration for Energy and Throughput Aware Multicore Architectures [J].
Hussain, Ishfaq ;
Parveen, Abida ;
Ahmad, Ayaz ;
Qadri, Muhammad Yasir ;
Qadri, Nadia N. ;
Ahmed, Jameel .
CYBERNETICS AND SYSTEMS, 2017, 48 (6-7) :536-550
[8]   Exploration of Approximate Multipliers Design Space using Carry Propagation Free Compressors [J].
Boroumand, Sina ;
Afshar, Hadi P. ;
Brisk, Philip ;
Mohammadi, Siamak .
2018 23RD ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2018, :611-616
[9]   Energy-Quality Scalable Memory-Frugal Feature Extraction for Always-On Deep Sub-mW Distributed Vision [J].
Alvarez, Anastacia ;
Ponnusamy, Gopalakrishnan ;
Alioto, Massimo .
IEEE ACCESS, 2020, 8 :18951-18961
[10]   Energy-Quality Scalable Analog-to-Digital Conversion and Machine Learning Engine in a 51.9 nJ/frame Voice Activity Detector [J].
Teo, Jinq Horng ;
Cheng, Shuai ;
Alioto, Massimo .
2019 26TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (ICECS), 2019, :174-177