Energy-Quality Scalable Design Space Exploration of Approximate FFT Hardware Architectures

Cited by: 11
Authors
Pereira, Pedro Taua Lopes [1]
da Costa, Patricia Ucker Leleu [1]
Ferreira, Guilherme da Costa [1]
de Abreu, Brunno Alves [1]
Paim, Guilherme [2, 3]
da Costa, Eduardo Antonio Cesar [4]
Bampi, Sergio [1]
Affiliations
[1] Univ Fed Rio Grande Sul UFRGS, Inst Informat, PGMICRO, BR-91501970 Porto Alegre, RS, Brazil
[2] Univ Fed Rio Grande Sul UFRGS, Inst Informat, PGMICRO, BR-91501970 Porto Alegre, RS, Brazil
[3] Inst Engn Sistemas Computadores Invest & Desenvol, High Performance Comp Architectures & Syst HPCAS, P-1000029 Lisbon, Portugal
[4] Univ Catolica Pelotas UCPel, Grad Program Comput & Elect Engn, BR-96015560 Pelotas, RS, Brazil
Keywords
FFT; radix-2; butterflies; approximate adders; HIGH-SPEED; CIRCUITS; MULTIPLIER; ALGORITHM; SYSTEMS; ADDER;
DOI
10.1109/TCSI.2022.3191180
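Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract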
This paper presents a comprehensive design space exploration for boosting the energy efficiency of a fast Fourier transform (FFT) VLSI accelerator, exploiting several approximate multipliers (AxM) combined with approximate adder (AxA) circuits. The FFT hardware presented here is a fixed-point sequential architecture using a radix-2 butterfly with decimation in time. We explore a set of AxMs - namely Dynamic Range Unbiased (DRUM), Rounding-based Approximate (RoBA), Leading one Bit-based Approximate (LoBA), and the Truncated approach - jointly with the LOA, ETA-I, Copy(A), Copy(B), Trunc(0), and Trunc(1) approximate adders. The approximate arithmetic operators are used in the butterfly kernel, and the approximation levels are explored (the L and K least-significant bits for the AxM and AxA, respectively) to find the most energy-efficient configuration under a design-time QoR constraint. The mean square error and peak signal-to-noise ratio metrics determine which combinations of the approximation levels L and K allow the FFT to generate spectrograms without significant quality loss. Our results show that the LoBA multiplier with L=8, together with the LOA, Trunc(1), and Trunc(0) adders at different approximation levels, provides the largest energy savings with controllable quality degradation, with a minimum reduction of 20.2% in power dissipation and no degradation of spectrogram quality.
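As an illustration of the kind of operator and quality metric the abstract mentions, the following is a minimal Python sketch, not the authors' hardware or evaluation code. It assumes the commonly described form of the lower-part OR adder (LOA): the K least-significant bits are approximated with a bitwise OR, the carry into the exact upper part is the AND of the two most-significant lower-part bits, and operands are non-negative integers. The function names `loa_add`, `mse`, and `psnr` and the `peak` parameter are hypothetical and chosen only for this example.

```python
import math


def loa_add(a: int, b: int, k: int) -> int:
    """Approximate a + b with a lower-part OR adder (assumed LOA behaviour).

    a, b: non-negative integers; k: number of approximated LSBs.
    """
    if k == 0:
        return a + b  # no approximation: plain exact addition
    mask = (1 << k) - 1
    # Lower part: bitwise OR replaces the k-bit addition.
    low = (a | b) & mask
    # Carry into the upper part: AND of the top bits of the two lower parts.
    carry = (a >> (k - 1)) & (b >> (k - 1)) & 1
    # Upper part: exact addition of the remaining bits plus the carry.
    high = (a >> k) + (b >> k) + carry
    return (high << k) | low


def mse(ref, approx):
    """Mean square error between two equal-length sequences."""
    return sum((r - x) ** 2 for r, x in zip(ref, approx)) / len(ref)


def psnr(ref, approx, peak):
    """Peak signal-to-noise ratio in dB for a given peak signal value."""
    err = mse(ref, approx)
    return math.inf if err == 0 else 10 * math.log10(peak ** 2 / err)


# Compare exact and approximate sums over a few operand pairs.
pairs = [(11, 6), (8, 4), (200, 55), (127, 1)]
exact = [a + b for a, b in pairs]
approx = [loa_add(a, b, 2) for a, b in pairs]
quality_db = psnr(exact, approx, peak=510)  # 510 = largest sum of two 8-bit operands
```

Sweeping `k` in such a model and checking the resulting error against a threshold mirrors, in a very simplified way, the design-time quality constraint described in the abstract.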
Pages: 4524-4534
Number of pages: 11