Efficient algorithm and architecture of critical-band transform for low-power speech applications

Cited by: 0
Authors
Wang, Chao [1]
Gan, Woon-Seng
Affiliations
[1] Nanyang Technol Univ, Ctr Signal Proc, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Digital Signal Proc Lab, Singapore 639798, Singapore
Keywords
Bark; Supply Voltage; Parallel Processing; Efficient Algorithm; Power Dissipation;
DOI
10.1155/2007/89264
Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes
0808; 0809
Abstract
An efficient algorithm and its corresponding VLSI architecture for the critical-band transform (CBT) are developed to approximate the critical-band filtering of the human ear. The CBT consists of a constant-bandwidth transform in the lower frequency range and a Brown constant-Q transform (CQT) in the higher frequency range. The VLSI architecture achieves significant power efficiency by reducing computational complexity, using pipelining and parallel processing, and applying supply voltage scaling. A 21-band Bark-scale CBT processor with a sampling rate of 16 kHz is designed and simulated. Simulation results verify its suitability for short-time spectral analysis of speech. It fits the critical-band analysis of the human ear more closely and requires significantly fewer computations than other methods, and is therefore more energy-efficient. With 0.35 μm CMOS technology, it processes a 160-point speech frame in 4.99 ms at 234 kHz. The power dissipation is 15.6 μW at 1.1 V, an 82.1% power reduction compared with a benchmark 256-point FFT processor. Copyright © 2007 C. Wang and W.-S. Gan.
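The split the abstract describes, constant bandwidth at low frequencies and constant Q at high frequencies, can be illustrated with a minimal sketch. It assumes the commonly tabulated Bark (Zwicker) critical-band edges, which give 21 bands below the 8 kHz Nyquist limit of a 16 kHz sampling rate. The edge values, the name `bark_bands`, and the use of the arithmetic midpoint as the band center are assumptions for illustration; the record does not state the band plan used by the authors' processor.

```python
# Illustrative only: these are the commonly tabulated Bark (Zwicker)
# critical-band edges in Hz, assumed here. The record does not list the
# band plan actually used by the CBT processor.
BARK_EDGES_HZ = [
    0, 100, 200, 300, 400, 510, 630, 770, 920, 1080, 1270,
    1480, 1720, 2000, 2320, 2700, 3150, 3700, 4400, 5300, 6400, 7700,
]


def bark_bands(edges=BARK_EDGES_HZ):
    """Return (center_hz, bandwidth_hz, q) for each pair of adjacent edges."""
    bands = []
    for lo, hi in zip(edges, edges[1:]):
        center = (lo + hi) / 2.0  # arithmetic midpoint, a simplification
        bandwidth = hi - lo
        bands.append((center, bandwidth, center / bandwidth))
    return bands


bands = bark_bands()
for i, (center, bandwidth, q) in enumerate(bands, start=1):
    print(f"band {i:2d}: center {center:7.1f} Hz, "
          f"bandwidth {bandwidth:6.1f} Hz, Q {q:4.2f}")
```

Under these assumed edges, the first four bands are each 100 Hz wide, while the bands centered at 1 kHz and above have Q values between roughly 5.3 and 6.8, which is why a constant-bandwidth transform suits the low range and a constant-Q transform suits the high range.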
Pages: 10