VLSI Design for SVM-Based Speaker Verification System

被引:19
作者
Wang, Jia-Ching [1 ]
Lian, Li-Xun [1 ]
Lin, Yan-Yu [1 ]
Zhao, Jia-Hao [1 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taoyuan 32001, Taiwan
关键词
Cepstral coefficient; speaker verification; support vector machine (SVM); VLSI; SUPPORT VECTOR MACHINES; SPEECH RECOGNITION; CHIP DESIGN; IDENTIFICATION; ALGORITHM;
D O I
10.1109/TVLSI.2014.2335112
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This brief presents the chip implementation of a support vector machine (SVM)-based speaker verification system. The proposed chip comprises a speaker feature extraction (SFE) module, an SVM module, and a decision module. The SFE module performs autocorrelation analysis, linear predictive coefficient (LPC) extraction, and LPC-to-cepstrum conversion. The SVM module includes a Gaussian kernel unit and a scaling unit. The purpose of the Gaussian kernel unit is first to evaluate the kernel value of a test vector and a support vector. Four Gaussian kernel processing elements (GK-PEs) are designed to process four support vectors simultaneously. Each GK-PE is designed in the pipeline fashion and is capable of performing 2-norm and exponential operations. An enhanced CORDIC architecture is proposed to calculate the exponential value. As well as the Gaussian kernel unit, a scaling unit is also developed for use in the SVM module. The scaling unit is used to perform scaling multiplications and the remaining operations of SVM decision value evaluation. Finally, the decision module accumulates the frame scores that are generated by all of the test frames, and then compare it with a threshold to see if the test utterance is spoken by the claimed speaker. This designed chip is characterized by its high speed and its ability to handle a large number of support vectors in the SVM. The prototype chip is a semicustom chip that is fabricated using Taiwan Semiconductor Manufacturing Company 0.90-nm CMOS technology on a die with a size of similar to 7.9 x 7.9 mm(2)
引用
收藏
页码:1355 / 1359
页数:5
相关论文
共 29 条
[1]  
[Anonymous], THESIS SWISS FEDERAL
[2]  
[Anonymous], P WORKSH ADV BIOM IN
[3]  
[Anonymous], 1953, Methods of mathematical physics
[4]  
[Anonymous], IEICE T FUNDAM ELE A
[5]  
[Anonymous], P IEEE 16 INT C MICR
[6]  
[Anonymous], P INT C AC SPEECH SI
[7]   Support vector machines for speaker and language recognition [J].
Campbell, WM ;
Campbell, JP ;
Reynolds, DA ;
Singer, E ;
Torres-Carrasquillo, PA .
COMPUTER SPEECH AND LANGUAGE, 2006, 20 (2-3) :210-229
[8]   Support vector machines using GMM supervectors for speaker verification [J].
Campbell, WM ;
Sturim, DE ;
Reynolds, DA .
IEEE SIGNAL PROCESSING LETTERS, 2006, 13 (05) :308-311
[9]   The past, present, and future of speech processing [J].
Childers, D ;
Cox, RV ;
DeMori, R ;
Furui, S ;
Juang, BH ;
Mariani, JJ ;
Price, P ;
Sagayama, S ;
Sondhi, MM ;
Weischedel, R .
IEEE SIGNAL PROCESSING MAGAZINE, 1998, 15 (03) :24-48
[10]   Speaker identification for security systems using reinforcement-trained pRAM neural network architectures [J].
Clarkson, TG ;
Christodoulou, CC ;
Guan, YL ;
Gorse, D ;
Romano-Critchley, DA ;
Taylor, JG .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2001, 31 (01) :65-76