A GMM-Based Speaker Identification System on FPGA

被引：0

作者：

Kan, Phak Len Eh ^{[1
]}

Allen, Tim ^{[1
]}

Quigley, Steven F. ^{[1
]}

机构：

[1] Univ Birmingham, Sch Elect Elect & Comp Engn, Birmingham B15 2TT, W Midlands, England

来源：

RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS | 2010年 / 5992卷

关键词：

Speaker Identification; MFCC; GMM; Field Programmable Gate Array (FPGA);

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Speaker identification is the process of identifying persons from their voice. Speaker-specific characteristics exist in speech signals due to different speakers having different resonances of the vocal tract and these can be exploited by extracting feature vectors such as Mel frequency cepstral coefficients (MFCCs) from the speech signal. The Gaussian Mixture Model (GMM) as a well-known statistical model then models the distribution of each speaker's MFCCs in a multidimensional acoustic space. The GMM-based speaker identification system has features that make it promising for hardware acceleration. This paper describes the classification hardware implementation of a text-independent GMM-based speaker identification system. A speed factor of 90 was achieved compared to software-based implementation on a standard PC.

引用

页码：358 / 363

页数：6

共 9 条

[1]

[Anonymous], SPEECH SYNTHESIS REC

[2]

LIN EC, 2007, INT S FIELD PROGR GA, P60

[3]

MELNIKOFF S, 2003, ELECT LETT

[4]

Melnikoff S. J., 2002, INT C FIELD PROGR LO, P202

[5] Implementing a simple continuous speech recognition system on an FPGA [J].

Melnikoff, SJ ;

Quigley, SF ;

Russell, MJ .

10TH ANNUAL IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2002, :275-276

[6]

MIURA K, 2008, INT C FIELD PROGR TE

[7]

RAMOSLARA R, 2009, FIELD PROGRAMMABLE L, P202

[8] ROBUST TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS [J].

REYNOLDS, DA ;

ROSE, RC .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01) :72-83

[9] Scalable architecture for word HMM-based speech recognition and VLSI implementation in complete system [J].

Yoshizawa, S ;

Wada, N ;

Hayasaka, N ;

Miyanaga, Y .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2006, 53 (01) :70-77

← 1 →