Robust speaker recognition based on biologically inspired features

被引：0

作者：

Zouhir, Youssef ^{[1
]}

Ben Fredj, Ines ^{[1
]}

Ouni, Kais ^{[1
]}

Zarka, Mohamed ^{[2
]}

机构：

[1] Univ Carthage, Natl Engn Sch Carthage, SE&ICT Lab, Res Lab Smart Elect & ICT,LR18ES44, Tunis 2035, Tunisia

[2] King Khalid Univ, Coll Sci & Arts Tanumah, Comp Sci Dept, Abha 61421, Saudi Arabia

来源：

INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING | 2020年 / 12卷 / 1-2期

关键词：

auditory filter model; biologically inspired features; NGCC; normalised gammachirp cepstral coefficients; PLPnGc; perceptual linear predictive normalised gammachirp; GMM-UBM; Gaussian mixture model-universal background model; robust speaker recognition; AUDITORY FILTER; DOMAIN;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper proposes two speech parameterisation techniques for noise-robust speaker recognition: the normalised gammachirp cepstral coefficients (NGCC) and the perceptual linear predictive normalised gammachirp (PLPnGc). These techniques employ a biologically inspired auditory model that simulates the cochlea spectral behaviour. In an automatic speaker recognition (ASR) system, we consider the Gaussian mixture model-universal background model (GMM-UBM) for speaker modelling. The performances are evaluated in clean and noisy environments using Timit, Aurora, and Demand databases. The experimental results in noisy environments showed that the biologically inspired feature extraction techniques give a better recognition rate than state-of-the-art methods.

引用

页码：19 / 27

页数：9

共 41 条

[1]

[Anonymous], 2015, INT J ADV RES COMPUT

[2]

[Anonymous], 1997, Proceeedings of Eurospeech

[3]

Ben Fredj Ines, 2017, 2017 International Conference on Advanced Systems and Electric Technologies (IC_ASET). Proceedings, P118, DOI 10.1109/ASET.2017.7983676

[4]

Ben Fredj I., 2014, INT J CONTROL ENERGY, V1, P57

[5]

Ben Fredj I, 2018, INT J SIGNAL IMAGING, V11, P65

[6] A tutorial on text-independent speaker verification [J].

Bimbot, F ;

Bonastre, JF ;

Fredouille, C ;

Gravier, G ;

Magrin-Chagnolleau, I ;

Meignier, S ;

Merlin, T ;

Ortega-García, J ;

Petrovska-Delacrétaz, D ;

Reynolds, DA .

EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (04) :430-451

[7]

Bourlard H, 1995, 4 EUR C SPEECH COMM, P1663

[8]

Chien J.T., 2016, INTERSPEECH

[9] Environmental Sound Recognition With Time-Frequency Audio Features [J].

Chu, Selina ;

Narayanan, Shrikanth ;

Kuo, C. -C. Jay .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (06) :1142-1158

[10] COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES [J].

DAVIS, SB ;

MERMELSTEIN, P .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :357-366

← 1 2 3 4 5 →