GFCC-Based Robust Gender Detection

Cited by: 0

Authors:

Islam, M. A. [1]

Affiliations:

[1] Int Islamic Univ Chittagong, Elect & Elect Engn, Chittagong, Bangladesh

Source:

2016 INTERNATIONAL CONFERENCE ON INNOVATIONS IN SCIENCE, ENGINEERING AND TECHNOLOGY (ICISET 2016) | 2016

Keywords:

Gender classification; GFCC; GMM; Modelling; Robustness

DOI:

Not available

Chinese Library Classification number:

TP301 [Theory, methods]

Discipline classification code:

081202

Abstract:

Gender classification is a signal processing task that comprises feature extraction and behavioural gender modelling. Fundamental frequency and pitch are the features most often used for gender detection because of their distinctive characteristics in the voice source. This study presents a robust gender classification method based on Gammatone Frequency Cepstral Coefficients (GFCC). Speech samples from a text-dependent data set were used. The prototype gender behavioural models were built with a Gaussian mixture model (GMM) to obtain better performance, and only clean signals were used for training. The proposed method was tested under both clean and contaminated conditions. The clean signal was contaminated with nine different noises at signal-to-noise ratios (SNRs) ranging from 0 dB to 10 dB. The results showed that the method was very robust against noise: the average performance at 0 dB SNR was almost 100% for female speakers and 92% for male speakers, irrespective of the noise type. The performance of the proposed method can therefore be described as almost noise invariant.
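
The contamination step described in the abstract (adding noise to a clean signal at a chosen SNR) can be illustrated with a minimal sketch. This is not the author's code: the function names `mix_at_snr` and `measured_snr_db`, the synthetic sine "speech" signal, the Gaussian noise, and the 0/5/10 dB steps are all assumptions made for illustration. The record only states that nine noises were used at SNRs from 0 dB to 10 dB.

```python
import math
import random


def mix_at_snr(clean, noise, snr_db):
    """Return clean + scaled noise so the mixture has the requested SNR (dB).

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(clean) != len(noise):
        raise ValueError("clean and noise must have the same length")
    p_clean = sum(x * x for x in clean) / len(clean)
    p_noise = sum(n * n for n in noise) / len(noise)
    if p_clean == 0 or p_noise == 0:
        raise ValueError("signals must have non-zero power")
    # SNR_dB = 10 * log10(P_clean / (gain^2 * P_noise)), solved for gain.
    gain = math.sqrt(p_clean / (p_noise * 10 ** (snr_db / 10)))
    return [x + gain * n for x, n in zip(clean, noise)]


def measured_snr_db(clean, noisy):
    """Measure the SNR (dB) of a mixture given the clean reference."""
    p_clean = sum(x * x for x in clean)
    p_resid = sum((y - x) ** 2 for x, y in zip(clean, noisy))
    return 10 * math.log10(p_clean / p_resid)


if __name__ == "__main__":
    rng = random.Random(0)
    # Assumed stand-ins: a 200 Hz tone sampled at 8 kHz and white noise.
    clean = [math.sin(2 * math.pi * 200 * t / 8000) for t in range(8000)]
    noise = [rng.gauss(0.0, 1.0) for _ in range(8000)]
    for snr_db in (0, 5, 10):  # assumed step size within the stated range
        noisy = mix_at_snr(clean, noise, snr_db)
        print(snr_db, round(measured_snr_db(clean, noisy), 2))
```

Training on clean signals only and evaluating on mixtures produced this way mirrors the clean-train, noisy-test protocol the abstract describes.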


Pages: 4
