GFCC-Based Robust Gender Detection

被引:0
作者
Islam, M. A. [1 ]
机构
[1] Int Islamic Univ Chittagong, Elect & Elect Engn, Chittagong, Bangladesh
来源
2016 INTERNATIONAL CONFERENCE ON INNOVATIONS IN SCIENCE, ENGINEERING AND TECHNOLOGY (ICISET 2016) | 2016年
关键词
Gender classification; GFCC; GMM; Modelling; Robustness;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Gender classification technique is a part of the signal processing comprises with feature extraction and behavioural gender modelling. Fundamental frequency and pitch are mostly used as feature for gender detection due to their unique characteristics in voice source. In this study, Gammatone Frequency Cepstral Coefficient (GFCC)-based robust gender classification method has been presented. This study was accomplished using speech samples from a text-dependent data set. The prototype gender behavioural modelling was done using Gaussian mixture model (GMM) to obtain better performance and only clean signal was used to train the model. The performance of the proposed method was tested under both clean and contaminated conditions. The clean signal was contaminated using nine different noises at a range of signal-to-noise ratios (SNRs) from 0 dB to 10 dB. The obtained performance showed the proposed method was very robust against noise and the average performance at 0 dB SNR was almost 100% for female and 92% for male irrespective to noises. So, it could be said the proposed method performance was almost noise invariant.
引用
收藏
页数:4
相关论文
共 13 条
  • [1] A tutorial on Support Vector Machines for pattern recognition
    Burges, CJC
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) : 121 - 167
  • [2] GENDER RECOGNITION FROM SPEECH .2. FINE ANALYSIS
    CHILDERS, DG
    WU, K
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1991, 90 (04) : 1841 - 1856
  • [3] Neural-Response-Based Text-Dependent Speaker Identification Under Noisy Conditions
    Islam, M. A.
    Zilany, M. S. A.
    Wissam, A. J.
    [J]. INTERNATIONAL CONFERENCE FOR INNOVATION IN BIOMEDICAL ENGINEERING AND LIFE SCIENCES, ICIBEL2015, 2016, 56 : 11 - 14
  • [4] A Robust Speaker Identification System Using the Responses from a Model of the Auditory Periphery
    Islam, Md. Atiqul
    Jassim, Wissam A.
    Cheok, Ng Siew
    Zilany, Muhammad Shamsul Arefeen
    [J]. PLOS ONE, 2016, 11 (07):
  • [5] Li Q. P., 2009, APPL SIGN PROC AUD A, P181
  • [6] Mamun Nursadul, FUNCTIONAL ELECT STI
  • [7] Meenakshi G. Nisha, 2015, 2015 21 NAT C, P1
  • [8] Shao Y., 2009, IEEE INT C AC SPEECH
  • [9] Shao Yang, 2007, IEEE INT C AC SPEECH, V4
  • [10] Slomka S., 1997, TENCON97