A new pitch-range based feature set for a speaker's age and gender classification

被引:37
作者
Barkana, Buket D. [1 ]
Zhou, Jingcheng [1 ]
机构
[1] Univ Bridgeport, Dept Elect Engn, Bridgeport, CT 06604 USA
关键词
Age and gender classification; Pitch range; Fundamental frequency; MFCCs; SPEAKING FUNDAMENTAL-FREQUENCY; RECOGNITION; VOICE; SPEECH; TIME;
D O I
10.1016/j.apacoust.2015.04.013
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a pitch-range (PR) based feature set for age and gender classification. The performance of the proposed feature set is compared With MFCCs, energy, relative spectral transform-perceptual linear prediction (RASTA_PLP), and fundamental frequency (F0). Voice activity detection (VAD) is performed to extract speech utterances before feature extraction. Two different classifiers, k-Nearest Neighbors (kNN) and Support Vector Machines (SVM) are used in order to evaluate the effectiveness of the feature sets. Experimental results are reported for the aGender database. Both kNN and SVM classifiers achieved the highest accuracy rates by the proposed PR feature set in age + gender and age classifications. PR features represent the pitch changes over time. In age + gender classification, the class of middle-aged female speaker is recognized with an accuracy of 92.86%, followed by senior female speakers with 83.61%, children with 83.02%, middle-aged male speakers with 73.58%, young female speakers with 67.35%, and senior male speakers with 34.33% by using 3PR features with the SVM classifier. Low classification accuracies are observed for young male speakers. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:52 / 61
页数:10
相关论文
共 46 条
  • [1] [Anonymous], 2009, INTERSPEECH
  • [2] [Anonymous], P INT
  • [3] [Anonymous], SPOKEN LANGUAGE PROC
  • [4] [Anonymous], P 8 EUR C SPEECH COM
  • [5] [Anonymous], P 7 INT C LANG RES E
  • [6] [Anonymous], FUNDAMENTALS SPEECH
  • [7] [Anonymous], P INT
  • [8] [Anonymous], IEEE WORKSH MACH LEA
  • [9] [Anonymous], SPEECH COMMUN
  • [10] [Anonymous], P INT