GMM-based speaker age and gender classification in Czech and Slovak

被引:14
作者
Pribil, Jiri [1 ,2 ]
Pribilova, Anna [3 ]
Matousek, Jindrich [4 ]
机构
[1] Slovak Acad Sci, Inst Measurement Sci, Bratislava, Slovakia
[2] Univ West Bohemia, Fac Sci Appl, NTIS, Plze, Czech Republic
[3] Slovak Univ Technol Bratislava, Fac Elect Engn & Informat Technol, Ilkovicova 3, Bratislava 81219, Slovakia
[4] Univ West Bohemia, Fac Sci Appl, Dept Cybernet, NTIS, Plzen, Czech Republic
来源
JOURNAL OF ELECTRICAL ENGINEERING-ELEKTROTECHNICKY CASOPIS | 2017年 / 68卷 / 01期
关键词
GMM classifier; spectral and prosodic features of speech; speaker gender and age classification; VOICE; RECOGNITION;
D O I
10.1515/jee-2017-0001
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The paper describes an experiment with using the Gaussian mixture models (GMM) for automatic classification of the speaker age and gender. It analyses and compares the influence of different number of mixtures and different types of speech features used for GMM gender/age classification. Dependence of the computational complexity on the number of used mixtures is also analysed. Finally, the GMM classification accuracy is compared with the output of the conventional listening tests. The results of these objective and subjective evaluations are in correspondence.
引用
收藏
页码:3 / 12
页数:10
相关论文
共 25 条
[1]  
[Anonymous], NETLAB PATTERN ANAL
[2]   Speaker age estimation using i-vectors [J].
Bahari, Mohamad Hasan ;
McLaren, Mitchell ;
Hugo Van Hamme ;
van Leeuwen, David A. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 34 :99-108
[3]   The aged voice:: A new hypothesis (Reprinted from Voice, vol 3, pg 57-73, 1994) [J].
Baken, RJ .
JOURNAL OF VOICE, 2005, 19 (03) :317-325
[4]   A new pitch-range based feature set for a speaker's age and gender classification [J].
Barkana, Buket D. ;
Zhou, Jingcheng .
APPLIED ACOUSTICS, 2015, 98 :52-61
[5]   Age and gender recognition for telephone applications based on GMM supervectors and support vector machines [J].
Bocklet, Tobias ;
Maier, Andreas ;
Bauer, Josef G. ;
Burkhardt, Felix ;
Noeth, Elmar .
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :1605-+
[6]  
Boersma P., Praat: Doing Phonetics by Computer
[7]   TEXT-INDEPENDENT SPEAKER RECOGNITION USING TWO-DIMENSIONAL INFORMATION ENTROPY [J].
Bozilovic, Bosko ;
Todorovic, Branislav M. ;
Obradovic, Miroslav .
JOURNAL OF ELECTRICAL ENGINEERING-ELEKTROTECHNICKY CASOPIS, 2015, 66 (03) :169-173
[8]   Supervector Dimension Reduction for Efficient Speaker Age Estimation Based on the Acoustic Speech Signal [J].
Dobry, Gil ;
Hecht, Ron M. ;
Avigal, Mireille ;
Zigel, Yaniv .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07) :1975-1985
[9]   Selective Review and Analysis of Aging Effects in Biometric System Implementation [J].
Fairhurst, Michael ;
Erbilek, Meryem ;
Da Costa-Abreu, Marjory .
IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2015, 45 (03) :294-303
[10]  
Fedorova A, 2015, 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, P3036