Automatic Gender Recognition using Linear Prediction Coefficients and Artificial Neural Network on Speech Signal

被引:0
作者
Yusnita, M. A. [1 ]
Hafiz, A. M. [1 ]
Fadzilah, Nor M. [1 ]
Zulhanip, Aida Zulia [1 ]
Idris, Mohaiyedin [1 ]
机构
[1] Univ Teknol MARA, Fac Elect Engn, Permatang Pauh 13500, Penang, Malaysia
来源
2017 7TH IEEE INTERNATIONAL CONFERENCE ON CONTROL SYSTEM, COMPUTING AND ENGINEERING (ICCSCE) | 2017年
关键词
Automatic gender recognition; linear prediction coefficients; artificial neural network; multi-layer perceptron; speech recognition system; CLASSIFICATION; EMOTION; AGE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic Gender Recognition (AGR) system is an intelligent machine inspired by the highly advanced skills of human cognitive and developed through adequate training to recognize the gender of a speaker as male or female. In this paper, speech from 93 speakers were extracted using Linear Prediction Coefficients (LPC). Pre-processing steps such as normalization, pre-emphasis, frame blocking and windowing were carried out prior to feature extraction. The LPC coefficients of different order was investigated to produce the optimum parameters for the developed AGR. Artificial Neural Network (ANN) was used as the recognition engine and Multi-Layer Perceptron (MLP) was adopted to train the ANN. The experimental results show the highest overall recognition rate that can be achieved by the proposed system was 93.3% in average and the results also indicate almost equal performance for the detection of male and female throughout the experiments.
引用
收藏
页码:372 / 377
页数:6
相关论文
共 24 条
[1]  
[Anonymous], 2009, ICIS 2009 P
[2]  
[Anonymous], AUDIO SIGNAL PROCESS
[3]   COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES [J].
DAVIS, SB ;
MERMELSTEIN, P .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :357-366
[4]  
Firoz Shah A., 2009, Proceedings of the 2009 International Conference on Advances in Computing, Control, & Telecommunication Technologies (ACT 2009), P162, DOI 10.1109/ACT.2009.49
[5]  
Furui S., 2001, DIGITAL SPEECH PROCE, V7
[6]  
GUPTA M, 2016, 4 INT C PAR DISTR GR, P737
[7]   Low Depressive Symptom and Mental Distress Scores in Adult Long-term Survivors of Childhood Acute Lymphoblastic Leukemia [J].
Harila, Marika J. ;
Niinivirta, Tomi I. T. ;
Winqvist, Satu ;
Harila-Saari, Arja H. .
JOURNAL OF PEDIATRIC HEMATOLOGY ONCOLOGY, 2011, 33 (03) :194-198
[8]   Emotion, age, and gender classification in children's speech by humans and machines [J].
Kaya, Heysem ;
Salah, Albert Ali ;
Karpovc, Alexey ;
Frolova, Olga ;
Grigorev, Aleksey ;
Lyakso, Elena .
COMPUTER SPEECH AND LANGUAGE, 2017, 46 :268-283
[9]  
Keyvanrad Mohammad Ali, 2009, 2009 14th International CSI Computer Conference (CSICC 2009) (Postponed from July 2009), P613, DOI 10.1109/CSICC.2009.5349647
[10]   Use of Multiple Classifier System for Gender Driven Speech Emotion Recognition [J].
Ladde, Pravina P. ;
Deshmukh, Vaishali S. .
2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (CICN), 2015, :713-717