A Hybrid Ensemble Stacking Model for Gender Voice Recognition Approach

被引:16
作者
Alkhammash, Eman H. [1 ]
Hadjouni, Myriam [2 ]
Elshewey, Ahmed M. [3 ]
机构
[1] Taif Univ, Coll Comp & Informat Technol, Dept Comp Sci, POB 11099, Taif 21944, Saudi Arabia
[2] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Comp Sci, POB 84428, Riyadh 11671, Saudi Arabia
[3] Suez Univ, Fac Comp & Informat, Comp Sci Dept, Suez, Egypt
关键词
machine learning; stacking model; ensemble learning; k-nearest neighbor; stochastic gradient descent; support vector machine; logistic regression; linear discriminant analysis; REGRESSION; CLASSIFIER;
D O I
10.3390/electronics11111750
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gender recognition by voice is a vital research subject in speech processing and acoustics, as human voices have many remarkable characteristics. Voice recognition is beneficial in a variety of applications, including mobile health care systems, interactive systems, crime analysis, and recognition systems. Several algorithms for voice recognition have been developed, but there is still potential for development in terms of the system's accuracy and efficiency. Recent research has focused on combining ensemble learning with a variety of machine learning models in order to create more accurate classifiers. In this paper, a stacked ensemble for gender voice recognition model is presented, using four classifiers, namely, k-nearest neighbor (KNN), support vector machine (SVM), stochastic gradient descent (SGD), and logistic regression (LR) as base classifiers and linear discriminant analysis (LDA) as meta classifier. The dataset used includes 3168 instances and 21 features, where 20 features are the predictors, and one feature is the target. Several prediction evaluation metrics, including precision, accuracy, recall, F1 score, and area under the receiver operating characteristic curve (AUC), were computed to verify the execution of the proposed model. The results obtained illustrated that the stacked model achieved better results compared to other conventional machine learning models. The stacked model achieved high accuracy with 99.64%.
引用
收藏
页数:13
相关论文
共 57 条
[1]  
Anguita D., 2012, ESANN, P441
[2]  
[Anonymous], 2013, Robust Data Min.
[3]  
[Anonymous], GENDER RECOGNITION V
[4]  
[Anonymous], 2017, International Journal on Advances in Software, V10, P1
[5]   A random forest classifier for lymph diseases [J].
Azar, Ahmad Taher ;
Elshazly, Hanaa Ismail ;
Hassanien, Aboul Ella ;
Elkorany, Abeer Mohamed .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2014, 113 (02) :465-473
[6]  
Bisio I, 2015, IEEE ICC, P7030, DOI 10.1109/ICC.2015.7249447
[7]  
Bottou Leon, 2012, Neural Networks: Tricks of the Trade. Second Edition: LNCS 7700, P421, DOI 10.1007/978-3-642-35289-8_25
[8]  
Buyukyilmaz M, 2016, ACSR ADV COMPUT, V58, P409
[9]   Data Cleaning: Overview and Emerging Challenges [J].
Chu, Xu ;
Ilyas, Ihab F. ;
Krishnan, Sanjay ;
Wang, Jiannan .
SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, :2201-2206
[10]  
Clarke B, 2009, SPRINGER SER STAT, P1, DOI 10.1007/978-0-387-98135-2