DGHNL: A new deep genetic hierarchical network of learners for prediction of credit scoring

被引:105
作者
Plawiak, Pawel [1 ,2 ]
Abdar, Moloud [3 ]
Plawiak, Joanna [2 ,4 ]
Makarenkov, Vladimir [3 ]
Acharya, U. Rajendra [5 ,6 ,7 ]
机构
[1] Polish Acad Sci, Inst Theoret & Appl Informat, Baltycka 5, PL-44100 Gliwice, Poland
[2] Cracow Univ Technol, Fac Comp Sci & Telecommun, Dept Informat & Commun Technol, Warsaw 24 St,F-3, PL-31155 Krakow, Poland
[3] Univ Quebec Montreal, Dept Comp Sci, Montreal, PQ H2X 3Y7, Canada
[4] AGH Univ Sci & Technol, Dept Biocybernet & Biomed Engn, Fac Elect Engn Automat Comp Sci & Biomed Engn, PL-30059 Krakow, Poland
[5] Ngee Ann Polytech, Dept Elect & Comp Engn, Clementi 599491, Singapore
[6] SUSS Univ, Sch Sci & Technol, Dept Biomed Engn, Clementi 599491, Singapore
[7] Kumamoto Univ, IROAST, Kumamoto, Japan
关键词
Credit scoring; Machine learning; Data mining; Ensemble learning; Deep learning; Genetic algorithm; Feature extraction and selection; CLASSIFICATION;
D O I
10.1016/j.ins.2019.12.045
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Credit scoring (CS) is an effective and crucial approach used for risk management in banks and other financial institutions. It provides appropriate guidance on granting loans and reduces risks in the financial area. Hence, companies and banks are trying to use novel automated solutions to deal with CS challenge to protect their own finances and customers. Nowadays, different machine learning (ML) and data mining (DM) algorithms have been used to improve various aspects of CS prediction. In this paper, we introduce a novel methodology, named Deep Genetic Hierarchical Network of Learners (DGHNL). The proposed methodology comprises different types of learners, including Support Vector Machines (SVM), k-Nearest Neighbors (kNN), Probabilistic Neural Networks (PNN), and fuzzy systems. The Statlog German (1000 instances) credit approval dataset available in the UCI machine learning repository is used to test the effectiveness of our model in the CS domain. Our DGHNL model encompasses five kinds of learners, two kinds of data normalization procedures, two extraction of features methods, three kinds of kernel functions, and three kinds of parameter optimizations. Furthermore, the model applies deep learning, ensemble learning, supervised training, layered learning, genetic selection of features (attributes), genetic optimization of learners parameters, and novel genetic layered training (selection of learners) approaches used along with the cross-validation (CV) train-ingtesting method (stratified 10-fold). The novelty of our approach relies on a proper flow and fusion of information (DGHNL structure and its optimization). We show that the proposed DGHNL model with a 29-layer structure is capable to achieve the prediction accuracy of 94.60% (54 errors per 1000 classifications) for the Statlog German credit approval data. It is the best prediction performance for this well-known credit scoring dataset, compared to the existing work in the field. (C) 2019 The Authors. Published by Elsevier Inc.
引用
收藏
页码:401 / 418
页数:18
相关论文
共 19 条
[1]   A comparative study on base classifiers in ensemble methods for credit scoring [J].
Abelian, Joaquin ;
Castellano, Javier G. .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 73 :1-10
[2]   A new hybrid ensemble credit scoring model based on classifiers consensus system approach [J].
Ala'raj, Maher ;
Abbod, Maysam F. .
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 64 :36-55
[3]  
[Anonymous], 2010, UCI Machine Learning Repository
[4]  
[Anonymous], P 4 INT FOR DEC SCI
[5]   Improving credit scoring by differentiating defaulter behaviour [J].
Bravo, Cristian ;
Thomas, Lyn C. ;
Weber, Richard .
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2015, 66 (05) :771-781
[6]   Reject inference in consumer credit scoring with nonignorable missing data [J].
Buecker, Michael ;
van Kampen, Maarten ;
Kraemer, Walter .
JOURNAL OF BANKING & FINANCE, 2013, 37 (03) :1040-1045
[7]   An artificial immune classifier for credit scoring analysis [J].
Chang, Shiow-Yun ;
Yeh, Tsung-Yuan .
APPLIED SOFT COMPUTING, 2012, 12 (02) :611-618
[8]   Principal Component Analysis and ReliefF Cascaded with Decision Tree for Credit Scoring [J].
Damrongsakmethee, Thitimanan ;
Neagoe, Victor-Emil .
ARTIFICIAL INTELLIGENCE METHODS IN INTELLIGENT ALGORITHMS, 2019, 985 :85-95
[9]  
Durand D., 1941, Risk Elements in Consumer Instalment Financing
[10]  
Feelders A. J., 2000, International Journal of Intelligent Systems in Accounting, Finance and Management, V9, P1, DOI 10.1002/(SICI)1099-1174(200003)9:1<1::AID-ISAF177>3.0.CO