One-Dimensional Convolutional Neural Networks with Feature Selection for Highly Concise Rule Extraction from Credit Scoring Datasets with Heterogeneous Attributes

Cited by: 12
Authors
Hayashi, Yoichi [1]
Takano, Naoki [1]
Affiliations
[1] Meiji Univ, Dept Comp Sci, Kawasaki, Kanagawa 2148571, Japan
Funding
Japan Society for the Promotion of Science
Keywords
convolutional neural networks; transparency; rule extraction; conciseness; heterogeneous attribute; dimension reduction; feature selection; credit scoring; risk assessment; rough set; classifiers; interpretability; accuracy
DOI
10.3390/electronics9081318
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Convolutional neural networks (CNNs) have proven effective, but they are not applicable to all datasets, such as those with heterogeneous attributes, which are often used in the finance and banking industries. Such datasets are difficult to classify, and to date, existing high-accuracy classifiers and rule-extraction methods have achieved neither sufficiently high classification accuracies nor sufficiently concise classification rules. This study aims to provide a new approach for achieving transparency and conciseness in credit scoring datasets with heterogeneous attributes by using a one-dimensional (1D) fully-connected layer first CNN (hereafter 1D FCLF-CNN) combined with the Recursive-Rule Extraction (Re-RX) algorithm with a J48graft decision tree. In a comparison between the proposed 1D FCLF-CNN and existing rule extraction methods, our architecture extracted the most concise rules (6.2) and achieved the best accuracy (73.10%), i.e., the highest interpretability-priority rule extraction. These results suggest that the 1D FCLF-CNN with Re-RX with J48graft is very effective for extracting highly concise rules from heterogeneous credit scoring datasets. Although it does not completely overcome the accuracy-interpretability dilemma for deep learning, it does appear to resolve this issue for credit scoring datasets with heterogeneous attributes, and thus could lead to a new era in the financial industry.
Pages: 1-15
Page count: 15