2-stage modified random forest model for credit risk assessment of P2P network lending to "Three Rurals" borrowers

被引:64
作者
Rao, Congjun [1 ]
Liu, Ming [1 ]
Goh, Mark [2 ,3 ]
Wen, Jianghui [1 ]
机构
[1] Wuhan Univ Technol, Sch Sci, Wuhan 430070, Peoples R China
[2] Natl Univ Singapore, NUS Business Sch, Singapore 119623, Singapore
[3] Natl Univ Singapore, Logist Inst Asia Pacific, Singapore 119623, Singapore
基金
中国国家自然科学基金;
关键词
P2P network lending; Three Rurals" borrowers; Credit risk evaluation; Random forest; GENETIC ALGORITHM; LOAN EVALUATION; DECISION TREE; COST; CLASSIFICATION; INFORMATION; PERFORMANCE; SELECTION; ENSEMBLE;
D O I
10.1016/j.asoc.2020.106570
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid growth of the P2P online loan industry in the "Three Rurals" (agriculture, rural areas, and farmers) sector, it is imperative to manage the borrowing risk of borrowers in the rural areas. A credit risk assessment model is proposed to classify the credit worthiness of the "Three Rurals" borrowers. We select the loan data of the Pterosaur Loan platform as the research sample, and establish a 2-stage Syncretic Cost-sensitive Random Forest (SCSRF) model to evaluate the credit risk of the borrowers. From the random forest, we construct a cost relationship from the actual distribution of the data categories, introduce a weighted Mahalanobis distance using the entropy weight method in the cost function, and adopt a weighted voting for the cost-sensitive decision tree base classifier. The parameters of the SCSRF model are optimized via a grid search. We validate the SCSRF classification model against several established credit evaluation models. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 39 条
[1]   Novel Cost-Sensitive Approach to Improve the Multilayer Perceptron Performance on Imbalanced Data [J].
Castro, Cristiano L. ;
Braga, Antonio P. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (06) :888-899
[2]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[3]   Structured random forest for label distribution learning [J].
Chen, Mengting ;
Wang, Xinggang ;
Feng, Bin ;
Liu, Wenyu .
NEUROCOMPUTING, 2018, 320 :171-182
[4]   Description-text related soft information in peer-to-peer lending - Evidence from two leading European platforms [J].
Dorfleitner, Gregor ;
Priberny, Christopher ;
Schuster, Stephanie ;
Stoiber, Johannes ;
Weber, Martina ;
de Castro, Ivan ;
Kammler, Julia .
JOURNAL OF BANKING & FINANCE, 2016, 64 :169-187
[5]   Speed up grid-search for parameter selection of support vector machines [J].
Fayed, Hatem A. ;
Atiya, Amir F. .
APPLIED SOFT COMPUTING, 2019, 80 :202-210
[6]   The performance of the P2P finance industry in China [J].
Gao, Yu ;
Yu, Shih-Heng ;
Shiue, Yih-Chearng .
ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2018, 30 :138-148
[7]   Information gain directed genetic algorithm wrapper feature selection for credit rating [J].
Jadhav, Swati ;
He, Hongmei ;
Jenkins, Karl .
APPLIED SOFT COMPUTING, 2018, 69 :541-553
[8]   Twin Neural Networks for the classification of large unbalanced datasets [J].
Jayadeva ;
Pant, Himanshu ;
Sharma, Mayank ;
Soman, Sumit .
NEUROCOMPUTING, 2019, 343 :34-49
[9]   A Fuzzy Robust Weighted Approach for Multi-Criteria Bilevel Games [J].
Ji, Ying ;
Qu, Shaojian ;
Wu, Zhong ;
Liu, Zhimin .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (08) :5369-5376
[10]  
Ju QX, 2018, MOD MANAG SCI, V302, P111