Two-stage adaptive integration of multi-source heterogeneous data based on an improved random subspace and prediction of default risk of microcredit

被引:0
作者
Anzhong Huang
Fei Wu
机构
[1] Jiangsu University of Science and Technology,School of Economics and Management
[2] Shanghai University of Finance and Economics,School of Law
来源
Neural Computing and Applications | 2021年 / 33卷
关键词
Multi-source heterogeneous data; Adaptive integration; Microcredit risk;
D O I
暂无
中图分类号
学科分类号
摘要
Some scholars have shown that the machine learning methods based on a single-source data can successfully monitor the risks of formal financial activities, but not those of informal financial activities. This is because the data generated by formal financial activities, whether it is the structured or unstructured data, are of high quality and quantity, while the data generated by informal financial activities are not. Therefore, multi-source data are the key to monitor the risks of informal financial activities through machine learning. Although a few studies attempted to use multi-source data for financial risk prediction, they simply stack the obtained multi-source data, but ignore the original sources, heterogeneity, mutual redundancy and other characteristics of the data, so that the improvement of the prediction effect is not obvious. Therefore, TSAIB_RS method based on the two-stage adaptive integration of multi-source heterogeneous data was constructed in the paper, in which the data with different sources and different distributions were adaptively integrated. In order to test the reliability of TSAIB_RS method, the paper takes the default risk of microcredit in China as the test target and compares the prediction results of various test methods. It concludes that TSAIB_RS method can significantly improve the prediction effects.
引用
收藏
页码:4065 / 4075
页数:10
相关论文
共 64 条
[1]  
Rajan RG(1992)Insiders and Outsiders. The choice between informed and Arm’s-length debt J Finance 47 1367-1400
[2]  
Boot AWA(1994)Moral Hazard and secured lending in an infinitely repeated credit market game Int Econ Rev 35 899-920
[3]  
Thakor AV(2014)A comparative study of classifier ensembles for bankruptcy prediction Appl Soft Comput 24 977-984
[4]  
Tsai CF(2012)Spatiotemporal variability of drought and the potential climatological driving factors in the Liao River Hydrol Process 26 1-14
[5]  
Hsu Y-F(2016)Intelligent financial fraud detection: a comprehensive review Comput Secur 57 66-30
[6]  
Yen DC(2013)Measuring credit risk of bank customers using artificial neural network J Manag Res 5 17-41
[7]  
Liu X(2014)Business analytics using random forest trees for credit risk prediction: a comparison study Int J Adv Sci Technol 72 19-613
[8]  
Xu Z(1998)Neural network detection of management fraud using published financial data Int J Intell Syst Account Finance Manag 7 21-5923
[9]  
Yu R(2011)Data mining for credit card fraud: a comparative study Decis Support Syst 50 602-157
[10]  
West J(2013)A cost-sensitive decision tree approach for fraud detection Expert Syst Appl 40 5916-426