Loan default prediction by combining soft information extracted from descriptive text in online peer-to-peer lending

被引:114
作者
Jiang, Cuiqing [1 ]
Wang, Zhao [1 ]
Wang, Ruiya [1 ]
Ding, Yong [1 ]
机构
[1] Hefei Univ Technol, Sch Management, Hefei, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
P2P lending; Default prediction; Soft information; Topic model; CREDIT RISK; SELECTION; BORROWER; FINANCE;
D O I
10.1007/s10479-017-2668-z
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Predicting whether a borrower will default on a loan is of significant concern to platforms and investors in online peer-to-peer (P2P) lending. Because the data types online platforms use are complex and involve unstructured information such as text, which is difficult to quantify and analyze, loan default prediction faces new challenges in P2P. To this end, we propose a default prediction method for P2P lending combined with soft information related to textual description. We introduce a topic model to extract valuable features from the descriptive text concerning loans and construct four default prediction models to demonstrate the performance of these features for default prediction. Moreover, a two-stage method is designed to select an effective feature set containing both soft and hard information. An empirical analysis using real-word data from a major P2P lending platform in China shows that the proposed method can improve loan default prediction performance compared with existing methods based only on hard information.
引用
收藏
页码:511 / 529
页数:19
相关论文
共 35 条
[1]   CREDIT SCORING, STATISTICAL TECHNIQUES AND EVALUATION CRITERIA: A REVIEW OF THE LITERATURE [J].
Abdou, Hussein A. ;
Pointon, John .
INTELLIGENT SYSTEMS IN ACCOUNTING FINANCE & MANAGEMENT, 2011, 18 (2-3) :59-88
[2]   The financing of innovative SMEs: A multicriteria credit rating model [J].
Angilella, Silvia ;
Mazzu, Sebastiano .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2015, 244 (02) :540-554
[3]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   The Relevance of Soft Information for Predicting Small Business Credit Default: Evidence from a Social Bank [J].
Cornee, Simon .
JOURNAL OF SMALL BUSINESS MANAGEMENT, 2019, 57 (03) :699-719
[6]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[7]   Recent developments in consumer credit risk assessment [J].
Crook, Jonathan N. ;
Edelman, David B. ;
Thomas, Lyn C. .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 183 (03) :1447-1465
[8]   Description-text related soft information in peer-to-peer lending - Evidence from two leading European platforms [J].
Dorfleitner, Gregor ;
Priberny, Christopher ;
Schuster, Stephanie ;
Stoiber, Johannes ;
Weber, Martina ;
de Castro, Ivan ;
Kammler, Julia .
JOURNAL OF BANKING & FINANCE, 2016, 64 :169-187
[9]   Evaluating credit risk and loan performance in online Peer-to-Peer (P2P) lending [J].
Emekter, Riza ;
Tu, Yanbin ;
Jirasakuldech, Benjamas ;
Lu, Min .
APPLIED ECONOMICS, 2015, 47 (01) :54-70
[10]   Multiple classifier architectures and their application to credit risk assessment [J].
Finlay, Steven .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2011, 210 (02) :368-378