Improving Credit Risk Prediction in Online Peer-to-Peer (P2P) Lending Using Imbalanced Learning Techniques

Cited by: 16
Authors
Boiko Ferreira, Luis Eduardo [1]
Barddal, Jean Paul [1]
Enembreck, Fabricio [1]
Gomes, Heitor Murilo [2]
Affiliations
[1] Pontificia Univ Catolica Parana, Grad Program Informat PPGIa, Curitiba, Parana, Brazil
[2] Univ Paris Saclay, Telecom ParisTech, LTCI, Paris, France
Source
2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017) | 2017
Keywords
CLASSIFICATION;
DOI
10.1109/ICTAI.2017.00037
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Peer-to-peer (P2P) lending is a global trend in financial markets that allows individuals to obtain and grant loans without relying on financial institutions as intermediaries. Like many real-world applications, P2P lending is imbalanced: the number of creditworthy loan requests is much larger than the number of non-creditworthy ones. In this work, we wrangle a real-world P2P lending data set from Lending Club, containing a large amount of data gathered from 2007 to 2016. We analyze how supervised classification models and techniques for handling class imbalance affect creditworthiness prediction rates. Ensembles, cost-sensitive methods, and sampling methods are combined and evaluated alongside logistic regression, decision tree, and Bayesian learning schemes. Results show that, on average, sampling techniques outperform ensembles and cost-sensitive approaches.
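To make the notion of a "sampling technique" concrete, the following is a minimal sketch of two generic resampling strategies, random undersampling and random oversampling, applied to toy data. The abstract does not state which sampling methods the authors used, so this illustrates the general idea rather than their procedure; the function names, the toy data, and the 95/5 class split are all assumptions made for the example.

```python
import random
from collections import Counter


def _group_by_class(rows, labels):
    """Collect rows into a dict keyed by class label."""
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    return by_class


def random_undersample(rows, labels, seed=0):
    """Randomly drop rows so every class shrinks to the smallest class size."""
    rng = random.Random(seed)
    by_class = _group_by_class(rows, labels)
    target = min(len(group) for group in by_class.values())
    out_rows, out_labels = [], []
    for label, group in by_class.items():
        for row in rng.sample(group, target):  # sample without replacement
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels


def random_oversample(rows, labels, seed=0):
    """Duplicate rows (with replacement) so every class grows to the largest class size."""
    rng = random.Random(seed)
    by_class = _group_by_class(rows, labels)
    target = max(len(group) for group in by_class.values())
    out_rows, out_labels = [], []
    for label, group in by_class.items():
        extra = [rng.choice(group) for _ in range(target - len(group))]
        for row in group + extra:
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels


# Hypothetical toy data: 95 creditworthy requests (label 0)
# and 5 non-creditworthy requests (label 1).
rows = [[i] for i in range(100)]
labels = [0] * 95 + [1] * 5

under_rows, under_labels = random_undersample(rows, labels)
over_rows, over_labels = random_oversample(rows, labels)

print(Counter(under_labels))  # class counts after undersampling
print(Counter(over_labels))   # class counts after oversampling
```

Either resampled set would then be used only for training a classifier; evaluation would normally stay on data with the original class distribution, so that reported prediction rates reflect the real imbalance.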
Pages: 175-181
Number of pages: 7
Related papers
13 in total
  • [1] A Deep Learning Based Online Credit Scoring Model for P2P Lending
    Zhang, Zaimei
    Niu, Kun
    Liu, Yan
    IEEE ACCESS, 2020, 8 : 177307 - 177317
  • [2] Resampling ensemble model based on data distribution for imbalanced credit risk evaluation in P2P lending
    Niu, Kun
    Zhang, Zaimei
    Liu, Yan
    Li, Renfa
    INFORMATION SCIENCES, 2020, 536 : 120 - 134
  • [3] Credit Scoring in Peer-to-peer Lending with Macro Variables and Machine Learning as Feature Selection Methods
    Guo, Weidong
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019
  • [4] A two-stage credit risk scoring method with stacked-generalisation ensemble learning in peer-to-peer lending
    Wang, Chongren
    Liu, Qigang
    Li, Shuping
    INTERNATIONAL JOURNAL OF EMBEDDED SYSTEMS, 2022, 15 (02) : 158 - 166
  • [5] Multi-view ensemble learning based on distance-to-model and adaptive clustering for imbalanced credit risk assessment in P2P lending
    Song, Yu
    Wang, Yuyan
    Ye, Xin
    Wang, Dujuan
    Yin, Yunqiang
    Wang, Yanzhang
    INFORMATION SCIENCES, 2020, 525 (525) : 182 - 204
  • [6] Leveraging network topology for credit risk assessment in P2P lending: A comparative study under the lens of machine learning
    Liu, Yiting
    Baals, Lennart John
    Osterrieder, Jorg
    Hadji-Misheva, Branka
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 252
  • [7] A new aspect on P2P online lending default prediction using meta-level phone usage data in China
    Ma, Lin
    Zhao, Xi
    Zhou, Zhili
    Liu, Yuanyuan
    DECISION SUPPORT SYSTEMS, 2018, 111 : 60 - 71
  • [8] 2-stage modified random forest model for credit risk assessment of P2P network lending to "Three Rurals" borrowers
    Rao, Congjun
    Liu, Ming
    Goh, Mark
    Wen, Jianghui
    APPLIED SOFT COMPUTING, 2020, 95
  • [9] A case-based reasoning approach to rate microcredit borrower risk in online Kiva P2P lending model
    Uddin, Mohammed Jamal
    Vizzari, Giuseppe
    Bandini, Stefania
    Imam, Mahmood Osman
    DATA TECHNOLOGIES AND APPLICATIONS, 2018, 52 (01) : 58 - 83
  • [10] New model combination meta-learner to improve accuracy prediction P2P lending with stacking ensemble learning
    Muslim, Much Aziz
    Nikmah, Tiara Lailatul
    Pertiwi, Dwika Ananda Agustina
    Subhan
    Jumanto
    Dasril, Yosza
    Iswanto
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 18