Credit Default Prediction on Time-Series Behavioral Data Using Ensemble Models

被引:1
|
作者
Guo, Kangshuai [1 ]
Luo, Shichao [2 ]
Liang, Ming [3 ]
Zhang, Zhongjian [1 ]
Yang, Huabin [1 ]
Wang, Yan [1 ]
Zhou, Yingjie [4 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] CITIC aiBank Corp Ltd, Beijing, Peoples R China
[3] China Construct Bank, Beijing, Peoples R China
[4] Sichuan Univ, Chengdu, Sichuan, Peoples R China
关键词
Credit default prediction; Time-series behavior analysis; Machine learning; RISK;
D O I
10.1109/IJCNN54540.2023.10191783
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the past few decades, credit default prediction has been central to managing risk in a consumer lending business. Credit default prediction allows lenders to optimize lending decisions, which leads to a better customer experience and sound business economics. Current models exist to help manage risk, but there is still exists space for better models that can outperform those currently in use. In this paper, we proposed a solution for the credit default prediction at double anonymized information including customer profile information and time-series behavioral data. Specifically, at the industrial-scale dataset provided by the credit default prediction competition of American Express, we leverage it to build a machine learning model that challenges the current model in application. Through the analysis of double anonymized information, we design an effective data processing flow, analyze the impact of time-series behavioral data, and derive useful latent features through feature augmentation and feature engineering, which ends up with a multi-model hybrid prediction integration scheme. The model consists of three modules: LightGBM, XGBoost, and Local-Ensemble, We use different feature combinations and individualized prediction schemes for each model to achieve efficient learning. Finally, the multi-model prediction results are ensemble to output the final result of a score-driven ensemble strategy. Experiments show that the method proposed in this paper has obvious advantages in solving the credit default prediction problem. We validate our model in the credit default prediction competition of American Express by ranking Top 10 of 4,874 teams.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models
    Karlsson, Rickard K. A.
    Willbo, Martin
    Hussain, Zeshan
    Krishnan, Rahul G.
    Sontag, David
    Johansson, Fredrik D.
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [22] Nonparametric models for functional data, with application in regression, time-series prediction and curve discrimination
    Ferraty, F
    Vieu, P
    JOURNAL OF NONPARAMETRIC STATISTICS, 2004, 16 (1-2) : 111 - 125
  • [23] Wind Power Short-Term Time-Series Prediction Using an Ensemble of Neural Networks
    Ciechulski, Tomasz
    Osowski, Stanislaw
    ENERGIES, 2024, 17 (01)
  • [24] WORKLOAD PREDICTION USING TIME-SERIES ANALYSIS
    CSONTOS, E
    COMPUTER PERFORMANCE, 1984, 5 (02): : 70 - 79
  • [25] A prediction model for healthcare time-series data with a mixture of deep mixed effect models using Gaussian processes
    Hong, Jaehyoung
    Chun, Hyonho
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 84
  • [26] Nonlinear time-series prediction with missing and noisy data
    Tresp, V
    Hofmann, R
    NEURAL COMPUTATION, 1998, 10 (03) : 731 - 747
  • [27] PREDICTION AND CONTROL FOR A TIME-SERIES COUNT DATA MODEL
    BRANNAS, K
    INTERNATIONAL JOURNAL OF FORECASTING, 1995, 11 (02) : 263 - 270
  • [28] Time-series Temporal Classification using Feature Ensemble Learning
    Liu, Ruoqian
    Murphey, Yi L.
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [29] Anomaly detection in multivariate time series data using deep ensemble models
    Iqbal, Amjad
    Amin, Rashid
    Alsubaei, Faisal S.
    Alzahrani, Abdulrahman
    PLOS ONE, 2024, 19 (06):
  • [30] TRAINING KSIM MODELS FROM TIME-SERIES DATA
    BLACK, RL
    OLDHAM, WJB
    MARCY, WM
    TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 1994, 47 (03) : 293 - 307