Estimation Procedures of Using Five Alternative Machine Learning Methods for Predicting Credit Card Default

被引:5
作者
Teng, Huei-Wen [1 ]
Lee, Michael [2 ]
机构
[1] Natl Chiao Tung Univ, Dept Informat Management & Finance, Hsinchu, Taiwan
[2] Georgia Inst Technol, Atlanta, GA 30332 USA
关键词
Artificial intelligence; machine learning; supervised learning; k-nearest neighbors; decision tree; booting; support vector machine; neural network; !text type='Python']Python[!/text; delinquency; default; credit card; credit risk; ART CLASSIFICATION ALGORITHMS; SUPPORT VECTOR MACHINES; STATISTICAL COMPARISONS; BANKRUPTCY PREDICTION; NEURAL-NETWORKS; SCORING MODELS; RISK; CLASSIFIERS; MANAGEMENT; SELECTION;
D O I
10.1142/S0219091519500218
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Machine learning has successful applications in credit risk management, portfolio management, automatic trading, and fraud detection, to name a few, in the domain of finance technology. Reformulating and solving these topics adequately and accurately is problem specific and challenging along with the availability of complex and voluminous data. In credit risk management, one major problem is to predict the default of credit card holders using real dataset. We review five machine learning methods: the k-nearest neighbors decision trees, boosting, support vector machine, and neural networks, and apply them to the above problem. In addition, we give explicit Python scripts to conduct analysis using a dataset of 29,999 instances with 23 features collected from a major bank in Taiwan, downloadable in the UC Irvine Machine Learning Repository. We show that the decision tree performs best among others in terms of validation curves.
引用
收藏
页数:27
相关论文
共 49 条
  • [31] Cost-based feature selection for Support Vector Machines: An application in credit scoring
    Maldonado, Sebastian
    Perez, Juan
    Bravo, Cristian
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2017, 261 (02) : 656 - 665
  • [32] Differentiating between good credits and bad credits using neuro-fuzzy systems
    Malhotra, R
    Malhotra, DK
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2002, 136 (01) : 190 - 211
  • [33] MITCHELL T, 1989, ANNU REV COMPUT SCI, V4, P417
  • [34] Including high-cardinality attributes in predictive models: A case study in churn prediction in the energy sector
    Moeyersoms, Julie
    Martens, David
    [J]. DECISION SUPPORT SYSTEMS, 2015, 72 : 72 - 81
  • [35] Mohri M., 2018, Foundations of Machine Learning
  • [36] Subagging for credit scoring models
    Paleologo, Giuseppe
    Elisseeff, Andre
    Antonini, Gianluca
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2010, 201 (02) : 490 - 499
  • [37] An ensemble uncertainty aware measure for directed hill climbing ensemble pruning
    Partalas, Ioannis
    Tsoumakas, Grigorios
    Vlahavas, Ioannis
    [J]. MACHINE LEARNING, 2010, 81 (03) : 257 - 282
  • [38] Putra E.F., 2011, RECENT RES E ACTIVIT, P174
  • [39] Raschka S., 2015, PYTHON MACHINE LEARN
  • [40] Russell Stuart J., 2010, Artificial Intelligence: A Modern Approach, Prentice Hall series in artificial intelligence, V3rd