Estimation Procedures of Using Five Alternative Machine Learning Methods for Predicting Credit Card Default

被引：5

作者：

Teng, Huei-Wen ^{[1
]}

Lee, Michael ^{[2
]}

机构：

[1] Natl Chiao Tung Univ, Dept Informat Management & Finance, Hsinchu, Taiwan

[2] Georgia Inst Technol, Atlanta, GA 30332 USA

来源：

REVIEW OF PACIFIC BASIN FINANCIAL MARKETS AND POLICIES | 2019年 / 22卷 / 03期

关键词：

Artificial intelligence; machine learning; supervised learning; k-nearest neighbors; decision tree; booting; support vector machine; neural network; !text type='Python']Python[!/text; delinquency; default; credit card; credit risk; ART CLASSIFICATION ALGORITHMS; SUPPORT VECTOR MACHINES; STATISTICAL COMPARISONS; BANKRUPTCY PREDICTION; NEURAL-NETWORKS; SCORING MODELS; RISK; CLASSIFIERS; MANAGEMENT; SELECTION;

D O I：

10.1142/S0219091519500218

中图分类号：

F8 [财政、金融];

学科分类号：

0202 ;

摘要：

Machine learning has successful applications in credit risk management, portfolio management, automatic trading, and fraud detection, to name a few, in the domain of finance technology. Reformulating and solving these topics adequately and accurately is problem specific and challenging along with the availability of complex and voluminous data. In credit risk management, one major problem is to predict the default of credit card holders using real dataset. We review five machine learning methods: the k-nearest neighbors decision trees, boosting, support vector machine, and neural networks, and apply them to the above problem. In addition, we give explicit Python scripts to conduct analysis using a dataset of 29,999 instances with 23 features collected from a major bank in Taiwan, downloadable in the UC Irvine Machine Learning Repository. We show that the decision tree performs best among others in terms of validation curves.

引用

页数：27

共 49 条

[31] Cost-based feature selection for Support Vector Machines: An application in credit scoring
Maldonado, Sebastian
Perez, Juan
Bravo, Cristian
[J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2017, 261 (02) : 656 - 665
[32] Differentiating between good credits and bad credits using neuro-fuzzy systems
Malhotra, R
Malhotra, DK
[J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2002, 136 (01) : 190 - 211
[33] MITCHELL T, 1989, ANNU REV COMPUT SCI, V4, P417
[34] Including high-cardinality attributes in predictive models: A case study in churn prediction in the energy sector
Moeyersoms, Julie
Martens, David
[J]. DECISION SUPPORT SYSTEMS, 2015, 72 : 72 - 81
[35] Mohri M., 2018, Foundations of Machine Learning
[36] Subagging for credit scoring models
Paleologo, Giuseppe
Elisseeff, Andre
Antonini, Gianluca
[J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2010, 201 (02) : 490 - 499
[37] An ensemble uncertainty aware measure for directed hill climbing ensemble pruning
Partalas, Ioannis
Tsoumakas, Grigorios
Vlahavas, Ioannis
[J]. MACHINE LEARNING, 2010, 81 (03) : 257 - 282
[38] Putra E.F., 2011, RECENT RES E ACTIVIT, P174
[39] Raschka S., 2015, PYTHON MACHINE LEARN
[40] Russell Stuart J., 2010, Artificial Intelligence: A Modern Approach, Prentice Hall series in artificial intelligence, V3rd

← 1 2 3 4 5 →