Semi-Supervised Clustering for Financial Risk Analysis

被引:0
作者
Yihan Han
Tao Wang
机构
[1] Suzhou Institute of Trade and Commerce,School of Computer Science and Engineering
[2] Nanjing University of Science and Technology,undefined
来源
Neural Processing Letters | 2021年 / 53卷
关键词
Financial risk analysis; Data clustering; Semi-supervised learning; Affinity diffusion;
D O I
暂无
中图分类号
学科分类号
摘要
Many methods have been developed for financial risk analysis. In general, the conventional unsupervised approaches lack sufficient accuracy and semantics for the clustering, and the supervised approaches rely on large amount of training data for the classification. This paper explores the semi-supervised scheme for the financial data prediction, in which accurate predictions are expected with a small amount of labeled data. Due to lack of sufficient distinguishability in financial data, it is hard for the existing semi-supervised approaches to obtain satisfactory results. In order to improve the performance, we first convert the input labeled clues to the global prior probability, and propagate the’soft’ prior probability to learn the posterior probability instead of directly propagating the’hard’ labeled data. A label diffusion model is then constructed to adaptively fuse the information at feature space and label space, which makes the structures of data affinity and labeling more consistent. Experiments on two public real financial datasets validate the effectiveness of the proposed method.
引用
收藏
页码:3561 / 3572
页数:11
相关论文
共 40 条
[1]  
Tay FEH(2002)ε-Descending support vector machines for financial time series forecasting Neural Process Lett 15 179-195
[2]  
Cao LJ(2019)An evaluation of equity premium prediction using multiple kernel learning with financial features Neural Process Lett 52 117-134
[3]  
Arratia A(1977)Maximum likelihood from incomplete data via the EM algorithm J Roy Stat Soc Ser B Meth 39 1-38
[4]  
Belanche LA(1998)Using Kohonen’s self organizing feature map to uncover automobile bodily injury claims fraud J Risk Insur 65 245-274
[5]  
Fábregues L(2001)Clustering technique for risk classification and prediction of claim costs in the automobile insurance industry Intell Syst Acc Finance Manage 10 39-50
[6]  
Dempster AP(2014)Evaluation of clustering algorithms for financial risk analysis using MCDM methods Inf Sci 275 1-12
[7]  
Laird NM(2010)Fuzzy, distributed, instance counting, and default artmap neural networks for financial diagnosis Int J Inform Technol Decis Making 9 959-978
[8]  
Rubin DB(2018)Centroid neural network with pairwise constraints for semi-supervised learning Neural Process Lett 48 1721-1747
[9]  
Brockett P(2004)Learning with local and global consistency Adv Neural Inf Process Syst 16 321-328
[10]  
Xia X(2017)Towards safe semi-supervised classification: adjusted cluster assumption via clustering Neural Process Lett 46 1031-1042