Assessing Banks' Distress Using News and Regular Financial Data

被引:3
作者
Cerchiello, Paola [1 ]
Nicola, Giancarlo [1 ]
Roennqvist, Samuel [2 ]
Sarlin, Peter [3 ]
机构
[1] Univ Pavia, Dept Econ & Management, Pavia, Italy
[2] Univ Turku, Turku Ctr Comp Sci, Data Min Lab, TurkuNLP, Turku, Finland
[3] Hanken Sch Econ, Helsinki, Finland
来源
FRONTIERS IN ARTIFICIAL INTELLIGENCE | 2022年 / 5卷
关键词
credit risk; text analysis; deep learning; Doc2Vec; classification model; DEBT;
D O I
10.3389/frai.2022.871863
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we focus our attention on leveraging the information contained in financial news to enhance the performance of a bank distress classifier. The news information should be analyzed and inserted into the predictive model in the most efficient way and this task deals with the issues related to Natural Language interpretation and to the analysis of news media. Among the different models proposed for such purpose, we investigate a deep learning approach. The methodology is based on a distributed representation of textual data obtained from a model (Doc2Vec) that maps the documents and the words contained within a text onto a reduced latent semantic space. Afterwards, a second supervised feed forward fully connected neural network is trained combining news data distributed representations with standard financial figures in input. The goal of the model is to classify the corresponding banks in distressed or tranquil state. The final aim is to comprehend both the improvement of the predictive performance of the classifier and to assess the importance of news data in the classification process. This to understand if news data really bring useful information not contained in standard financial variables.
引用
收藏
页数:11
相关论文
共 51 条
[1]  
[Anonymous], 1986, Parallel Distributed Processing: Explorations in the Microstructure of Cognition
[2]  
[Anonymous], 2014, P 2014 C EMP METH NA, DOI DOI 10.3115/V1/D14-1179
[3]  
[Anonymous], 2015, NEWS NARRATIVES FINA
[4]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[5]   A neural probabilistic language model [J].
Bengio, Y ;
Ducharme, R ;
Vincent, P ;
Jauvin, C .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) :1137-1155
[6]   Predicting distress in European banks [J].
Betz, Frank ;
Oprica, Silviu ;
Peltonen, Tuomas A. ;
Sarlin, Peter .
JOURNAL OF BANKING & FINANCE, 2014, 45 :225-241
[7]  
Bholat D.M., 2015, Text mining for central banks. Retrieved May 15, 2018
[8]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[9]  
Brown P. F., 1992, Computational Linguistics, V18, P467
[10]  
Brunnermeier Markus K., 2012, NBER Working Papers 18398, DOI [10.3386/w18398, DOI 10.3386/W18398]