Predicting distresses using deep learning of text segments in annual reports

被引:49
作者
Matin, Rastin [1 ]
Hansen, Casper [2 ]
Hansen, Christian [2 ]
Molgaard, Pia [1 ]
机构
[1] Danmarks Nationalbank, DK-1093 Copenhagen K, Denmark
[2] Univ Copenhagen, Dept Comp Sci, DK-2100 Copenhagen O, Denmark
关键词
Corporate default prediction; Natural language processing; Convolutional neural networks; Recurrent neural networks; FINANCIAL RATIOS; BANKRUPTCY PREDICTION; NEURAL-NETWORKS;
D O I
10.1016/j.eswa.2019.04.071
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Corporate distress models are central to regulators and financial institutions that need to evaluate the default risk of corporate firms. They are traditionally only based on the numerical financial variables in the firms' annual reports. In this paper we develop a model that employs the unstructured textual data in the reports as well, namely the auditors' reports and managements' statements. Our model consists of a convolutional recurrent neural network which, when concatenated with the numerical financial variables, learns a descriptive representation of the text that is suited for corporate distress prediction. We find that the unstructured data provides a statistically significant enhancement of the distress prediction performance, in particular for large firms where accurate predictions are of the utmost importance. Furthermore, we find that auditors' reports are more informative than managements' statements and that a joint model including both managements' statements and auditors' reports displays no enhancement relative to a model including only auditors' reports. Our model demonstrates a direct improvement over existing state-of-the-art models in the field of distress modelling. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:199 / 208
页数:10
相关论文
共 50 条
[41]   Multi-Label Classification of Text Documents Using Deep Learning [J].
Mohammed, Hamza Haruna ;
Dogdu, Erdogan ;
Gorur, Abdul Kadir ;
Choupani, Roya .
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, :4681-4689
[42]   Abstractive text summarization using adversarial learning and deep neural network [J].
Tank, Meenaxi ;
Thakkar, Priyank .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (17) :50849-50870
[43]   Text line extraction using deep learning and minimal sub seams [J].
Azran, Adi ;
Schclar, Alon ;
Saabni, Raid .
PROCEEDINGS OF THE 21ST ACM SYMPOSIUM ON DOCUMENT ENGINEERING (DOCENG '21), 2021,
[44]   Analysis of Text Feature Extractors using Deep Learning on Fake News [J].
Ahmed, Bilal ;
Ali, Gulsher ;
Hussain, Arif ;
Buriro, Abdul Baseer ;
Ahmed, Junaid .
ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2021, 11 (02) :7001-7005
[45]   DeepEmotex: Classifying Emotion in Text Messages using Deep Transfer Learning [J].
Hasan, Maryam ;
Rundensteiner, Elke ;
Agu, Emmanuel .
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, :5143-5152
[46]   Uncovering SMS Spam in Swahili Text Using Deep Learning Approaches [J].
Mambina, Iddi S. ;
Ndibwile, Jema D. ;
Uwimpuhwe, Deo ;
Michael, Kisangiri F. .
IEEE ACCESS, 2024, 12 :25164-25175
[47]   Labeling Chest X-Ray Reports Using Deep Learning [J].
Monshi, Maram Mahmoud A. ;
Poon, Josiah ;
Chung, Vera ;
Monshi, Fahad Mahmoud .
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT III, 2021, 12893 :684-694
[48]   Predicting Articulatory Movement from Text Using Deep Architecture with Stacked Bottleneck Features [J].
Wei, Zhen ;
Wu, Zhizheng ;
Xie, Lei .
2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
[49]   Review of bankruptcy prediction using machine learning and deep learning techniques [J].
Qu, Yi ;
Quan, Pei ;
Lei, Minglong ;
Shi, Yong .
7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2019): INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT BASED ON ARTIFICIAL INTELLIGENCE, 2019, 162 :895-899
[50]   Text Data Augmentation for Deep Learning [J].
Shorten, Connor ;
Khoshgoftaar, Taghi M. ;
Furht, Borko .
JOURNAL OF BIG DATA, 2021, 8 (01)