Incorporating Multiple Textual Factors into Unbalanced Financial Distress Prediction: A Feature Selection Methods and Ensemble Classifiers Combined Approach

被引:3
|
作者
Li, Shixuan [1 ]
Shi, Wenxuan [2 ]
机构
[1] Wuhan Univ Technol, Sch Safety Sci & Emergency Management, Wuhan, Peoples R China
[2] Wuhan Univ, Sch Informat Management, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Textual factors; Feature selection; Ensemble classifiers; Financial distress prediction; Word embedding; SENTIMENT; COMPANIES;
D O I
10.1007/s44196-023-00342-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Textual-based factors have been widely regarded as a promising feature that can be applied to financial issues. This study focuses on extracting both basic and semantic textual features to supplement the traditionally used financial indicators. The main is to improve Chinese listed companies' financial distress prediction (FDP). A unique paradigm is proposed in this study that combines financial and multi-type textual predictive factors, feature selection methods, classifiers, and time spans to achieve the optimal FDP. The frequency counts, TF-IDF, TextRank, and word embedding approaches are employed to extract frequency count-based, keyword-based, sentiment, and readability indicators. The experimental results prove that financial domain sentiment lexicons, word embedding-based readability analysis approaches, and the basic textual features of Management Discussion and Analysis can be important elements of FDP. Moreover, the finding highlights the fact that incorporating financial and textual features can achieve optimal performance 4 or 5 years before the expected baseline year; applying the RF-GBDT combined model can also outperform other classifiers. This study makes an innovative contribution, since it expands the multiple text analysis method in the financial text mining field and provides new findings on how to provide early warning signs related to financial risk. The approaches developed in this research can serve as a template that can be used to resolve other financial issues.
引用
收藏
页数:24
相关论文
共 14 条
  • [1] Incorporating Multiple Textual Factors into Unbalanced Financial Distress Prediction: A Feature Selection Methods and Ensemble Classifiers Combined Approach
    Shixuan Li
    Wenxuan Shi
    International Journal of Computational Intelligence Systems, 16
  • [2] Incorporating textual and management factors into financial distress prediction: A comparative study of machine learning methods
    Tang, Xiaobo
    Li, Shixuan
    Tan, Mingliang
    Shi, Wenxuan
    JOURNAL OF FORECASTING, 2020, 39 (05) : 769 - 787
  • [3] Financial Distress Prediction Based on Ensemble Classifiers of Multiple Reductions
    Hui Xiao-feng
    Han Jian-guang
    Sun Jie
    2009 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING (16TH), VOLS I AND II, CONFERENCE PROCEEDINGS, 2009, : 1247 - +
  • [4] Novel feature selection methods to financial distress prediction
    Lin, Fengyi
    Liang, Deron
    Yeh, Ching-Chiang
    Huang, Jui-Chieh
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (05) : 2472 - 2483
  • [5] An Approach of Multiple Classifiers Ensemble Based on Feature Selection
    Chen, Bing
    Zhang, Hua-Xiang
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 390 - 394
  • [6] Combining feature selection, instance selection, and ensemble classification techniques for improved financial distress prediction
    Tsai, Chih-Fong
    Sue, Kuen-Liang
    Hu, Ya-Han
    Chiu, Andy
    JOURNAL OF BUSINESS RESEARCH, 2021, 130 : 200 - 209
  • [7] Prediction of Financial Distress: An Application to Chinese Listed Companies Using Ensemble Classifiers of Multiple Reductions
    Wu Bao-xiu
    2014 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING (ICMSE), 2014, : 1456 - 1461
  • [8] CUS-heterogeneous ensemble-based financial distress prediction for imbalanced dataset with ensemble feature selection
    Du, Xudong
    Li, Wei
    Ruan, Sumei
    Li, Li
    APPLIED SOFT COMPUTING, 2020, 97
  • [9] Financial distress prediction based on ensemble feature selection and improved stacking algorithm
    Wu, Chong
    Chen, Xiaofang
    Jiang, Yongjie
    KYBERNETES, 2024,
  • [10] Financial distress prediction with annual reports-based deep textual feature extraction: A hybrid approach
    Liu, Jiaming
    Jia, Ming
    INFORMATION SCIENCES, 2025, 686