Entropy-based text feature engineering approach for forecasting financial liquidity changes

Cited by: 0
Authors
Riabykh, Aleksei [1]
Suleimanov, Ilias [2]
Nagovitcyn, Ilya [2]
Surzhko, Denis [2]
Konovalikhin, Maxim [2]
Koltsova, Olessia [1]
Affiliations
[1] Natl Res Univ Higher Sch Econ, Lab Social & Cognit Informat, 55-2 Sedova St, St Petersburg, Russia
[2] VTB Bank, Dept Data Anal & Modeling, Moscow, Russia
Keywords
Feature engineering; Financial time series; Natural language processing; Economic news; Entropy; Stock trade volumes; ATM cash withdrawals; TIME-SERIES; NEWS; PREDICTION; MODEL
DOI
10.1140/epjds/s13688-025-00535-z
Chinese Library Classification number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
Changes in individual and institutional financial behavior that lead to shifts in liquidity flows often depend on events reflected in the news. However, the task of establishing the relationship between financial behavior and news remains challenging and understudied. We propose a news-based feature generation approach that accounts for news events in liquidity flow time-series prediction tasks, thereby improving forecasting quality. These features are constructed as different types of entropies and calculated at different levels of text abstraction based on word counts, TF-IDF values, probabilistic topics, and contextual embeddings. We show that this feature engineering procedure is effective for predicting changes in two types of liquidity flows: stock market trading volume and the volume of ATM cash withdrawals. For the first type, we use our original collection of 651,208 business news articles from a Russian news agency, dating from 2013 to 2021, to predict abnormal jumps in the trade volume of 32 leading Russian companies. With our approach, 97% of them show an increase in the quality of predicting the differences in daily trading volumes from their median values. For the ATM withdrawals task, we test the impact of economic news from three leading Russian media sources (N = 55,712) on withdrawals from 100 ATMs located in Moscow. For 95% of them, we improve the quality of prediction of the year-to-year change in weekly withdrawal volume. Additionally, we find that some news sources have higher predictive power than others. The approach is potentially generalizable to other domains of financial behavior across the globe.
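As a rough illustration of the simplest level described in the abstract (entropy computed from word counts), the following minimal Python sketch computes one Shannon-entropy feature per day of news. It is an assumption-laden sketch, not the authors' implementation: the record does not give the paper's exact entropy definitions, tokenization, or aggregation window, and the function names (`shannon_entropy`, `daily_word_entropy`) and the toy headlines are hypothetical.

```python
import math
from collections import Counter


def shannon_entropy(counts):
    """Shannon entropy in bits of the distribution implied by non-negative counts."""
    counts = list(counts)
    total = sum(counts)
    if total == 0:
        return 0.0
    # Zero counts contribute nothing to the sum, so they are skipped.
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


def daily_word_entropy(articles):
    """Entropy of the pooled word-count distribution over one day's articles.

    Whitespace tokenization and lowercasing are simplifying assumptions.
    """
    words = Counter()
    for text in articles:
        words.update(text.lower().split())
    return shannon_entropy(words.values())


# Hypothetical toy data: day -> list of article texts.
news_by_day = {
    "2021-03-01": ["bank raises rates", "bank raises rates again"],
    "2021-03-02": ["oil prices fall", "central bank holds key rate"],
}

# One entropy feature per day, ready to be joined to a liquidity time series.
features = {day: daily_word_entropy(texts) for day, texts in news_by_day.items()}
```

In this toy data the second day has more varied vocabulary, so its entropy is higher than the first day's. The same `shannon_entropy` helper could in principle be applied to other non-negative weight vectors, such as TF-IDF values or topic probabilities, though how the paper does this is not stated in this record.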
Pages: 45