Entropy-based text feature engineering approach for forecasting financial liquidity changes

被引:0
|
作者
Riabykh, Aleksei [1 ]
Suleimanov, Ilias [2 ]
Nagovitcyn, Ilya [2 ]
Surzhko, Denis [2 ]
Konovalikhin, Maxim [2 ]
Koltsova, Olessia [1 ]
机构
[1] Natl Res Univ Higher Sch Econ, Lab Social & Cognit Informat, 55-2 Sedova St, St Petersburg, Russia
[2] VTB Bank, Dept Data Anal & Modeling, Moscow, Russia
关键词
Feature engineering; Financial time series; Natural language processing; Economic news; Entropy; Stock trade volumes; ATM cash withdrawals; TIME-SERIES; NEWS; PREDICTION; MODEL;
D O I
10.1140/epjds/s13688-025-00535-z
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Changes in individual and institutional financial behavior leading to shifts in liquidity flows often depend on events reflected in news. However, the task of establishing relationship between financial behavior and news remains challenging and understudied. We propose a news-based feature generation approach that allows accounting for news events in liquidity flow time-series predicting tasks, thereby improving the forecasting quality. These features are constructed as different types of entropies and calculated at different levels of text abstraction based on word counts, TF-IDF values, probabilistic topics, and contextual embeddings. We show that this feature engineering procedure is effective for predicting changes in two types of liquidity flows: stock market trading volume and the volume of ATM cash withdrawals. As the first type, we use our original collection of 651, 208 business news articles from a Russian news agency dating to 2013-2021 to predict abnormal jumps in the trade volume of 32 leading Russian companies. With our approach, 97% of them experience an increase in the quality of predicting the differences in daily trading volumes from their median values. For the ATM withdrawals task, we test the impact of economic news from three leading Russian media sources (N = 55, 712) on withdrawals from 100 ATMs located in Moscow. For 95% of them we improve the quality of prediction of year-to-year weekly withdrawal volume change. Additionally, we find that some news sources have a higher predictive power than others. The approach is potentially generalizable for other domains of financial behavior across the globe.
引用
收藏
页数:45
相关论文
共 50 条
  • [1] A relative decision entropy-based feature selection approach
    Jiang, Feng
    Sui, Yuefei
    Zhou, Lin
    PATTERN RECOGNITION, 2015, 48 (07) : 2151 - 2163
  • [2] A New LSA and Entropy-Based Approach for Automatic Text Document Summarization
    Yadav, Chandra
    Sharan, Aditi
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2018, 14 (04) : 1 - 32
  • [3] Entropy-Based Approach for Enabling Text Line Segmentation in Handwritten Documents
    Sindhushree, G. S.
    Amarnath, R.
    Nagabhushan, P.
    DATA ANALYTICS AND LEARNING, 2019, 43 : 169 - 184
  • [4] Entropy-Based Watermarking Approach for Sensitive Tamper Detection of Arabic Text
    Al-Wesabi, Fahd N.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 67 (03): : 3635 - 3648
  • [5] The Entropy-Based Time Domain Feature Extraction for Online Concept Drift Detection
    Ding, Fengqian
    Luo, Chao
    ENTROPY, 2019, 21 (12)
  • [6] Complexity of grammatical metaphor: an entropy-based approach
    Zhou, Jiangping
    SEMIOTICA, 2023, 2023 (252) : 173 - 185
  • [7] Entropy-Based Behavioural Efficiency of the Financial Market
    Dinga, Emil
    Oprean-Stan, Camelia
    Tanasescu, Cristina-Roxana
    Bratian, Vasile
    Ionescu, Gabriela-Mariana
    ENTROPY, 2021, 23 (11)
  • [8] Entropy-Based Feature Analysis for Speech Recognition
    Setiawan, Panji
    Hoege, Harald
    Fingscheidt, Tim
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2927 - +
  • [9] Who dominate the information flowing between innovative and traditional financial assets? A multiscale entropy-based approach ☆
    Zhou, Yang
    Xie, Chi
    Wang, Gang-Jin
    Gong, Jue
    Li, Zhao-Chen
    Zhu, You
    INTERNATIONAL REVIEW OF ECONOMICS & FINANCE, 2024, 93 : 329 - 358
  • [10] A novel text mining approach to financial time series forecasting
    Wang, Baohua
    Huang, Hejiao
    Wang, Xiaolong
    NEUROCOMPUTING, 2012, 83 : 136 - 145