Entropy-based text feature engineering approach for forecasting financial liquidity changes

被引:0
作者
Riabykh, Aleksei [1 ]
Suleimanov, Ilias [2 ]
Nagovitcyn, Ilya [2 ]
Surzhko, Denis [2 ]
Konovalikhin, Maxim [2 ]
Koltsova, Olessia [1 ]
机构
[1] Natl Res Univ Higher Sch Econ, Lab Social & Cognit Informat, 55-2 Sedova St, St Petersburg, Russia
[2] VTB Bank, Dept Data Anal & Modeling, Moscow, Russia
关键词
Feature engineering; Financial time series; Natural language processing; Economic news; Entropy; Stock trade volumes; ATM cash withdrawals; TIME-SERIES; NEWS; PREDICTION; MODEL;
D O I
10.1140/epjds/s13688-025-00535-z
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Changes in individual and institutional financial behavior leading to shifts in liquidity flows often depend on events reflected in news. However, the task of establishing relationship between financial behavior and news remains challenging and understudied. We propose a news-based feature generation approach that allows accounting for news events in liquidity flow time-series predicting tasks, thereby improving the forecasting quality. These features are constructed as different types of entropies and calculated at different levels of text abstraction based on word counts, TF-IDF values, probabilistic topics, and contextual embeddings. We show that this feature engineering procedure is effective for predicting changes in two types of liquidity flows: stock market trading volume and the volume of ATM cash withdrawals. As the first type, we use our original collection of 651, 208 business news articles from a Russian news agency dating to 2013-2021 to predict abnormal jumps in the trade volume of 32 leading Russian companies. With our approach, 97% of them experience an increase in the quality of predicting the differences in daily trading volumes from their median values. For the ATM withdrawals task, we test the impact of economic news from three leading Russian media sources (N = 55, 712) on withdrawals from 100 ATMs located in Moscow. For 95% of them we improve the quality of prediction of year-to-year weekly withdrawal volume change. Additionally, we find that some news sources have a higher predictive power than others. The approach is potentially generalizable for other domains of financial behavior across the globe.
引用
收藏
页数:45
相关论文
共 50 条
[41]   Hierarchical Localization using Entropy-based Feature Map and Triangulation Techniques [J].
Rady, Sherine ;
Wagner, Achim ;
Badreddin, Essameddin .
2010 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
[42]   A tutorial review on entropy-based handcrafted feature extraction for information fusion [J].
Guido, Rodrigo Capobianco .
INFORMATION FUSION, 2018, 41 :161-175
[43]   Entropy-Based Surface Electromyogram Feature Extraction for Knee Osteoarthritis Classification [J].
Chen, Xin ;
Chen, Jun ;
Liang, Jie ;
Li, Yurong ;
Courtney, Carol Ann ;
Yang, Yuan .
IEEE ACCESS, 2019, 7 :164144-164151
[44]   Entropy-Based Feature Extraction for Electromagnetic Discharges Classification in High-Voltage Power Generation [J].
Mitiche, Imene ;
Morison, Gordon ;
Nesbitt, Alan ;
Stewart, Brian G. ;
Boreham, Philip .
ENTROPY, 2018, 20 (08)
[45]   My Approach = Your Apparatus? Entropy-Based Topic Modeling on Multiple Domain-Specific Text Collections [J].
Risch, Julian ;
Krestel, Ralf .
JCDL'18: PROCEEDINGS OF THE 18TH ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES, 2018, :283-292
[46]   A generalized financial time series forecasting model based on automatic feature engineering using genetic algorithms and support vector machine [J].
Ritzmann Junior, Norberto ;
Nievola, Julio Cesar .
2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
[47]   Research on Approach of Entropy-Based Wavelet Filtering for Nomadic Service [J].
Zhang, Degan ;
Zhang, Xiaoli ;
Wang, Yuanyuan ;
Li, Chao .
PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, :3831-3835
[48]   Conformance checking of partially matching processes: An entropy-based approach [J].
Polyvyanyy, Artem ;
Kalenkova, Anna .
INFORMATION SYSTEMS, 2022, 106
[49]   An entropy-based analysis of lane changing behavior: An interactive approach [J].
Kosun, Caglar ;
Ozdemir, Serhan .
TRAFFIC INJURY PREVENTION, 2017, 18 (04) :441-447
[50]   A Novel Backdoor Detection Approach Using Entropy-Based Measures [J].
Surendrababu, Hema Karnam ;
Nagaraj, Nithin .
IEEE ACCESS, 2024, 12 :114057-114072