Feature selection with annealing for forecasting financial time series

被引:0
|
作者
Pabuccu, Hakan [1 ]
Barbu, Adrian [2 ]
机构
[1] Bayburt Univ, Dept Business, TR-69000 Bayburt, Turkiye
[2] Florida State Univ, Stat Dept, Tallahassee, FL 32306 USA
关键词
Financial time-series forecasting; Feature selection; Machine learning; Cryptocurrency; Stock market; Return forecasting; PREDICTING STOCK; NEURAL-NETWORK; RATIOS; MODEL; DIRECTION; FUSION;
D O I
10.1186/s40854-024-00617-3
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Stock market and cryptocurrency forecasting is very important to investors as they aspire to achieve even the slightest improvement to their buy-or-hold strategies so that they may increase profitability. However, obtaining accurate and reliable predictions is challenging, noting that accuracy does not equate to reliability, especially when financial time-series forecasting is applied owing to its complex and chaotic tendencies. To mitigate this complexity, this study provides a comprehensive method for forecasting financial time series based on tactical input-output feature mapping techniques using machine learning (ML) models. During the prediction process, selecting the relevant indicators is vital to obtaining the desired results. In the financial field, limited attention has been paid to this problem with ML solutions. We investigate the use of feature selection with annealing (FSA) for the first time in this field, and we apply the least absolute shrinkage and selection operator (Lasso) method to select the features from more than 1000 candidates obtained from 26 technical classifiers with different periods and lags. Boruta (BOR) feature selection, a wrapper method, is used as a baseline for comparison. Logistic regression (LR), extreme gradient boosting (XGBoost), and long short-term memory are then applied to the selected features for forecasting purposes using 10 different financial datasets containing cryptocurrencies and stocks. The dependent variables consisted of daily logarithmic returns and trends. The mean-squared error for regression, area under the receiver operating characteristic curve, and classification accuracy were used to evaluate model performance, and the statistical significance of the forecasting results was tested using paired t-tests. Experiments indicate that the FSA algorithm increased the performance of ML models, regardless of problem type. The FSA hybrid models showed better performance and outperformed the other BOR models on seven of the 10 datasets for regression and classification. FSA-based models also outperformed Lasso-based models on six of the 10 datasets for regression and four of the 10 datasets for classification. None of the hybrid BOR models outperformed the hybrid FSA models. Lasso-based models, excluding the LR type, were comparable to the best models for six of the 10 datasets for classification. Detailed experimental analysis indicates that the proposed methodology can forecast returns and their movements efficiently and accurately, providing the field with a useful tool for investors.
引用
收藏
页数:26
相关论文
共 50 条
  • [41] Financial time series forecasting model based on CEEMDAN and LSTM
    Cao, Jian
    Li, Zhi
    Li, Jian
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 519 : 127 - 139
  • [42] Time series forecasting based on wavelet decomposition and feature extraction
    Tianhong Liu
    Haikun Wei
    Chi Zhang
    Kanjian Zhang
    Neural Computing and Applications, 2017, 28 : 183 - 195
  • [43] An End-to-End Trainable Feature Selection-Forecasting Architecture Targeted at the Internet of Things
    Nakip, Mert
    Karakayali, Kubilay
    Guzelis, Cuneyt
    Rodoplu, Volkan
    IEEE ACCESS, 2021, 9 : 104011 - 104028
  • [44] Time series forecasting based on wavelet decomposition and feature extraction
    Liu, Tianhong
    Wei, Haikun
    Zhang, Chi
    Zhang, Kanjian
    NEURAL COMPUTING & APPLICATIONS, 2017, 28 : S183 - S195
  • [45] Regularized least squares fuzzy support vector regression for financial time series forecasting
    Khemchandani, Reshma
    Jayadeva
    Chandra, Suresh
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (01) : 132 - 138
  • [46] Predictive Patterns and Market Efficiency: A Deep Learning Approach to Financial Time Series Forecasting
    Vukovic, Darko B.
    Radenkovic, Sonja D.
    Simeunovic, Ivana
    Zinovev, Vyacheslav
    Radovanovic, Milan
    MATHEMATICS, 2024, 12 (19)
  • [47] Lag-Dependent Regularization for MLPs Applied to Financial Time Series Forecasting Tasks
    Skabar, Andrew
    COMPUTATIONAL SCIENCE - ICCS 2009, 2009, 5545 : 515 - 523
  • [48] A Lightweight and Efficient GA-Based Model-Agnostic Feature Selection Scheme for Time Series Forecasting
    Minh Hieu Nguyen
    Viet Huy Nguyen
    Thanh Trung Huynh
    Thanh Hung Nguyen
    Quoc Viet Hung Nguyen
    Phi Le Nguyen
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, PT II, 2022, 13758 : 26 - 39
  • [49] Evolutionary Regressor Selection in ARIMA Model for Stock Price Time Series Forecasting
    Stoean, Ruxandra
    Stoean, Catalin
    Sandita, Adrian
    INTELLIGENT DECISION TECHNOLOGIES 2017, KES-IDT 2017, PT II, 2018, 73 : 117 - 126
  • [50] Chaotic Time Series Prediction with Feature Selection Evolution
    Landassuri-Moreno, V.
    Raymundo Marcial-Romero, J.
    Montes-Venegas, A.
    Ramos, Marco A.
    2011 IEEE ELECTRONICS, ROBOTICS AND AUTOMOTIVE MECHANICS CONFERENCE (CERMA 2011), 2011, : 71 - 76