Forecasting daily stock trend using multi-filter feature selection and deep learning

被引:61
作者
Ul Haq, Anwar [1 ]
Zeb, Adnan [1 ]
Lei, Zhenfeng [1 ]
Zhang, Defu [1 ]
机构
[1] Xiamen Univ, Sch Informat, Xiamen 361005, Fujian, Peoples R China
关键词
Stock trend prediction; Feature selection; Deep learning; Machine learning; FUZZY TIME-SERIES; MODEL;
D O I
10.1016/j.eswa.2020.114444
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stock market forecasting has attracted significant attention mainly due to the potential monetary benefits. Predicting these markets is a challenging task due to numerous interrelated factors, and needs a complete and efficient feature selection process to identify the most informative factors. As a time series problem, stock price movements are also dependent on movements on its previous trading days. Feature selection techniques have been widely applied in stock forecasting, but existing approaches usually use a single feature selection technique, which may overlook some important assumptions about the underlying regression function linking the input and output variables. In this study, we combine features selected by multiple feature selection techniques to generate an optimal feature subset and then use a deep generative model to predict future price movements. First, we compute an extended set of forty-four technical indicators from daily stock data of eighty-eight stocks and then compute their importance by independently training logistic regression model, support vector machine and random forests. Based on a prespecified threshold, the lowest ranked features are dropped and the rest are grouped into clusters. The variable importance measure is reused to select the most important feature from each cluster to generate the final subset. The input is then fed to a deep generative model comprising of a market signal extractor and an attention mechanism. The market signal extractor recurrently decodes market movement from the latent variables to deal with stochastic nature of the stock data and the attention mechanism discriminates between predictive dependencies of different temporal auxiliary outputs. The results demonstrate that combining features selected by multiple feature selection approaches and using them as input into a deep generative model outperforms state-of-the-art approaches.
引用
收藏
页数:8
相关论文
共 45 条
[1]  
Adebiyi A. A., 2014, J APPL MATH
[2]  
[Anonymous], 2007, INT J COMPUTER ELECT, DOI 10.5281/zenodo.1333234
[3]  
[Anonymous], 2021, EXPERT SYST APPL, V168
[4]   Surveying stock market forecasting techniques - Part II: Soft computing methods [J].
Atsalakis, George S. ;
Valavanis, Kimon P. .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) :5932-5941
[5]   GENERALIZED AUTOREGRESSIVE CONDITIONAL HETEROSKEDASTICITY [J].
BOLLERSLEV, T .
JOURNAL OF ECONOMETRICS, 1986, 31 (03) :307-327
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]  
Brown R.G, 2004, Smoothing, forecasting and prediction of discrete time series
[8]   Stock market movement forecast: A Systematic review [J].
Bustos, O. ;
Pomares-Quimbaya, A. .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 156
[9]   A novel stock forecasting model based on fuzzy time series and genetic algorithm [J].
Cai, Qisen ;
Zhang, Defu ;
Wu, Bo ;
Leung, Stehpen C. H. .
2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2013, 18 :1155-1162
[10]  
Cervello-Royo R., 2020, Financ. Mark. Val., V6, P37, DOI 10.46503/NLUF8557