Research on stock trend prediction method based on optimized random forest

被引:32
作者
Yin, Lili [1 ]
Li, Benling [1 ]
Li, Peng [1 ]
Zhang, Rubo [2 ]
机构
[1] Harbin Univ Sci & Technol, Sch Comp & Sci & Technol, Harbin, Peoples R China
[2] Dalian Nationalities Univ, Dept Comp Sci & Engn, Dalian, Peoples R China
基金
中国国家自然科学基金;
关键词
ensemble learning; finance; random forest; random search; technical indicator; PRICE INDEX; MACHINE; DIRECTION; CLASSIFIERS; MODEL;
D O I
10.1049/cit2.12067
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a complex hot problem in the financial field, stock trend forecasting uses a large amount of data and many related indicators; hence it is difficult to obtain sustainable and effective results only by relying on empirical analysis. Researchers in the field of machine learning have proved that random forest can form better judgements on this kind of problem, and it has an auxiliary role in the prediction of stock trend. This study uses historical trading data of four listed companies in the USA stock market, and the purpose of this study is to improve the performance of random forest model in medium- and long-term stock trend prediction. This study applies the exponential smoothing method to process the initial data, calculates the relevant technical indicators as the characteristics to be selected, and proposes the D-RF-RS method to optimize random forest. As the random forest is an ensemble learning model and is closely related to decision tree, D-RF-RS method uses a decision tree to screen the importance of features, and obtains the effective strong feature set of the model as input. Then, the parameter combination of the model is optimized through random parameter search. The experimental results show that the average accuracy of random forest is increased by 0.17 after the above process optimization, which is 0.18 higher than the average accuracy of light gradient boosting machine model. Combined with the performance of the ROC curve and Precision-Recall curve, the stability of the model is also guaranteed, which further demonstrates the advantages of random forest in medium- and long-term trend prediction of the stock market.
引用
收藏
页码:274 / 284
页数:11
相关论文
共 30 条
[1]   Light Gradient Boosting Machine for General Sentiment Classification on Short Texts: A Comparative Evaluation [J].
Alzamzami, Fatimah ;
Hoda, Mohamad ;
El Saddik, Abdulmotaleb .
IEEE ACCESS, 2020, 8 :101840-101858
[2]   River water quality index prediction and uncertainty analysis: A comparative study of machine learning models [J].
Asadollah, Seyed Babak Haji Seyed ;
Sharafati, Ahmad ;
Motta, Davide ;
Yaseen, Zaher Mundher .
JOURNAL OF ENVIRONMENTAL CHEMICAL ENGINEERING, 2021, 9 (01)
[3]   Evaluating multiple classifiers for stock price direction prediction [J].
Ballings, Michel ;
Van den Poel, Dirk ;
Hespeels, Nathalie ;
Gryp, Ruben .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (20) :7046-7056
[4]   Predicting the direction of stock market prices using tree-based classifiers [J].
Basak, Suryoday ;
Kar, Saibal ;
Saha, Snehanshu ;
Khaidem, Luckyson ;
Dey, Sudeepa Roy .
NORTH AMERICAN JOURNAL OF ECONOMICS AND FINANCE, 2019, 47 :552-567
[5]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[6]   Comparing Technical and Fundamental indicators in stock price forecasting [J].
Beyaz, Erhan ;
Tekiner, Firat ;
Zeng, Xiao-jun ;
Keane, John A. .
IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, :1607-1613
[7]   Remote Aircraft Target Recognition Method Based on Superpixel Segmentation and Image Reconstruction [J].
Chen, Yantong ;
Li, Yuyang ;
Wang, Junsheng .
MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
[8]   A new procedure in stock market forecasting based on fuzzy random auto-regression time series model [J].
Efendi, Riswan ;
Arbaiy, Nureize ;
Deris, Mustafa Mat .
INFORMATION SCIENCES, 2018, 441 :113-132
[9]   Earnings forecasting in a global stock selection model and efficient portfolio construction and management [J].
Guerard, John B., Jr. ;
Markowitz, Harry ;
Xu, GanLin .
INTERNATIONAL JOURNAL OF FORECASTING, 2015, 31 (02) :550-560
[10]   Diagnosis of Human Psychological Disorders using Supervised Learning and Nature-Inspired Computing Techniques: A Meta-Analysis [J].
Kaur, Prableen ;
Sharma, Manik .
JOURNAL OF MEDICAL SYSTEMS, 2019, 43 (07)