Text-based soybean futures price forecasting: A two-stage deep learning approach

被引:14
作者
An, Wuyue [1 ]
Wang, Lin [1 ]
Zeng, Yu-Rong [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Management, Wuhan, Hubei, Peoples R China
[2] Hubei Univ Econ, Sch Informat Engn, Wuhan 430205, Hubei, Peoples R China
关键词
soybean futures price forecast; stepwise regression; text; two-stage LSTM; SENTIMENT ANALYSIS; COMMODITY-MARKETS; TIME-SERIES; DISCOVERY; DYNAMICS; NETWORK; MODEL;
D O I
10.1002/for.2909
中图分类号
F [经济];
学科分类号
02 ;
摘要
This paper investigates the soybean futures price prediction problem from a new perspective and proposes an effective prediction model named Two-Stage Hybrid Long Short-Term Memory (TSH-LSTM) by using text data from social media. First, the unstructured text is transformed into structured data by sentiment analysis and text classification methods. The improved sentiment score is computed by combining the degree centrality of sentiment words based on the sentiment dictionary method, and the characteristics of price fluctuations in texts are learned through the text Recurrent Convolutional Neural Networks. Second, the significant relationship between social media features and soybean futures price is assessed through stepwise regression, and the results of such an assessment are used as a basis for the identification of significant factors as input variables of the prediction model. Finally, the TSH-LSTM prediction model is designed, and the final prediction result is acquired through the combination of prediction results of each stage using the error reciprocal method. The empirical results indicate that the incorporation of the social media text feature helps improve forecasting performances. Specifically, the proposed TSH-LSTM is more accurate than univariate LSTM, multivariate LSTM, and eXtreme Gradient Boosting.
引用
收藏
页码:312 / 330
页数:19
相关论文
共 46 条
[1]   Price discovery dynamics in European agricultural markets [J].
Adaemmer, Philipp ;
Bohl, Martin T. .
JOURNAL OF FUTURES MARKETS, 2018, 38 (05) :549-562
[2]   Forecasting food prices: The case of corn, soybeans and wheat [J].
Ahumada, H. ;
Cornejo, M. .
INTERNATIONAL JOURNAL OF FORECASTING, 2016, 32 (03) :838-848
[3]   Out-of-sample testing price discovery in commodity markets: the case of soybeans [J].
Ahumada, Hildegart ;
Cornejo, Magdalena .
AGRICULTURAL ECONOMICS, 2016, 47 (06) :709-718
[4]   Modelling the volatility of TOCOM energy futures: A regime switching realised volatility approach [J].
Alizadeh, Amir H. ;
Huang, Chih-Yueh ;
Marsh, Ian W. .
ENERGY ECONOMICS, 2021, 93
[5]   An empirical study on the regulated Chinese agricultural commodity futures market based on skew Ornstein-Uhlenbeck model [J].
Bai, Yizhou ;
Xue, Cheng .
RESEARCH IN INTERNATIONAL BUSINESS AND FINANCE, 2021, 57
[6]   Volatility forecasting in commodity markets using macro uncertainty [J].
Bakas, Dimitrios ;
Triantafyllou, Athanasios .
ENERGY ECONOMICS, 2019, 81 :79-94
[7]   CROWD-SQUARED: AMPLIFYING THE PREDICTIVE POWER OF SEARCH TREND DATA [J].
Brynjolfsson, Erik ;
Geva, Tomer ;
Reichman, Shachar .
MIS QUARTERLY, 2016, 40 (04) :941-+
[8]   Every tweet counts? How sentiment analysis of social media can improve our knowledge of citizens' political preferences with an application to Italy and France [J].
Ceron, Andrea ;
Curini, Luigi ;
Iacus, Stefano M. ;
Porro, Giuseppe .
NEW MEDIA & SOCIETY, 2014, 16 (02) :340-358
[9]   A multi-source heterogeneous data analytic method for future price fluctuation prediction [J].
Chai, Lei ;
Xu, Hongfeng ;
Luo, Zhiming ;
Li, Shaozi .
NEUROCOMPUTING, 2020, 418 :11-20
[10]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794