A Labeling Method for Financial Time Series Prediction Based on Trends

被引:48
作者
Wu, Dingming [1 ]
Wang, Xiaolong [1 ]
Su, Jingyong [1 ]
Tang, Buzhou [1 ]
Wu, Shaocong [1 ]
机构
[1] Harbin Inst Technol, Coll Comp Sci & Technol, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
financial time series; stock prediction; machine learning; labeling method; deep learning; SUPPORT VECTOR MACHINES; TRAFFIC FLOW PREDICTION; NEURAL-NETWORKS; STOCK; MARKET; CLASSIFICATION; MODEL; REGRESSION; ALGORITHM; LSTM;
D O I
10.3390/e22101162
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Time series prediction has been widely applied to the finance industry in applications such as stock market price and commodity price forecasting. Machine learning methods have been widely used in financial time series prediction in recent years. How to label financial time series data to determine the prediction accuracy of machine learning models and subsequently determine final investment returns is a hot topic. Existing labeling methods of financial time series mainly label data by comparing the current data with those of a short time period in the future. However, financial time series data are typically non-linear with obvious short-term randomness. Therefore, these labeling methods have not captured the continuous trend features of financial time series data, leading to a difference between their labeling results and real market trends. In this paper, a new labeling method called "continuous trend labeling" is proposed to address the above problem. In the feature preprocessing stage, this paper proposed a new method that can avoid the problem of look-ahead bias in traditional data standardization or normalization processes. Then, a detailed logical explanation was given, the definition of continuous trend labeling was proposed and also an automatic labeling algorithm was given to extract the continuous trend features of financial time series data. Experiments on the Shanghai Composite Index and Shenzhen Component Index and some stocks of China showed that our labeling method is a much better state-of-the-art labeling method in terms of classification accuracy and some other classification evaluation metrics. The results of the paper also proved that deep learning models such as LSTM and GRU are more suitable for dealing with the prediction of financial time series data.
引用
收藏
页码:1 / 25
页数:27
相关论文
共 96 条
[1]   The financial market effects of international aviation disasters [J].
Akyildirim, Erdinc ;
Corbet, Shaen ;
Efthymiou, Marina ;
Guiomard, Cathal ;
O'Connell, John F. ;
Sensoy, Ahmet .
INTERNATIONAL REVIEW OF FINANCIAL ANALYSIS, 2020, 69
[2]  
[Anonymous], 2014, C EMPIRICAL METHODS, DOI 10.3115/v1/d14-1179.
[3]   Evolutionary fuzzification of RIPPER for regression: Case study of stock prediction [J].
Asadi, Shahrokh .
NEUROCOMPUTING, 2019, 331 :121-137
[4]   Development of stock market trend prediction system using multiple regression [J].
Asghar, Muhammad Zubair ;
Rahman, Fazal ;
Kundi, Fazal Masud ;
Ahmad, Shakeel .
COMPUTATIONAL AND MATHEMATICAL ORGANIZATION THEORY, 2019, 25 (03) :271-301
[5]   ENTROPY CORRELATION DISTANCE METHOD APPLIED TO STUDY CORRELATIONS BETWEEN THE GROSS DOMESTIC PRODUCT OF RICH COUNTRIES [J].
Ausloos, Marcel ;
Miskiewicz, Janusz .
INTERNATIONAL JOURNAL OF BIFURCATION AND CHAOS, 2010, 20 (02) :381-389
[6]   Cytochrome P450 oxidoreductase deficiency caused by R457H mutation in POR gene in Chinese: case report and literature review [J].
Bai, Yang ;
Li, Jinhui ;
Wang, Xiaoli .
JOURNAL OF OVARIAN RESEARCH, 2017, 10
[7]   Risk assessment by failure mode and effects analysis (FMEA) using an interval number based logistic regression model [J].
Bhattacharjee, Pushparenu ;
Dey, Vidyut ;
Mandal, U. K. .
SAFETY SCIENCE, 2020, 132
[8]   A random forest guided tour [J].
Biau, Gerard ;
Scornet, Erwan .
TEST, 2016, 25 (02) :197-227
[9]   The Dow Theory: William Peter Hamilton's track record reconsidered [J].
Brown, SJ ;
Goetzmann, WN ;
Kumar, A .
JOURNAL OF FINANCE, 1998, 53 (04) :1311-1333
[10]  
Caaron A., 2018, ENTROPY, V20, P323, DOI [10.3390/e20050323, DOI 10.3390/E20050323]