A model fusion method based on multi-source heterogeneous data for stock trading signal prediction

Cited by: 2
Authors
Chen, Xi [1, 2]
Hirota, Kaoru [1 ]
Dai, Yaping [1 ]
Jia, Zhiyang [1 ]
Affiliations
[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China
[2] Fujian Normal Univ, Coll Phys & Energy, Fuzhou 350117, Peoples R China
Keywords
Stock trading signal prediction; Model fusion; Multi-source heterogeneous data; Sentiment analysis; PIECEWISE-LINEAR REPRESENTATION; SUPPORT VECTOR MACHINE; DIRECTION;
DOI
10.1007/s00500-022-07714-4
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In the prediction of turning points (TPs) of time series, the improved model that integrates piecewise linear representation with a weighted support vector machine (IPLR-WSVM) has achieved good performance. However, because it relies on a single data source and is limited by its algorithm, IPLR-WSVM has struggled with profitability. This paper proposes a model fusion method based on multi-source heterogeneous data and different learning algorithms for the prediction of TPs (MF-MSHD). The multi-source heterogeneous data include weighted unstructured and structured information at different granularities. RF, WSVM, BPNN, GBDT, and LSTM are selected as the learning algorithms. The meta-models are made as different from one another as possible by varying their inputs and algorithms, and a model fusion rule is designed to determine the final TPs. Moreover, the TPs are generated based on the characteristics of each individual stock. For sentiment analysis, a more accurate sentiment dictionary of stock market comments is established. In addition, fine-grained data are introduced to help determine the precise trading moment. The proposed method improves prediction accuracy and profitability, and also outperforms the composite indexes. Experimental results show that the profit rate of randomly selected stocks under MF-MSHD reaches 0.5172, whereas the highest value is 0.2841 for a single meta-model and 0.0992 for the buy-and-hold strategy. The other indicators, including accuracy, also improve. Compared with the increases of 0.1648, 0.4051, and 0.3397 in the Shanghai Composite Index, Shenzhen Composite Index, and CSI 300 Index, respectively, MF-MSHD shows higher profitability in stock trading signal prediction.
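The abstract does not spell out the fusion rule or the profit-rate formula, so the following is only a minimal sketch under stated assumptions: the five meta-models each emit a buy, hold, or sell signal per time step, the signals are fused by a simple vote threshold, and the profit rate is the return of a long-only strategy that invests all capital on a buy and liquidates on a sell. The names `fuse_signals`, `profit_rate`, and `min_votes` are hypothetical and are not taken from the paper.

```python
from collections import Counter

# Signal encoding is an assumption for illustration only.
BUY, HOLD, SELL = 1, 0, -1


def fuse_signals(predictions, min_votes=3):
    """Fuse per-model signals with a vote threshold (assumed rule, not the paper's).

    predictions maps a meta-model name to its list of signals, one per time step.
    A step is fused to BUY (or SELL) only if at least min_votes models agree
    and that side has more votes than the opposite side; otherwise HOLD.
    """
    series = list(predictions.values())
    if not series:
        raise ValueError("at least one meta-model is required")
    length = len(series[0])
    if any(len(s) != length for s in series):
        raise ValueError("all meta-models must cover the same time steps")

    fused = []
    for step in zip(*series):
        votes = Counter(step)
        buy, sell = votes[BUY], votes[SELL]
        if buy >= min_votes and buy > sell:
            fused.append(BUY)
        elif sell >= min_votes and sell > buy:
            fused.append(SELL)
        else:
            fused.append(HOLD)
    return fused


def profit_rate(prices, signals):
    """Return of a long-only, all-in strategy starting from one unit of capital.

    Transaction costs are ignored; an open position is valued at the last price.
    """
    cash, shares = 1.0, 0.0
    for price, signal in zip(prices, signals):
        if signal == BUY and shares == 0.0:
            shares, cash = cash / price, 0.0
        elif signal == SELL and shares > 0.0:
            cash, shares = shares * price, 0.0
    return cash + shares * prices[-1] - 1.0


if __name__ == "__main__":
    # Toy signals for the five algorithms named in the abstract (made-up values).
    toy = {
        "RF":   [BUY, HOLD, SELL, HOLD],
        "WSVM": [BUY, HOLD, SELL, BUY],
        "BPNN": [BUY, HOLD, HOLD, SELL],
        "GBDT": [HOLD, BUY, SELL, HOLD],
        "LSTM": [HOLD, HOLD, SELL, SELL],
    }
    fused = fuse_signals(toy)
    print(fused)
    print(profit_rate([10.0, 11.0, 12.0, 11.0], fused))
```

A vote threshold is used here only because it is the simplest fusion rule that still rewards disagreement among meta-models; the paper's actual rule may weight models or data sources differently.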
Pages: 6587-6611
Page count: 25