Learning Optimal Q-Function Using Deep Boltzmann Machine for Reliable Trading of Cryptocurrency

被引:10
作者
Bu, Seok-Jun [1 ]
Cho, Sung-Bae [1 ]
机构
[1] Yonsei Univ, Dept Comp Sci, Seoul, South Korea
来源
INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2018, PT I | 2018年 / 11314卷
关键词
Deep reinforcement learning; Q-network; Deep Boltzmann Machine; Portfolio management; NETWORKS; STOCK;
D O I
10.1007/978-3-030-03493-1_49
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The explosive price volatility from the end of 2017 to January 2018 shows that bitcoin is a high risk asset. The deep reinforcement algorithm is straightforward idea for directly outputs the market management actions to achieve higher profit instead of higher price-prediction accuracy. However, existing deep reinforcement learning algorithms including Q-learning are also limited to problems caused by enormous searching space. We propose a combination of double Q-network and unsupervised pre-training using Deep Boltzmann Machine (DBM) to generate and enhance the optimal Q-function in cryptocurrency trading. We obtained the profit of 2,686% in simulation, whereas the best conventional model had that of 2,087% for the same period of test. In addition, our model records 24% of profit while market price significantly drops by -64%.
引用
收藏
页码:468 / 480
页数:13
相关论文
共 25 条
[1]  
Agarwal A., 2006, P 23 INT C MACHINE L, P9, DOI DOI 10.1145/1143844.1143846
[2]  
Amjad M., 2017, NIPS 2016 TIME SERIE, P1
[3]  
Bell T, 2016, BITCOIN TRADING AGEN
[4]   A Hybrid System of Deep Learning and Learning Classifier System for Database Intrusion Detection [J].
Bu, Seok-Jun ;
Cho, Sung-Bae .
HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, HAIS 2017, 2017, 10334 :615-625
[5]  
Cover T.M., 2011, The Kelly Capital Growth Investment Criterion: Theory and Practice, P181
[6]  
Das P., 2011, P INT C KNOWL DISC D, P1163, DOI DOI 10.1145/2020408.2020588
[7]  
Donahue J, 2015, PROC CVPR IEEE, P2625, DOI 10.1109/CVPR.2015.7298878
[8]   A fast learning algorithm for deep belief nets [J].
Hinton, Geoffrey E. ;
Osindero, Simon ;
Teh, Yee-Whye .
NEURAL COMPUTATION, 2006, 18 (07) :1527-1554
[9]   Forecasting stock market movement direction with support vector machine [J].
Huang, W ;
Nakamori, Y ;
Wang, SY .
COMPUTERS & OPERATIONS RESEARCH, 2005, 32 (10) :2513-2522
[10]  
Jiang ZY, 2017, PROCEEDINGS OF THE 2017 INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), P905, DOI 10.1109/IntelliSys.2017.8324237