共 16 条
[1]
Park S.G., Kim D.H., Autonomous flying of drone based on PPO reinforcement learning algorithm, Journal of Institute of Control, Robotics and Systems, 26, 11, pp. 955-963, (2020)
[2]
Park K.-W., Kim J.-H., Aircraft collision avoidance modeling and optimization using deep reinforcement learning, Journal of Institute of Control, Robotics and Systems (In Korean), 27, 9, pp. 652-659, (2021)
[3]
Mnih V., Kavukcuoglu K., Silver D., Graves A., Antonoglou I.D.M., Playing Atari with Deep Reinforcement Learning, Arxiv Preprint Arxiv, 1312, (2013)
[4]
Schulman J., Wolski F., Dhariwal P.A.O., Proximal Policy Optimization Algorithms, Arxiv Preprint Arxiv, 1707, (2017)
[5]
Haarnoja T., Zhou A., Hartikainen K., Tucker G., Ha S., Tan J., Kumar V., Zhu H., Gupta A.P.S., Soft Actor-Critic Algorithms and Applications, Arxiv Preprint Arxiv, 1812, (2018)
[6]
Demaine E., Hohenberger S., Liben-Nowell D., Tetris is hard, even to approximate, International Computing and Com-Binatorics Conference, Springer, pp. 351-363, (2003)
[7]
Thiery C., Scherrer B., Improvements on learning Tetris with cross entropy, Icga Journal, 32, 1, pp. 23-33, (2009)
[8]
Szita I., Lorincz A., Learning Tetris using the noisy cross entropy method, Neural Computation, 18, 12, pp. 2936-2941, (2006)
[9]
Gabillon V., Ghavamzadeh M., Scherrer B., Approximate dynamic programming finally performs well in the game of Tetris, Advances in Neural Information Processing Systems, 26, pp. 1754-1762, (2013)
[10]
, “The Game of Tetris in Machine learning,” Arxiv Preprint Arxiv, 1905, (2019)