共 72 条
- [1] Kurinov I(2020)Automated excavator based on reinforcement learning and multibody system dynamics IEEE Access 8 213998-214006
- [2] Orzechowski G(2021)Ant-td: ant colony optimization plus temporal difference reinforcement learning for multi-label feature selection Swarm Evol. Comput. 64 100892-359
- [3] Hämäläinen P(2017)Mastering the game of Go without human knowledge Nature 550 354-489
- [4] Mikkola A(2016)Mastering the game of Go with deep neural networks and tree Nature 529 484-295
- [5] Paniri M(2018)Hierarchical reinforcement learning with monte carlo tree search in computer fighting game IEEE Trans. Games 11 290-2562
- [6] Dowlatshahi MB(2023)Monte carlo tree search: a review of recent modifications and applications Artif. Intell. Rev. 56 2497-1545
- [7] Nezamabadi-pour H(2022)Optimal state space reconstruction via monte carlo decision tree search Nonlinear Dyn. 108 1525-702
- [8] Silver D(2022)Data-driven uncertainty quantification in computational human head models Comput. Meth. Appl. Mech. Eng. 398 115108-62
- [9] Silver D(2017)Combinatorial multi-armed bandits for real-time strategy games J. Artif. Intell. Res. 58 665-223
- [10] Pinto IP(2016)Adaptive playouts for online learning of policies during monte carlo tree search Theoret. Comput. Sci. 644 53-454