共 37 条
- [2] Beyer L., 2019, ARXIV
- [3] Hierarchical learning from human preferences and curiosity [J]. APPLIED INTELLIGENCE, 2022, 52 (07) : 7459 - 7479
- [5] Colas C, 2018, PR MACH LEARN RES, V80
- [6] De Ath G., 2021, ACM Transact. Evolut. Learn. Optimiz, V1, P1
- [7] gymlibrary, Gym documentation
- [8] MULTILAYER FEEDFORWARD NETWORKS ARE UNIVERSAL APPROXIMATORS [J]. NEURAL NETWORKS, 1989, 2 (05) : 359 - 366