49 entries in total
[11]
Hessel M., 2018, PROC AAAI C ARTIF, P1
[13]
Huang S., 2022, The 37 Implementation Details of Proximal Policy Optimization
[14]
Lan C.L., 2022, arXiv
[15]
Lillicrap T. P., 2019, Continuous control with deep reinforcement learning
[16]
Towards Unsupervised Deep Graph Structure Learning [J]. PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022: 1392-1403
[19]
Madjiheurem S., 2019, PROC 22 INT C A, P3391
[20]
Learning Scheduling Algorithms for Data Processing Clusters [J]. SIGCOMM '19 - PROCEEDINGS OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION, 2019: 270-288