共 40 条
[2]
Akrour R, 2014, PR MACH LEARN RES, V32, P1503
[3]
Akrour R, 2011, LECT NOTES ARTIF INT, V6911, P12, DOI 10.1007/978-3-642-23780-5_11
[4]
[Anonymous], 2013, Policy shaping: Integrating human feedback with reinforcement learning
[5]
Arumugam D., 2019, ABS190204257 CORR
[6]
Biyik E., 2020, ROBOT SCI SYST
[7]
Biyik E, 2019, PR MACH LEARN RES, V100
[8]
Blumberg B, 2002, ACM T GRAPHIC, V21, P417, DOI 10.1145/566570.566597
[10]
Cederborg T, 2015, PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), P3366