共 12 条
[11]
Torabi F., Warnell G., Stone P., Behavioral Cloning Fr Om Observation
[12]
Jeon W., Su C.-Y., Barde P., Doan T., Nowrouzezahrai D., Pineau J., Regularized Inverse Reinforcement Learnin G