21 entries in total
[11]
Sutton R., Precup D., Singh S. (1999) Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence 112, 181-211
[12]
Dietterich T.
[13]
Klein C.
[14]
Kim J.
[15]
Lee J.
[16]
Mataric M.
[17]
Pynadath D.
[18]
Tambe M.
[19]
Sutton R.
[20]
Precup D.