A novel off policy Q(λ) algorithm based on linear function approximation

被引:0
作者
Fu, Qi-Ming [1 ]
Liu, Quan [1 ,2 ]
Wang, Hui [1 ]
Xiao, Fei [1 ]
Yu, Jun [1 ]
Li, Jiao [1 ]
机构
[1] Institute of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215006, China
[2] Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
来源
Jisuanji Xuebao/Chinese Journal of Computers | 2014年 / 37卷 / 03期
关键词
Compendex;
D O I
暂无
中图分类号
学科分类号
摘要
Reinforcement learning
引用
收藏
页码:677 / 686
相关论文
empty
未找到相关数据