A novel off policy Q(λ) algorithm based on linear function approximation

被引：0

作者：

Fu, Qi-Ming ^{[1
]}

Liu, Quan ^{[1
,2
]}

Wang, Hui ^{[1
]}

Xiao, Fei ^{[1
]}

Yu, Jun ^{[1
]}

Li, Jiao ^{[1
]}

机构：

[1] Institute of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215006, China

[2] Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China

来源：

Jisuanji Xuebao/Chinese Journal of Computers | 2014年 / 37卷 / 03期

关键词：

Compendex;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Reinforcement learning

引用

页码：677 / 686