共 89 条
[1]
[Anonymous], 2002, Optimal Learning: Computational procedures for Bayes-adaptive Markov decision processes
[2]
[Anonymous], 2014, arXiv
[3]
[Anonymous], IEEE SPECTR
[4]
[Anonymous], 1998, REINFORCEMENT LEARNI
[7]
Baum E. B., 1987, PAPER PRESENTED NIPS
[8]
LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT
[J].
IEEE TRANSACTIONS ON NEURAL NETWORKS,
1994, 5 (02)
:157-166
[9]
Bengio Y., 2006, ADV NEURAL INFORM PR, V19
[10]
BENGIO Y, 2006, ADV NEURAL INFORM PR, V18, P107