28 entries in total
[1]
Baudet G. M. (1978). Asynchronous iterative methods for multiprocessors. Journal of the Association for Computing Machinery 25, 226-244.
[2]
Bertsekas D. P. (1982). Distributed dynamic programming. IEEE Transactions on Automatic Control 27, 610-616.
[3]
Bertsekas D. P. (1983). Asynchronous distributed computation of fixed points. Mathematical Programming 27, 107-120.
[4]
Bertsekas D. P. (1991). An analysis of stochastic shortest path problems. Mathematics of Operations Research 16, 580-595.
[5]
Tsitsiklis J. N. (2009). Projected equation methods for approximate solution of large linear systems. Journal of Computational and Applied Mathematics 227, 27-50.
[6]
Bertsekas D. P. (2012). Q-learning and enhanced policy iteration in discounted dynamic programming. Mathematics of Operations Research 37, 66-94.
[7]
Yu H. (2012). (Approximate) iterated successive approximations algorithm for sequential decision processes. Annals of Operations Research 16, 207-239.
[8]
Bertsekas D. P. (2006). A generalized Kalman filter for fixed point approximation and efficient temporal-difference learning. Discrete Event Dynamic Systems 84, 23-29.
[9]
Yu H. (1962). Optimal pursuit strategies in discrete state probabilistic systems. Transactions of the ASME. Series D. Journal of Basic Engineering 17, 392-397.
[10]
Canbolat P. G. (1992). Stationary strategies in Borel dynamic programming. Mathematics of Operations Research 6, 1185-1201.