31 entries in total
[1]
[Anonymous], 2014, Markov decision processes: discrete stochastic dynamic programming
[2]
[Anonymous], 1996, Neuro-dynamic programming
[3]
[Anonymous], 2007, Advances in neural information processing systems
[4]
[Anonymous], 2015, Reinforcement Learning: An Introduction
[5]
[Anonymous], 2009, MARKOV CHAINS STOCHA
[6]
[Anonymous], 2002, Internat. Ser. Oper. Res. Management Sci.
[7]
Spectral Decomposition of Demand-Side Flexibility for Reliable Ancillary Services in a Smart Grid [J]. 2015 48th Hawaii International Conference on System Sciences (HICSS), 2015: 2700-2709
[8]
Bertsekas D. P., 1996, Stochastic optimal control: the discrete-time case
[9]
Boyd S., 2004, Convex Optimization
[10]
Optimal Control of Observable Continuous Time Markov Chains [J]. 47th IEEE Conference on Decision and Control, 2008 (CDC 2008), 2008: 4269-4274