共 36 条
- [2] [Anonymous], 2014, P INT C INT C MACH L
- [3] Bertsekas Dimitri P, 2011, Dynamic programming and optimal control, VII
- [4] Busoniu Lucian, 2017, Reinforcement Learning and Dynamic Programming Using Function Approximators
- [6] Reinforcement Learning-Based Multiaccess Control and Battery Prediction With Energy Harvesting in IoT Systems [J]. IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (02): : 2009 - 2020