5 entries in total
- [1] Bellman R., 1957, DYNAMIC PROGR
- [2] A comprehensive survey of multiagent reinforcement learning [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02): 156 - 172
- [3] Bandit based Monte-Carlo planning [J]. MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 282 - 293
- [4] Towards automated incident handling: How to select an appropriate response against a network-based attack? [J]. 2015 NINTH INTERNATIONAL CONFERENCE ON IT SECURITY INCIDENT MANAGEMENT & IT FORENSICS (IMF), 2015: 51 - 67
- [5] Shameli-Sendi A, 2015, J NETWORK COMPUTER A