55 entries in total
- [1] Ramachandram D (2017) Deep multimodal learning: a survey on recent advances and trends. IEEE Signal Process. Mag. 34 96-108
- [2] Taylor GW (2015) Concurrent Markov decision processes for robot team learning. Eng. Appl. Artif. Intell. 39 223-234
- [3] Girard J (2013) A survey of point-based POMDP solvers. Auton. Agent. Multi-Agent Syst. 27 1-51
- [4] Emami MR (2014) Optimizing spatial and temporal reuse in wireless networks by decentralized partially observable Markov decision processes. IEEE Trans. Mob. Comput. 13 866-879
- [5] Shani G (2016) Multi-agent reinforcement learning as a rehearsal for decentralized planning. Neurocomputing 190 82-94
- [6] Pineau J (2015) Reinforcement learning of informed initial policies for decentralized planning. ACM Trans. Auton. Adapt. Syst. 9 1-32
- [7] Kaplow R (2012) Bayesian-game-based fuzzy reinforcement learning control for decentralized POMDPs. IEEE Trans. Comput. Intell. AI Games 4 309-328
- [8] Pajarinen J (2014) Scheduling sensors for monitoring sentient spaces using an approximate POMDP policy. Pervasive Mobile Comput. 10 83-103
- [9] Hottinen A (2017) Can bounded and self-interested agents be teammates? Application to planning in ad hoc teams. Auton. Agent. Multi-Agent Syst. 31 821-860
- [10] Peltonen J (2005) Cooperative information sharing to improve distributed learning in multi-agent systems. J. Artif. Intell. Res. 24 407-463