共 31 条
[1]
Alibekov E(2018)Policy derivation methods for critic-only reinforcement learning in continuous spaces Eng Appl Artif Intell 69 178-187
[2]
Kubalík J(2019)Machine learning for 5g/b5g mobile and wireless communications: potential, limitations, and future directions IEEE Access 7 137184-137206
[3]
Babuska R(2018)Interpretable policies for reinforcement learning by genetic programming Eng Appl Artif Intell 76 158-169
[4]
Cayamcela MEM(2015)Human-level control through deep reinforcement learning Nature 518 529-66
[5]
Lee H(2017)Genetic programming for production scheduling: a survey with a unified framework Complex Intell Syst 3 41-1201
[6]
Lim W(2016)Value function discovery in markov decision processes with evolutionary algorithms IEEE Trans Syst Man Cybern Syst 46 1190-2830
[7]
Hein D(2011)Scikit-learn: machine learning in python J Mach Learn Res 12 2825-215
[8]
Udluft S(2019)Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead Nat Mach Intell 1 206-18
[9]
Runkler TA(2017)Mastering the game of go without human knowledge Nature 550 354-349
[10]
Mnih V(2018)An intelligent noninvasive model for coronary artery disease detection Complex Intell Syst 4 11-undefined