共 58 条
[51]
Thrun S. B., 1992, Efficient Exploration in Reinforcement Learning
[52]
Use of Proximal Policy Optimization for the Joint Replenishment Problem
[J].
Vanvuchelen, Nathalie
;
Gijsbrechts, Joren
;
Boute, Robert
.
COMPUTERS IN INDUSTRY,
2020, 119

Vanvuchelen, Nathalie
论文数: 0 引用数: 0
h-index: 0
机构:
Katholieke Univ Leuven, Res Ctr Operat Management, Fac Business & Econ, Naamsestr 69,Box 3500, B-3000 Leuven, Belgium Katholieke Univ Leuven, Res Ctr Operat Management, Fac Business & Econ, Naamsestr 69,Box 3500, B-3000 Leuven, Belgium

Gijsbrechts, Joren
论文数: 0 引用数: 0
h-index: 0
机构:
Katholieke Univ Leuven, Res Ctr Operat Management, Fac Business & Econ, Naamsestr 69,Box 3500, B-3000 Leuven, Belgium Katholieke Univ Leuven, Res Ctr Operat Management, Fac Business & Econ, Naamsestr 69,Box 3500, B-3000 Leuven, Belgium

Boute, Robert
论文数: 0 引用数: 0
h-index: 0
机构:
Katholieke Univ Leuven, Res Ctr Operat Management, Fac Business & Econ, Naamsestr 69,Box 3500, B-3000 Leuven, Belgium
Vlerick Business Sch, Technol & Operat Management Area, Ghent, Belgium Katholieke Univ Leuven, Res Ctr Operat Management, Fac Business & Econ, Naamsestr 69,Box 3500, B-3000 Leuven, Belgium
[53]
Grandmaster level in StarCraft II using multi-agent reinforcement learning
[J].
Vinyals, Oriol
;
Babuschkin, Igor
;
Czarnecki, Wojciech M.
;
Mathieu, Michael
;
Dudzik, Andrew
;
Chung, Junyoung
;
Choi, David H.
;
Powell, Richard
;
Ewalds, Timo
;
Georgiev, Petko
;
Oh, Junhyuk
;
Horgan, Dan
;
Kroiss, Manuel
;
Danihelka, Ivo
;
Huang, Aja
;
Sifre, Laurent
;
Cai, Trevor
;
Agapiou, John P.
;
Jaderberg, Max
;
Vezhnevets, Alexander S.
;
Leblond, Remi
;
Pohlen, Tobias
;
Dalibard, Valentin
;
Budden, David
;
Sulsky, Yury
;
Molloy, James
;
Paine, Tom L.
;
Gulcehre, Caglar
;
Wang, Ziyu
;
Pfaff, Tobias
;
Wu, Yuhuai
;
Ring, Roman
;
Yogatama, Dani
;
Wunsch, Dario
;
McKinney, Katrina
;
Smith, Oliver
;
Schaul, Tom
;
Lillicrap, Timothy
;
Kavukcuoglu, Koray
;
Hassabis, Demis
;
Apps, Chris
;
Silver, David
.
NATURE,
2019, 575 (7782)
:350-+

Vinyals, Oriol
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Babuschkin, Igor
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Czarnecki, Wojciech M.
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Mathieu, Michael
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Dudzik, Andrew
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Chung, Junyoung
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Choi, David H.
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Powell, Richard
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Ewalds, Timo
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Georgiev, Petko
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Oh, Junhyuk
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Horgan, Dan
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Kroiss, Manuel
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Danihelka, Ivo
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Huang, Aja
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Sifre, Laurent
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Cai, Trevor
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Agapiou, John P.
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Jaderberg, Max
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Vezhnevets, Alexander S.
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Leblond, Remi
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Pohlen, Tobias
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Dalibard, Valentin
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Budden, David
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Sulsky, Yury
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Molloy, James
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Paine, Tom L.
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Gulcehre, Caglar
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Wang, Ziyu
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Pfaff, Tobias
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Wu, Yuhuai
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Ring, Roman
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Yogatama, Dani
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Wunsch, Dario
论文数: 0 引用数: 0
h-index: 0
机构:
Team Liquid, Utrecht, Netherlands DeepMind, London, England

McKinney, Katrina
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Smith, Oliver
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Schaul, Tom
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Lillicrap, Timothy
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Kavukcuoglu, Koray
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Hassabis, Demis
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Apps, Chris
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England

Silver, David
论文数: 0 引用数: 0
h-index: 0
机构:
DeepMind, London, England DeepMind, London, England
[54]
WILLIAMS RJ, 1992, MACH LEARN, V8, P229, DOI 10.1007/BF00992696
[55]
From Swarm Intelligence to Metaheuristics: Nature-Inspired Optimization Algorithms
[J].
Yang, Xin-She
;
Deb, Suash
;
Fong, Simon
;
He, Xingshi
;
Zhao, Yu-Xin
.
COMPUTER,
2016, 49 (09)
:52-59

Yang, Xin-She
论文数: 0 引用数: 0
h-index: 0
机构:
Middlesex Univ, Modeling & Optimizat, London N17 8HR, England Middlesex Univ, Modeling & Optimizat, London N17 8HR, England

Deb, Suash
论文数: 0 引用数: 0
h-index: 0
机构:
Stanford Univ, Comp Sci, Stanford, CA 94305 USA
Int Neural Network Soc, Stanford, CA USA Middlesex Univ, Modeling & Optimizat, London N17 8HR, England

论文数: 引用数:
h-index:
机构:

He, Xingshi
论文数: 0 引用数: 0
h-index: 0
机构:
Xian Polytech Univ, Coll Sci, Xian, Peoples R China Middlesex Univ, Modeling & Optimizat, London N17 8HR, England

Zhao, Yu-Xin
论文数: 0 引用数: 0
h-index: 0
机构:
Harbin Engn Univ, Control & Nav, Harbin, Peoples R China Middlesex Univ, Modeling & Optimizat, London N17 8HR, England
[56]
A Review of Deep Reinforcement Learning for Smart Building Energy Management
[J].
Yu, Liang
;
Qin, Shuqi
;
Zhang, Meng
;
Shen, Chao
;
Jiang, Tao
;
Guan, Xiaohong
.
IEEE INTERNET OF THINGS JOURNAL,
2021, 8 (15)
:12046-12063

Yu, Liang
论文数: 0 引用数: 0
h-index: 0
机构:
Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210003, Peoples R China
Nanjing Univ Posts & Telecommun, Coll Artificial Intelligence, Nanjing 210003, Peoples R China
Xi An Jiao Tong Univ, Fac Elect & Informat Engn, Xian 710049, Peoples R China Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210003, Peoples R China

Qin, Shuqi
论文数: 0 引用数: 0
h-index: 0
机构:
Nanjing Univ Posts & Telecommun, Coll Internet Things, Nanjing 210003, Peoples R China Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210003, Peoples R China

Zhang, Meng
论文数: 0 引用数: 0
h-index: 0
机构:
Xi An Jiao Tong Univ, Syst Engn Inst, Key Lab Intelligent Networks & Network Secur, Minist Educ, Xian 710049, Peoples R China Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210003, Peoples R China

Shen, Chao
论文数: 0 引用数: 0
h-index: 0
机构:
Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210003, Peoples R China
Xi An Jiao Tong Univ, Sch Cyber Sci & Engn, Xian 710049, Peoples R China Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210003, Peoples R China

Jiang, Tao
论文数: 0 引用数: 0
h-index: 0
机构:
Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210003, Peoples R China
Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan Natl Lab Optoelect, Wuhan 430074, Peoples R China Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210003, Peoples R China

Guan, Xiaohong
论文数: 0 引用数: 0
h-index: 0
机构:
Xi An Jiao Tong Univ, Syst Engn Inst, Key Lab Intelligent Networks & Network Secur, Minist Educ, Xian 710049, Peoples R China Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210003, Peoples R China
[57]
Neural Network-based control using Actor-Critic Reinforcement Learning and Grey Wolf Optimizer with experimental servo system validation
[J].
Zamfirache, Iuliu Alexandru
;
Precup, Radu-Emil
;
Roman, Raul-Cristian
;
Petriu, Emil M.
.
EXPERT SYSTEMS WITH APPLICATIONS,
2023, 225

Zamfirache, Iuliu Alexandru
论文数: 0 引用数: 0
h-index: 0
机构:
Politehn Univ Timisoara, Dept Automation & Appl Informat, Bd V Parvan 2, Timisoara 300223, Romania Politehn Univ Timisoara, Dept Automation & Appl Informat, Bd V Parvan 2, Timisoara 300223, Romania

Precup, Radu-Emil
论文数: 0 引用数: 0
h-index: 0
机构:
Politehn Univ Timisoara, Dept Automation & Appl Informat, Bd V Parvan 2, Timisoara 300223, Romania
Romanian Acad, Ctr Fundamental & Adv Tech Res, Timisoara Branch, Bd Mihai Viteazu 24, Timisoara 300223, Romania Politehn Univ Timisoara, Dept Automation & Appl Informat, Bd V Parvan 2, Timisoara 300223, Romania

Roman, Raul-Cristian
论文数: 0 引用数: 0
h-index: 0
机构:
Politehn Univ Timisoara, Dept Automation & Appl Informat, Bd V Parvan 2, Timisoara 300223, Romania Politehn Univ Timisoara, Dept Automation & Appl Informat, Bd V Parvan 2, Timisoara 300223, Romania

Petriu, Emil M.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Ottawa, Sch Elect Engn & Comp Sci, 800 King Edward, Ottawa, ON K1N 6N5, Canada Politehn Univ Timisoara, Dept Automation & Appl Informat, Bd V Parvan 2, Timisoara 300223, Romania
[58]
Policy Iteration Reinforcement Learning-based control using a Grey Wolf Optimizer algorithm
[J].
Zamfirache, Iuliu Alexandru
;
Precup, Radu-Emil
;
Roman, Raul-Cristian
;
Petriu, Emil M.
.
INFORMATION SCIENCES,
2022, 585
:162-175

Zamfirache, Iuliu Alexandru
论文数: 0 引用数: 0
h-index: 0
机构:
Politehn Univ Timisoara, Dept Automat & Appl Informat, Bd V Parvan 2, Timisoara 300223, Romania Politehn Univ Timisoara, Dept Automat & Appl Informat, Bd V Parvan 2, Timisoara 300223, Romania

Precup, Radu-Emil
论文数: 0 引用数: 0
h-index: 0
机构:
Politehn Univ Timisoara, Dept Automat & Appl Informat, Bd V Parvan 2, Timisoara 300223, Romania Politehn Univ Timisoara, Dept Automat & Appl Informat, Bd V Parvan 2, Timisoara 300223, Romania

Roman, Raul-Cristian
论文数: 0 引用数: 0
h-index: 0
机构:
Politehn Univ Timisoara, Dept Automat & Appl Informat, Bd V Parvan 2, Timisoara 300223, Romania Politehn Univ Timisoara, Dept Automat & Appl Informat, Bd V Parvan 2, Timisoara 300223, Romania

Petriu, Emil M.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Ottawa, Sch Elect Engn & Comp Sci, 800 King Edward, Ottawa, ON K1N 6N5, Canada Politehn Univ Timisoara, Dept Automat & Appl Informat, Bd V Parvan 2, Timisoara 300223, Romania