共 43 条
[1]
Dynamic Agent-based Bi-objective Robustness for Tardiness and Energy in a Dynamic Flexible Job Shop
[J].
FACTORIES OF THE FUTURE IN THE DIGITAL ENVIRONMENT,
2016, 57
:728-733
[2]
[Anonymous], 2018, P INT C LEARN REPR I
[3]
[Anonymous], 2020, BEST KNOWN LOWER UPP
[4]
[Anonymous], 2018, Reinforcement learning for solving the vehicle routing problem
[5]
Bello I., 2017, WORKSH TRACK ICLR
[8]
Dai HJ, 2018, Arxiv, DOI arXiv:1704.01665
[9]
Learning Heuristics for the TSP by Policy Gradient
[J].
INTEGRATION OF CONSTRAINT PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND OPERATIONS RESEARCH, CPAIOR 2018,
2018, 10848
:170-181