Supplementing recurrent neural networks with annealing to solve combinatorial optimization problems

Cited by: 6
Authors
Khandoker, Shoummo Ahsan [1]
Abedin, Jawaril Munshad [1]
Hibat-Allah, Mohamed [2, 3]
Affiliations
[1] BRAC Univ, Dept Comp Sci, Dhaka, Bangladesh
[2] Univ Waterloo, Vector Inst Artificial Intelligence, Dept Phys & Astron, Waterloo, ON, Canada
[3] MaRS Ctr, Vector Inst, Toronto, ON M5G 1M1, Canada
Keywords
optimization problems; annealing; recurrent neural networks; RNNs; machine learning; statistical physics;
DOI
10.1088/2632-2153/acb895
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Combinatorial optimization problems can be solved by heuristic algorithms such as simulated annealing (SA), which aims to find the optimal solution within a large search space through thermal fluctuations. SA generates new solutions through Markov-chain Monte Carlo techniques, which can lead to severe limitations such as slow convergence and a tendency to remain within the same local search space at low temperatures. To overcome these shortcomings, we use the variational classical annealing (VCA) framework, which combines autoregressive recurrent neural networks (RNNs) with traditional annealing to sample uncorrelated solutions. In this paper, we demonstrate the potential of VCA as an approach to solving real-world optimization problems. We compare VCA's performance with that of SA on three popular optimization problems: the maximum cut problem (Max-Cut), the nurse scheduling problem (NSP), and the traveling salesman problem (TSP). For all three problems, we find that VCA outperforms SA on average in the asymptotic limit by one or more orders of magnitude in terms of relative error. Interestingly, we reach large system sizes of up to 256 cities for the TSP. We also conclude that, in the best-case scenario, VCA can serve as a strong alternative when SA fails to find the optimal solution.
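
For readers unfamiliar with the SA baseline mentioned in the abstract, the following is a minimal sketch of single-flip Metropolis simulated annealing applied to Max-Cut. It is not the authors' implementation: the function names, the geometric cooling schedule, the default parameters, and the definition of relative error used here are all assumptions made for illustration. Because each update changes only one variable, successive samples are strongly correlated, which is the limitation the abstract attributes to Markov-chain Monte Carlo at low temperature.

```python
import math
import random


def cut_value(edges, spins):
    """Total weight of edges whose endpoints lie on opposite sides of the cut."""
    return sum(w for i, j, w in edges if spins[i] != spins[j])


def relative_error(found, optimal):
    """Assumed definition: |optimal - found| / |optimal|."""
    return abs(optimal - found) / abs(optimal)


def simulated_annealing_maxcut(n, edges, t_start=2.0, t_end=0.01,
                               sweeps=200, seed=0):
    """Hypothetical SA baseline; edges is a list of (i, j, weight) tuples."""
    rng = random.Random(seed)

    # Adjacency list so the effect of one flip is computed locally.
    neighbors = [[] for _ in range(n)]
    for i, j, w in edges:
        neighbors[i].append((j, w))
        neighbors[j].append((i, w))

    spins = [rng.choice((-1, 1)) for _ in range(n)]
    current = cut_value(edges, spins)
    best, best_spins = current, spins[:]

    for step in range(sweeps):
        # Geometric cooling from t_start to t_end (an assumed schedule).
        frac = step / max(sweeps - 1, 1)
        temperature = t_start * (t_end / t_start) ** frac

        for _ in range(n):
            i = rng.randrange(n)
            # Flipping node i cuts its currently uncut edges and
            # uncuts its currently cut edges.
            gain = sum(w if spins[j] == spins[i] else -w
                       for j, w in neighbors[i])
            # Metropolis rule: always accept improvements, accept a
            # worsening move with probability exp(gain / T).
            if gain >= 0 or rng.random() < math.exp(gain / temperature):
                spins[i] = -spins[i]
                current += gain
                if current > best:
                    best, best_spins = current, spins[:]

    return best, best_spins


if __name__ == "__main__":
    # Unit-weight 4-cycle, whose maximum cut is 4.
    ring = [(0, 1, 1), (1, 2, 1), (2, 3, 1), (3, 0, 1)]
    value, assignment = simulated_annealing_maxcut(4, ring)
    print(value, assignment, relative_error(value, 4))
```

VCA, as described in the abstract, replaces the single-flip Markov chain with samples drawn from an autoregressive RNN, so a sketch like this one only illustrates the baseline it is compared against.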
Pages: 12