Memory-efficient Transformer-based network model for Traveling Salesman Problem

被引:18
|
作者
Yang, Hua [1 ]
Zhao, Minghao [1 ]
Yuan, Lei [2 ]
Yu, Yang [2 ]
Li, Zhenhua [1 ]
Gu, Ming [1 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China
[2] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
关键词
Combinatorial optimization; Deep reinforcement learning; TSP; Transformer; COMBINATORIAL OPTIMIZATION; ALGORITHMS;
D O I
10.1016/j.neunet.2023.02.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Combinatorial optimization problems such as Traveling Salesman Problem (TSP) have a wide range of real-world applications in transportation, logistics, manufacturing. It has always been a difficult problem to solve large-scale TSP problems quickly because of memory usage limitations. Recent research shows that the Transformer model is a promising approach. However, the Transformer has several severe problems that prevent it from quickly solving TSP combinatorial optimization problems, such as quadratic time complexity, especially quadratic space complexity, and the inherent limitations of the encoder and decoder itself. To address these issues, we developed a memory-efficient Transformer-based network model for TSP combinatorial optimization problems, termed Tspformer, with two distinctive characteristics: (1) a sampled scaled dot-product attention mechanism with O(L log(L)) (L is the length of input sequences) time and space complexity, which is the most different between our work and other works. (2) due to the reduced space complexity, GPU/CPU memory usage is significantly reduced. Extensive experiments demonstrate that Tspformer significantly outperforms existing methods and provides a new solution to the TSP combinatorial optimization problems. Our Pytorch code will be publicly available on GitHub https://github.com/yhnju/tspFormer.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页码:589 / 597
页数:9
相关论文
共 50 条
  • [1] An Efficient Hybrid Graph Network Model for Traveling Salesman Problem with Drone
    Yang Wang
    Xiaoxiao Yang
    Zhibin Chen
    Neural Processing Letters, 2023, 55 : 10353 - 10370
  • [2] An Efficient Hybrid Graph Network Model for Traveling Salesman Problem with Drone
    Wang, Yang
    Yang, Xiaoxiao
    Chen, Zhibin
    NEURAL PROCESSING LETTERS, 2023, 55 (08) : 10353 - 10370
  • [3] An Efficient Multivalued Hopfield Network for the Traveling Salesman Problem
    E. Mérida-Casermeiro
    G. Galán-Marín
    J. Muñoz-Pérez
    Neural Processing Letters, 2001, 14 : 203 - 216
  • [4] An efficient multivalued Hopfield network for the traveling salesman problem
    Mérida-Casermeiro, E
    Galán-Marín, G
    Muñoz-Pérez, J
    NEURAL PROCESSING LETTERS, 2001, 14 (03) : 203 - 216
  • [5] A NEW HEURISTIC FOR TRAVELING SALESMAN PROBLEM BASED ON LCO
    Yokoyama, Soichiro
    Suzuki, Ikuo
    Yamamoto, Masahito
    Furukawa, Masashi
    PROCEEDINGS OF THE ASME/ISCIE INTERNATIONAL SYMPOSIUM ON FLEXIBLE AUTOMATION, ISFA 2012, 2013, : 345 - 350
  • [6] An efficient heuristic algorithm for the bottleneck traveling salesman problem
    Ramakrishnan R.
    Sharma P.
    Punnen A.P.
    OPSEARCH, 2009, 46 (3) : 275 - 288
  • [7] Deep reinforcement learning combined with transformer to solve the traveling salesman problem
    Liu, Chang
    Feng, Xue-Feng
    Li, Feng
    Xian, Qing-Long
    Jia, Zhen-Hong
    Wang, Yu-Hang
    Du, Zong-Dong
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01)
  • [8] Optimization Models and Heuristic Method Based on Simulated Annealing Strategy for Traveling Salesman Problem
    Hao, Xu
    MECHANICAL ENGINEERING AND GREEN MANUFACTURING, PTS 1 AND 2, 2010, : 1180 - 1184
  • [9] EFFICIENT CONVOLUTION AND TRANSFORMER-BASED NETWORK FOR VIDEO FRAME INTERPOLATION
    Khalifeh, Issa
    Murn, Luka
    Mrak, Marta
    Izquierdo, Ebroul
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1050 - 1054
  • [10] A memetic neural network for the Euclidean traveling salesman problem
    Creput, Jean-Charles
    Koukam, Abderrafiaa
    NEUROCOMPUTING, 2009, 72 (4-6) : 1250 - 1264