Memory-efficient Transformer-based network model for Traveling Salesman Problem

被引:18
|
作者
Yang, Hua [1 ]
Zhao, Minghao [1 ]
Yuan, Lei [2 ]
Yu, Yang [2 ]
Li, Zhenhua [1 ]
Gu, Ming [1 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China
[2] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
关键词
Combinatorial optimization; Deep reinforcement learning; TSP; Transformer; COMBINATORIAL OPTIMIZATION; ALGORITHMS;
D O I
10.1016/j.neunet.2023.02.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Combinatorial optimization problems such as Traveling Salesman Problem (TSP) have a wide range of real-world applications in transportation, logistics, manufacturing. It has always been a difficult problem to solve large-scale TSP problems quickly because of memory usage limitations. Recent research shows that the Transformer model is a promising approach. However, the Transformer has several severe problems that prevent it from quickly solving TSP combinatorial optimization problems, such as quadratic time complexity, especially quadratic space complexity, and the inherent limitations of the encoder and decoder itself. To address these issues, we developed a memory-efficient Transformer-based network model for TSP combinatorial optimization problems, termed Tspformer, with two distinctive characteristics: (1) a sampled scaled dot-product attention mechanism with O(L log(L)) (L is the length of input sequences) time and space complexity, which is the most different between our work and other works. (2) due to the reduced space complexity, GPU/CPU memory usage is significantly reduced. Extensive experiments demonstrate that Tspformer significantly outperforms existing methods and provides a new solution to the TSP combinatorial optimization problems. Our Pytorch code will be publicly available on GitHub https://github.com/yhnju/tspFormer.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页码:589 / 597
页数:9
相关论文
共 50 条
  • [31] Hysteretic noisy frequency conversion sinusoidal chaotic neural network for traveling salesman problem
    Junfei Qiao
    Zhiqiang Hu
    Wenjing Li
    Neural Computing and Applications, 2019, 31 : 7055 - 7069
  • [32] A novel transformer-based neural network model for tool wear estimation
    Liu, Hui
    Liu, Zhenyu
    Jia, Weiqiang
    Lin, Xianke
    Zhang, Shuo
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2020, 31 (06)
  • [33] Solving traveling salesman problem based on improved particle swarm optimization algorithm
    Wang, CR
    Zhang, JW
    Yang, J
    Sun, CJ
    Feng, HX
    Yuan, HJ
    PROCEEDINGS OF THE 11TH JOINT INTERNATIONAL COMPUTER CONFERENCE, 2005, : 368 - 373
  • [34] DAN: Decentralized Attention-Based Neural Network for the MinMax Multiple Traveling Salesman Problem
    Cao, Yuhong
    Sun, Zhanhong
    Sartoretti, Guillaume
    DISTRIBUTED AUTONOMOUS ROBOTIC SYSTEMS, DARS 2022, 2024, 28 : 202 - 215
  • [35] Parallelized neural network system for solving Euclidean traveling salesman problem
    Avsar, Bihter
    Aliabadi, Danial Esmaeili
    APPLIED SOFT COMPUTING, 2015, 34 : 862 - 873
  • [36] Amoeba-based Neurocomputing for 8-City Traveling Salesman Problem
    Aono, Masashi
    Zhu, Liping
    Hara, Masahiko
    INTERNATIONAL JOURNAL OF UNCONVENTIONAL COMPUTING, 2011, 7 (06) : 463 - 480
  • [37] A transformer-based adversarial network framework for steganography
    Xiao, Chaoen
    Peng, Sirui
    Zhang, Lei
    Wang, Jianxin
    Ding, Ding
    Zhang, Jianyi
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 269
  • [38] A Transformer-Based Network for Hyperspectral Object Tracking
    Gao, Long
    Chen, Langkun
    Liu, Pan
    Jiang, Yan
    Xie, Weiying
    Li, Yunsong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [39] Investigation of the impact of different versions of GCC on various metaheuristic-based solvers for traveling salesman problem
    Deniz Dal
    Esra Celik
    The Journal of Supercomputing, 2023, 79 : 12394 - 12440
  • [40] Investigation of the impact of different versions of GCC on various metaheuristic-based solvers for traveling salesman problem
    Dal, Deniz
    Celik, Esra
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (11) : 12394 - 12440