A pointer network based deep learning algorithm for unconstrained binary quadratic programming problem

被引:18
作者
Gu, Shenshen [1 ]
Hao, Tao [1 ]
Yao, Hanmei [1 ]
机构
[1] Shanghai Univ, Sch Mechatron Engn & Automat, 99 Shangda Rd, Shanghai 200444, Peoples R China
基金
美国国家科学基金会;
关键词
UBQP; Pointer network; Supervised learning; Deep reinforcement learning; COMBINATORIAL OPTIMIZATION;
D O I
10.1016/j.neucom.2019.06.111
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Combinatorial optimization problems have been widely used in various fields. And many types of combinatorial optimization problems can be generalized into the model of unconstrained binary quadratic programming (UBQP). Therefore, designing an effective and efficient algorithm for UBQP problems will also contribute to solving other combinatorial optimization problems. Pointer network is an end-to-end sequential decision structure and combines with deep learning technology. With the utilization of the structural characteristics of combinatorial optimization problems and the ability to extract the rule behind the data by deep learning, pointer network has been successfully applied to solve several classical combinatorial optimization problems. In this paper, a pointer network based algorithm is designed to solve UBQP problems. The network model is trained by supervised learning (SL) and deep reinforcement learning (DRL) respectively. Trained pointer network models are evaluated by self-generated benchmark dataset and ORLIB dataset respectively. Experimental results show that pointer network model trained by SL has strong learning ability to specific distributed dataset. Pointer network model trained by DRL can learn more general distribution data characteristics. In other words, it can quickly solve problems with great generalization ability. As a result, the framework proposed in this paper for UBQP has great potential to solve large scale combinatorial optimization problems. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 29 条
[1]  
[Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386
[2]  
[Anonymous], ARXIV14064812
[3]  
[Anonymous], 2017, ADV NEURAL INFORM PR
[4]  
[Anonymous], MATH THEOR COMPUT SC
[5]   AN APPLICATION OF COMBINATORIAL OPTIMIZATION TO STATISTICAL PHYSICS AND CIRCUIT LAYOUT DESIGN [J].
BARAHONA, F ;
GROTSCHEL, M ;
JUNGER, M ;
REINELT, G .
OPERATIONS RESEARCH, 1988, 36 (03) :493-513
[6]  
Beasley J.E., 1998, Technical Report
[7]  
Bello I., 2016, INT C LEARNING REPRE
[8]  
Chiu CC, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P4774, DOI 10.1109/ICASSP.2018.8462105
[9]   A Pointer Network Based Deep Learning Algorithm for the Max-Cut Problem [J].
Gu, Shenshen ;
Yang, Yue .
NEURAL INFORMATION PROCESSING (ICONIP 2018), PT I, 2018, 11301 :238-248
[10]   The Implementation of a Pointer Network Model for Traveling Salesman Problem on a Xilinx PYNQ Board [J].
Gu, Shenshen ;
Hao, Tao ;
Yang, Shaofu .
ADVANCES IN NEURAL NETWORKS - ISNN 2018, 2018, 10878 :130-138