Optimal pivot path of the simplex method for linear programming based on reinforcement learning

被引:1
作者
Li, Anqi [1 ]
Guo, Tiande [1 ]
Han, Congying [1 ]
Li, Bonan [1 ]
Li, Haoran [1 ]
机构
[1] Univ Chinese Acad Sci, Sch Math Sci, Beijing 100049, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
simplex method; linear programming; pivot rules; reinforcement learning; CARLO TREE-SEARCH; OPTIMIZATION; NETWORKS; RULE; GAME; GO;
D O I
10.1007/s11425-022-2259-1
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Based on the existing pivot rules, the simplex method for linear programming is not polynomial in the worst case. Therefore, the optimal pivot of the simplex method is crucial. In this paper, we propose the optimal rule to find all the shortest pivot paths of the simplex method for linear programming problems based on Monte Carlo tree search. Specifically, we first propose the SimplexPseudoTree to transfer the simplex method into tree search mode while avoiding repeated basis variables. Secondly, we propose four reinforcement learning models with two actions and two rewards to make the Monte Carlo tree search suitable for the simplex method. Thirdly, we set a new action selection criterion to ameliorate the inaccurate evaluation in the initial exploration. It is proved that when the number of vertices in the feasible region is Cnm, our method can generate all the shortest pivot paths, which is the polynomial of the number of variables. In addition, we experimentally validate that the proposed schedule can avoid unnecessary search and provide the optimal pivot path. Furthermore, this method can provide the best pivot labels for all kinds of supervised learning methods to solve linear programming problems.
引用
收藏
页码:1263 / 1286
页数:24
相关论文
共 50 条
  • [41] Reinforcement Learning Based Optimal Energy Management of A Microgrid
    Iqbal, Saqib
    Mehran, Kamyar
    2022 IEEE ENERGY CONVERSION CONGRESS AND EXPOSITION (ECCE), 2022,
  • [42] Intelligent Path Planning of Underwater Robot Based on Reinforcement Learning
    Yang, Jiachen
    Ni, Jingfei
    Xi, Meng
    Wen, Jiabao
    Li, Yang
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 20 (03) : 1983 - 1996
  • [43] A Reinforcement Learning Based Online Coverage Path Planning Algorithm
    Carvalho, Jose Pedro
    Pedro Aguiar, A.
    2023 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC, 2023, : 81 - 86
  • [44] Reinforcement Learning Based Path Planning Method for Underactuated AUV with Sonar Constraint
    Pang, Zhouqi
    Lin, Xiaobo
    Hao, Chengpeng
    Hou, Chaohuan
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 2382 - 2387
  • [45] Research on AGV path tracking method based on global vision and reinforcement learning
    Zhu, Qixuan
    Zheng, Zhaolun
    Wang, Chao
    Lu, Yujun
    SCIENCE PROGRESS, 2023, 106 (03)
  • [46] Comparing study between simplex method and Lagrange method in a linear programming problem
    Alsaraireh, Ahmed Atallah
    Almasarweh, Mohammad Salameh
    Al Wadi, S.
    Alnawaiseh, Mahmoud Barakat
    ITALIAN JOURNAL OF PURE AND APPLIED MATHEMATICS, 2019, (42): : 934 - 943
  • [47] Reinforcement learning method based on sample regularization and adaptive learning rate for AGV path planning
    Nie, Jun
    Zhang, Guihua
    Lu, Xiao
    Wang, Haixia
    Sheng, Chunyang
    Sun, Lijie
    NEUROCOMPUTING, 2025, 614
  • [48] Adaptive evolutionary programming based on reinforcement learning
    Zhang, Huaxiang
    Lu, Jing
    INFORMATION SCIENCES, 2008, 178 (04) : 971 - 984
  • [49] Genetic algorithm based on simplex method for solving linear-quadratic bilevel programming problem
    Wang, Guangmin
    Wan, Zhongping
    Wang, Xianjia
    Lv, Yibing
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2008, 56 (10) : 2550 - 2555
  • [50] RLProph: a dynamic programming based reinforcement learning approach for optimal routing in opportunistic IoT networks
    Sharma, Deepak Kumar
    Rodrigues, Joel J. P. C.
    Vashishth, Vidushi
    Khanna, Anirudh
    Chhabra, Anshuman
    WIRELESS NETWORKS, 2020, 26 (06) : 4319 - 4338