Optimal pivot path of the simplex method for linear programming based on reinforcement learning (cited by: 1)


Abstract: Under the existing pivot rules, the simplex method for linear programming is not polynomial in the worst case. Therefore, the optimal pivot of the simplex method is crucial. In this paper, we propose the optimal rule to find all the shortest pivot paths of the simplex method for linear programming problems based on Monte Carlo tree search. Specifically, we first propose the SimplexPseudoTree to transfer the simplex method into tree search mode while avoiding repeated basis variables. Secondly, we propose four reinforcement learning models with two actions and two rewards to make the Monte Carlo tree search suitable for the simplex method. Thirdly, we set a new action selection criterion to ameliorate the inaccurate evaluation in the initial exploration. It is proved that when the number of vertices in the feasible region is $C_n^m$, our method can generate all the shortest pivot paths, which is polynomial in the number of variables. In addition, we experimentally validate that the proposed schedule can avoid unnecessary search and provide the optimal pivot path. Furthermore, this method can provide the best pivot labels for all kinds of supervised learning methods to solve linear programming problems.

Authors: Anqi Li, Tiande Guo, Congying Han, Bonan Li, Haoran Li

Affiliation: School of Mathematical Sciences

Source: Science China Mathematics (中国科学（数学）（英文版）), indexed in SCIE and CSCD, 2024, Issue 6, pp. 1263-1286 (24 pages)

Funding: Supported by the National Key R&D Program of China (Grant No. 2021YFA1000403), the National Natural Science Foundation of China (Grant No. 11991022), the Strategic Priority Research Program of Chinese Academy of Sciences (Grant No. XDA27000000), and the Fundamental Research Funds for the Central Universities.

Keywords: simplex method; linear programming; pivot rules; reinforcement learning

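Classification: O221.1 [Science: Operations Research and Cybernetics]; TP18 [Automation and Computer Technology: Control Theory and Control Engineering]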
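
To make the notion of a "shortest pivot path" in the abstract concrete, the following is a minimal brute-force sketch, not the paper's Monte Carlo tree search method. It assumes a small, nondegenerate maximization problem in standard form (Ax = b, x >= 0) and treats any exchange between two feasible bases that differ in exactly one variable as a pivot. All function names (`solve`, `feasible_bases`, `shortest_pivot_paths`) and the example data are hypothetical. The enumeration visits up to $C_n^m$ candidate bases, so it is only usable on toy instances.

```python
from collections import deque
from fractions import Fraction
from itertools import combinations


def solve(B, b):
    """Solve B x = b exactly by Gauss-Jordan elimination; None if B is singular."""
    m = len(B)
    M = [[Fraction(v) for v in row] + [Fraction(rhs)] for row, rhs in zip(B, b)]
    for col in range(m):
        piv = next((r for r in range(col, m) if M[r][col] != 0), None)
        if piv is None:
            return None
        M[col], M[piv] = M[piv], M[col]
        p = M[col][col]
        M[col] = [v / p for v in M[col]]
        for r in range(m):
            if r != col and M[r][col] != 0:
                f = M[r][col]
                M[r] = [vr - f * vc for vr, vc in zip(M[r], M[col])]
    return [M[r][m] for r in range(m)]


def feasible_bases(A, b, c):
    """Map each feasible basis (sorted tuple of column indices) to its objective value."""
    m, n = len(A), len(A[0])
    values = {}
    for basis in combinations(range(n), m):
        B = [[A[i][j] for j in basis] for i in range(m)]
        x = solve(B, b)
        if x is not None and all(v >= 0 for v in x):
            values[basis] = sum(c[j] * v for j, v in zip(basis, x))
    return values


def shortest_pivot_paths(A, b, c, start):
    """Breadth-first search over feasible bases; all shortest paths to an optimal basis."""
    values = feasible_bases(A, b, c)
    best = max(values.values())
    start = tuple(sorted(start))
    dist, parents, queue = {start: 0}, {start: []}, deque([start])
    while queue:
        cur = queue.popleft()
        for nxt in values:
            # Assumption: a pivot swaps exactly one basic variable.
            if len(set(cur) & set(nxt)) != len(cur) - 1:
                continue
            if nxt not in dist:
                dist[nxt] = dist[cur] + 1
                parents[nxt] = [cur]
                queue.append(nxt)
            elif dist[nxt] == dist[cur] + 1:
                parents[nxt].append(cur)
    goals = [g for g in values if values[g] == best and g in dist]
    if not goals:
        return []
    d = min(dist[g] for g in goals)

    def unwind(node):
        # Rebuild every shortest path by walking parent links back to the start.
        if not parents[node]:
            return [[node]]
        return [p + [node] for par in parents[node] for p in unwind(par)]

    return [p for g in goals if dist[g] == d for p in unwind(g)]


# Toy problem: max 3*x1 + 2*x2  s.t.  x1 + x2 <= 4, x1 <= 3, x2 <= 3,
# with slack variables at column indices 2, 3, 4.
A = [[1, 1, 1, 0, 0],
     [1, 0, 0, 1, 0],
     [0, 1, 0, 0, 1]]
b = [4, 3, 3]
c = [3, 2, 0, 0, 0]
paths = shortest_pivot_paths(A, b, c, start=(2, 3, 4))
for path in paths:
    print(path)
```

On this toy problem the search starts from the all-slack basis and reports the basis sequences that reach the optimal vertex in the fewest exchanges. Exhaustive search of this kind grows combinatorially with problem size, which is the cost the abstract's tree-search approach is meant to avoid.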


