Optimal pivot path of the simplex method for linear programming based on reinforcement learning

Abstract: Based on the existing pivot rules, the simplex method for linear programming is not polynomial in the worst case. Therefore, the optimal pivot of the simplex method is crucial. In this paper, we propose the optimal rule to find all the shortest pivot paths of the simplex method for linear programming problems based on Monte Carlo tree search. Specifically, we first propose the SimplexPseudoTree to transfer the simplex method into tree search mode while avoiding repeated basis variables. Secondly, we propose four reinforcement learning models with two actions and two rewards to make the Monte Carlo tree search suitable for the simplex method. Thirdly, we set a new action selection criterion to ameliorate the inaccurate evaluation in the initial exploration. It is proved that when the number of vertices in the feasible region is $C_n^m$, our method can generate all the shortest pivot paths, which is polynomial in the number of variables. In addition, we experimentally validate that the proposed schedule can avoid unnecessary search and provide the optimal pivot path. Furthermore, this method can provide the best pivot labels for all kinds of supervised learning methods to solve linear programming problems.
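To make the notion of a "shortest pivot path" concrete, the sketch below enumerates all shortest pivot paths on a tiny LP by plain breadth-first search over feasible bases. This is not the Monte Carlo tree search method of the paper; it is a brute-force illustration that is only practical for very small problems. The function names (`solve`, `pivots`, `shortest_pivot_paths`) and the example data are hypothetical, and the sketch assumes a bounded maximization problem in standard form (Ax = b, x >= 0) with a known feasible starting basis.

```python
from fractions import Fraction


def solve(M, rhs):
    """Solve the square system M x = rhs exactly (Gauss-Jordan); None if singular."""
    n = len(M)
    aug = [[Fraction(v) for v in row] + [Fraction(r)] for row, r in zip(M, rhs)]
    for col in range(n):
        piv = next((r for r in range(col, n) if aug[r][col] != 0), None)
        if piv is None:
            return None
        aug[col], aug[piv] = aug[piv], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                f = aug[r][col]
                aug[r] = [a - f * q for a, q in zip(aug[r], aug[col])]
    return [row[-1] for row in aug]


def pivots(A, b, c, basis):
    """Return every basis reachable from `basis` by one improving simplex pivot."""
    m, n = len(A), len(A[0])
    B = [[A[i][j] for j in basis] for i in range(m)]
    x_B = solve(B, b)  # values of the basic variables
    successors = []
    for j in range(n):
        if j in basis:
            continue
        d = solve(B, [A[i][j] for i in range(m)])  # d = B^{-1} A_j
        reduced_cost = c[j] - sum(c[k] * d_i for k, d_i in zip(basis, d))
        if reduced_cost <= 0:
            continue  # entering x_j would not improve the maximization objective
        ratios = [(x_B[i] / d[i], i) for i in range(m) if d[i] > 0]
        if not ratios:
            continue  # unbounded direction; excluded by assumption in this sketch
        best = min(r for r, _ in ratios)
        for r, i in ratios:
            if r == best:  # ties in the ratio test give several leaving choices
                new_basis = basis[:i] + basis[i + 1:] + (j,)
                successors.append(tuple(sorted(new_basis)))
    return successors


def shortest_pivot_paths(A, b, c, start):
    """Breadth-first search over bases; return all shortest paths to an optimal basis."""
    frontier = [[tuple(sorted(start))]]
    seen = {frontier[0][0]}
    while frontier:
        expanded = [(path, pivots(A, b, c, path[-1])) for path in frontier]
        # For a bounded LP, a basis with no improving pivot is optimal.
        optimal = [path for path, succ in expanded if not succ]
        if optimal:
            return optimal
        frontier = [path + [s] for path, succ in expanded for s in succ if s not in seen]
        seen.update(path[-1] for path in frontier)
    return []


# Illustrative LP: max 3*x0 + 5*x1  s.t.  x0 <= 4, 2*x1 <= 12, 3*x0 + 2*x1 <= 18,
# written in standard form with slack variables x2, x3, x4.
A = [[1, 0, 1, 0, 0],
     [0, 2, 0, 1, 0],
     [3, 2, 0, 0, 1]]
b = [4, 12, 18]
c = [3, 5, 0, 0, 0]
paths = shortest_pivot_paths(A, b, c, start=(2, 3, 4))
for path in paths:
    print(len(path) - 1, "pivots:", path)
```

On this example the search reports a single shortest path of 2 pivots, from the slack basis (2, 3, 4) through (1, 2, 4) to the optimal basis (0, 1, 2). Entering x0 first instead would need 3 pivots, which illustrates why the choice of pivot matters. Exact rational arithmetic via `fractions.Fraction` is used so that the ratio-test ties and sign checks are not affected by floating-point rounding.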

Authors: Anqi Li, Tiande Guo, Congying Han, Bonan Li, Haoran Li

Affiliation: School of Mathematical Sciences

Source: Science China Mathematics (中国科学（数学）（英文版）), SCIE, CSCD, 2024, Issue 6, pp. 1263-1286 (24 pages)

Funding: Supported by the National Key R&D Program of China (Grant No. 2021YFA1000403), the National Natural Science Foundation of China (Grant No. 11991022), the Strategic Priority Research Program of Chinese Academy of Sciences (Grant No. XDA27000000), and the Fundamental Research Funds for the Central Universities.

Keywords: simplex method; linear programming; pivot rules; reinforcement learning

Classification code: TP3 [Automation and Computer Technology: Computer Science and Technology]
