Abstract: The online 3D packing problem has received increasing attention in recent years because of its practical value. Its sequential decision-making and large state space have made reinforcement learning over Markov decision processes a popular approach to the problem. In this paper, we focus on the high variance in value estimation caused by reward uncertainty under highly uncertain dynamics. To address this, we propose a solution for the online 3D bin packing problem based on auxiliary tasks and intrinsic rewards, guided by two value networks, which helps the agent learn the policy within an actor-critic deep reinforcement learning framework. Specifically, two value networks are maintained, and the estimates of multiple value networks replace the original value estimates to give better guidance to the learning of the policy network. Experiments show that our model learns more robustly and outperforms previous work.
Abstract: Given a list of items and a sequence of variable-sized bins arriving one by one, it is NP-hard to pack the items into the bin list so as to minimize the total size of the bins from the earliest one to the last one used. This paper presents a set of approximation algorithms for the case in which at most k (≥ 2) arriving bins can be previewed. Under the essential assumption that every bin size is at least the largest item size, analytical results show that the asymptotic worst-case ratios of all k-bounded-space algorithms and of the offline algorithms are 2. In experiments on instances whose item sizes and bin sizes are drawn independently from continuous uniform distributions on [0, u] and [u, 1] respectively, average-case results show that, for fixed k, algorithms using the Best Fit packing (closing) rule are statistically better than those using the First Fit packing (closing) rule.
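As a rough illustration of the two packing rules compared in the abstract above, the following Python sketch keeps at most k bins open and places each item by First Fit or Best Fit. The abstract does not spell out the algorithms, so everything here is an assumption made for illustration: the function names, the lazy opening of bins, and in particular the closing rule (the earliest-opened bin is always closed when a new bin is needed).

```python
from typing import Callable, List, Optional

Rule = Callable[[List[float], float], Optional[int]]


def first_fit(residuals: List[float], item: float) -> Optional[int]:
    """Index of the first open bin whose residual capacity fits the item."""
    for i, r in enumerate(residuals):
        if item <= r:
            return i
    return None


def best_fit(residuals: List[float], item: float) -> Optional[int]:
    """Index of the fitting open bin with the smallest residual capacity."""
    best = None
    for i, r in enumerate(residuals):
        if item <= r and (best is None or r < residuals[best]):
            best = i
    return best


def pack(items: List[float], bin_sizes: List[float], k: int, rule: Rule) -> float:
    """Pack items in order with at most k open bins.

    Returns the total size of the bins from the first one up to the last
    one that received an item. Assumes every bin is at least as large as
    the largest item, as in the abstract.
    """
    open_bins = []   # each entry is [bin_index, residual_capacity]
    next_bin = 0     # index of the next bin to arrive
    last_used = -1
    for item in items:
        while True:
            idx = rule([r for _, r in open_bins], item)
            if idx is not None:
                open_bins[idx][1] -= item
                last_used = max(last_used, open_bins[idx][0])
                break
            if len(open_bins) == k:
                open_bins.pop(0)  # assumed closing rule: close the earliest bin
            if next_bin >= len(bin_sizes):
                raise ValueError("ran out of bins")
            open_bins.append([next_bin, bin_sizes[next_bin]])
            next_bin += 1
    return sum(bin_sizes[: last_used + 1])


if __name__ == "__main__":
    items, bins = [6, 7, 3, 4], [10, 10, 10]
    print(pack(items, bins, 2, first_fit), pack(items, bins, 2, best_fit))
```

On this small instance, Best Fit puts the item of size 3 into the tighter bin and so leaves room for the item of size 4, whereas First Fit has to open a third bin. A single instance like this says nothing about the average-case comparison reported in the abstract.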
Funding: the State Key Development Program for Basic Research of China (No. G2000048005) and SINOPEC (No. X503023).
Abstract: A periodic packing mode for a trickle-bed reactor (TBR) was proposed for gas-limited reactions. The hydrogenation of 2-ethylanthraquinone over Pd/Al2O3 in a laboratory-scale TBR was taken as a test reaction to determine whether the periodic packing mode is advantageous. The effects of operating conditions and packing type on TBR performance were examined experimentally to demonstrate the cause-effect relationships. A mathematical model of the TBR that accounts for axial dispersion and fractional wetting was developed to explain quantitatively the reason for the performance enhancement.
Abstract: The binary wolf pack algorithm (BWPA) is an intelligent algorithm that can solve combinatorial optimization problems in discrete spaces. Based on BWPA, an improved binary wolf pack algorithm (AIBWPA) is proposed that adopts an adaptive step length and an improved update strategy for the wolf pack. AIBWPA is applied to 10 classic 0-1 knapsack problems and compared with BWPA and DPSO; the results show that AIBWPA has higher optimization accuracy and better computational robustness. AIBWPA simplifies the parameters, protects population diversity and enhances global convergence.
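For readers unfamiliar with the benchmark named above, here is a small Python sketch of the 0-1 knapsack problem itself. It is not AIBWPA or BWPA. The function `evaluate` scores a binary vector of the kind a binary metaheuristic would search over, and `knapsack_01` is the standard dynamic program that gives the exact optimum for checking. The function names and the zero-score handling of overweight selections are assumptions made for illustration.

```python
from typing import List


def evaluate(bits: List[int], values: List[int], weights: List[int], capacity: int) -> int:
    """Total value of the selected items, or 0 if the selection is overweight.

    Scoring infeasible selections as 0 is an assumed convention, not one
    taken from the abstract.
    """
    total_weight = sum(w for b, w in zip(bits, weights) if b)
    if total_weight > capacity:
        return 0
    return sum(v for b, v in zip(bits, values) if b)


def knapsack_01(values: List[int], weights: List[int], capacity: int) -> int:
    """Exact optimum by dynamic programming over integer capacities."""
    best = [0] * (capacity + 1)
    for v, w in zip(values, weights):
        # Iterate capacities downwards so each item is used at most once.
        for c in range(capacity, w - 1, -1):
            best[c] = max(best[c], best[c - w] + v)
    return best[capacity]


if __name__ == "__main__":
    values, weights, capacity = [60, 100, 120], [10, 20, 30], 50
    print(knapsack_01(values, weights, capacity))          # exact optimum
    print(evaluate([0, 1, 1], values, weights, capacity))  # a feasible selection
```

The dynamic program runs in time proportional to the number of items times the capacity, so on small classic instances it provides a reference value against which heuristic results can be compared.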
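Abstract: Objective: To address the packing and loading optimization of fresh produce in cold-chain transport, a packing and loading scheme is proposed that allows goods to change dimensions while their volume stays constant, so as to maximize the space utilization of the container. Methods: A nonlinear mixed-integer programming model is built for this problem. To make the model tractable for solvers such as CPLEX or LINGO, a piecewise linearization method is used to linearize it. Because the problem is NP-hard, neither CPLEX nor LINGO can solve large-scale instances effectively, so an algorithm is designed that combines a genetic algorithm with deepest-bottom-left-with-fill (DBLF) loading. Results: Experiments on small- and large-scale instances show that the hybrid genetic algorithm obtains optimal or near-optimal solutions within a reasonable time. Conclusion: The proposed variable-dimension packing scheme effectively improves the loading rate, which benefits both customers and logistics companies.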
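The abstract above does not say which nonlinear terms are linearized or how. As a hedged illustration of the general idea only, the Python sketch below approximates one nonlinear relation that a constant-volume constraint could give rise to: with the height fixed, width = (volume / height) / length. It replaces this curve with straight segments between breakpoints. The function names, the choice of relation and the evenly spaced breakpoints are all assumptions.

```python
from typing import Callable, List


def breakpoints(lo: float, hi: float, n: int) -> List[float]:
    """n + 1 evenly spaced breakpoints covering [lo, hi]."""
    xs = [lo + (hi - lo) * i / n for i in range(n)]
    xs.append(hi)  # set the last point exactly to avoid rounding drift
    return xs


def piecewise_linear(f: Callable[[float], float], xs: List[float]) -> Callable[[float], float]:
    """Return a function that linearly interpolates f between the breakpoints xs."""
    ys = [f(x) for x in xs]

    def approx(x: float) -> float:
        if not xs[0] <= x <= xs[-1]:
            raise ValueError("x lies outside the breakpoint range")
        for i in range(len(xs) - 1):
            if x <= xs[i + 1]:
                t = (x - xs[i]) / (xs[i + 1] - xs[i])
                return ys[i] + t * (ys[i + 1] - ys[i])
        return ys[-1]  # not reached after the range check; kept for safety

    return approx


if __name__ == "__main__":
    area = 12.0  # hypothetical volume / height, so length * width = 12

    def width(length: float) -> float:
        return area / length

    coarse = piecewise_linear(width, breakpoints(1.0, 4.0, 3))
    fine = piecewise_linear(width, breakpoints(1.0, 4.0, 6))
    print(width(1.25), coarse(1.25), fine(1.25))
```

Because 1/x is convex, the interpolant never falls below the true curve, and adding breakpoints tightens the approximation. In a mixed-integer model each segment would typically need extra variables or constraints, so a finer grid trades accuracy against model size.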