As the hardware industry moves toward using specialized heterogeneous many-core processors to avoid the effects of the power wall,software developers are finding it hard to deal with the complexity of these systems.In...As the hardware industry moves toward using specialized heterogeneous many-core processors to avoid the effects of the power wall,software developers are finding it hard to deal with the complexity of these systems.In this paper,we share our experience of developing a programming model and its supporting compiler and libraries for Matrix-3000,which is designed for next-generation exascale supercomputers but has a complex memory hierarchy and processor organization.To assist its software development,we have developed a software stack from scratch that includes a low-level programming interface and a high-level OpenCL compiler.Our low-level programming model offers native programming support for using the bare-metal accelerators of Matrix-3000,while the high-level model allows programmers to use the OpenCL programming standard.We detail our design choices and highlight the lessons learned from developing system software to enable the programming of bare-metal accelerators.Our programming models have been deployed in the production environment of an exascale prototype system.展开更多
针对化工生产中广泛存在的一类带多工序的异构并行机调度问题,即部分产品需多工序加工,同时不同产品间带序相关设置时间的异构并行机调度问题(heterogeneous parallel machine scheduling problem with multiple operations and sequenc...针对化工生产中广泛存在的一类带多工序的异构并行机调度问题,即部分产品需多工序加工,同时不同产品间带序相关设置时间的异构并行机调度问题(heterogeneous parallel machine scheduling problem with multiple operations and sequence-dependent setup times,HPMSP_MOSST),提出了一种遗传-分布估计算法(genetic algorithm-estimation of distribution algorithm,GA-EDA),用于优化最早完工时间(makespan)。首先,提出了一种基于GA的概率模型训练机制,用来提高概率模型在算法进化初期的信息积累量,进而提高搜索的效率;其次,设计了一种有效的GA与EDA混合策略,使得算法的全局探索和局部开发能力得到合理平衡。计算机模拟验证了GA-EDA的有效性和鲁棒性。展开更多
基金Project supported by the National Key Research and Development Program of China(No.2021YFB0300101)the National Natural Science Foundation of China(No.61972408)the UK Royal Society International Collaboration Grant。
文摘As the hardware industry moves toward using specialized heterogeneous many-core processors to avoid the effects of the power wall,software developers are finding it hard to deal with the complexity of these systems.In this paper,we share our experience of developing a programming model and its supporting compiler and libraries for Matrix-3000,which is designed for next-generation exascale supercomputers but has a complex memory hierarchy and processor organization.To assist its software development,we have developed a software stack from scratch that includes a low-level programming interface and a high-level OpenCL compiler.Our low-level programming model offers native programming support for using the bare-metal accelerators of Matrix-3000,while the high-level model allows programmers to use the OpenCL programming standard.We detail our design choices and highlight the lessons learned from developing system software to enable the programming of bare-metal accelerators.Our programming models have been deployed in the production environment of an exascale prototype system.
文摘针对化工生产中广泛存在的一类带多工序的异构并行机调度问题,即部分产品需多工序加工,同时不同产品间带序相关设置时间的异构并行机调度问题(heterogeneous parallel machine scheduling problem with multiple operations and sequence-dependent setup times,HPMSP_MOSST),提出了一种遗传-分布估计算法(genetic algorithm-estimation of distribution algorithm,GA-EDA),用于优化最早完工时间(makespan)。首先,提出了一种基于GA的概率模型训练机制,用来提高概率模型在算法进化初期的信息积累量,进而提高搜索的效率;其次,设计了一种有效的GA与EDA混合策略,使得算法的全局探索和局部开发能力得到合理平衡。计算机模拟验证了GA-EDA的有效性和鲁棒性。