A general and efficient parallel approach is proposed for the first time to parallelize the hybrid finiteelement-boundary-integral-multi-level fast multipole algorithm (FE-BI-MLFMA). Among many algorithms of FE-BI-M...A general and efficient parallel approach is proposed for the first time to parallelize the hybrid finiteelement-boundary-integral-multi-level fast multipole algorithm (FE-BI-MLFMA). Among many algorithms of FE-BI-MLFMA, the decomposition algorithm (DA) is chosen as a basis for the parallelization of FE-BI-MLFMA because of its distinct numerical characteristics suitable for parallelization. On the basis of the DA, the parallelization of FE-BI-MLFMA is carried out by employing the parallelized multi-frontal method for the matrix from the finiteelement method and the parallelized MLFMA for the matrix from the boundary integral method respectively. The programming and numerical experiments of the proposed parallel approach are carried out in the high perfor- mance computing platform CEMS-Liuhui. Numerical experiments demonstrate that FE-BI-MLFMA is efficiently parallelized and its computational capacity is greatly improved without losing accuracy, efficiency, and generality.展开更多
The method of establishing data structures plays an important role in the efficiency of parallel multilevel fast multipole algorithm(PMLFMA).Considering the main complements of multilevel fast multipole algorithm(M...The method of establishing data structures plays an important role in the efficiency of parallel multilevel fast multipole algorithm(PMLFMA).Considering the main complements of multilevel fast multipole algorithm(MLFMA) memory,a new parallelization strategy and a modified data octree construction scheme are proposed to further reduce communication in order to improve parallel efficiency.For far interaction,a new scheme called dynamic memory allocation is developed.To analyze the workload balancing performance of a parallel implementation,the original concept of workload balancing factor is introduced and verified by numerical examples.Numerical results show that the above measures improve the parallel efficiency and are suitable for the analysis of electrical large-scale scattering objects.展开更多
The strict and high-standard requirements for the safety and stability ofmajor engineering systems make it a tough challenge for large-scale finite element modal analysis.At the same time,realizing the systematic anal...The strict and high-standard requirements for the safety and stability ofmajor engineering systems make it a tough challenge for large-scale finite element modal analysis.At the same time,realizing the systematic analysis of the entire large structure of these engineering systems is extremely meaningful in practice.This article proposes a multilevel hierarchical parallel algorithm for large-scale finite element modal analysis to reduce the parallel computational efficiency loss when using heterogeneous multicore distributed storage computers in solving large-scale finite element modal analysis.Based on two-level partitioning and four-transformation strategies,the proposed algorithm not only improves the memory access rate through the sparsely distributed storage of a large amount of data but also reduces the solution time by reducing the scale of the generalized characteristic equation(GCEs).Moreover,a multilevel hierarchical parallelization approach is introduced during the computational procedure to enable the separation of the communication of inter-nodes,intra-nodes,heterogeneous core groups(HCGs),and inside HCGs through mapping computing tasks to various hardware layers.This method can efficiently achieve load balancing at different layers and significantly improve the communication rate through hierarchical communication.Therefore,it can enhance the efficiency of parallel computing of large-scale finite element modal analysis by fully exploiting the architecture characteristics of heterogeneous multicore clusters.Finally,typical numerical experiments were used to validate the correctness and efficiency of the proposedmethod.Then a parallel modal analysis example of the cross-river tunnel with over ten million degrees of freedom(DOFs)was performed,and ten-thousand core processors were applied to verify the feasibility of the algorithm.展开更多
In order to study the work-ability and establish the optimum hot formation processing parameters for industrial 1060 pure aluminum, the compressive deformation behavior of pure aluminum was investigated at temperature...In order to study the work-ability and establish the optimum hot formation processing parameters for industrial 1060 pure aluminum, the compressive deformation behavior of pure aluminum was investigated at temperatures of 523?823 K and strain rates of 0.005?10 s?1 on a Gleeble?1500 thermo-simulation machine. The influence rule of processing parameters (strain, strain rate and temperature) on flow stress of pure aluminum was investigated. Nine analysis factors consisting of material parameters and according weights were optimized. Then, the constitutive equations of multilevel series rules, multilevel parallel rules and multilevel series ¶llel rules were established. The correlation coefficients (R) are 0.992, 0.988 and 0.990, respectively, and the average absolute relative errors (AAREs) are 6.77%, 8.70% and 7.63%, respectively, which proves that the constitutive equations of multilevel series rules can predict the flow stress of pure aluminum with good correlation and precision.展开更多
模块化多电平换流器高压直流输电(modular multilevel converter based HVDC,MMC-HVDC)面临半桥子模块拓扑不能清除直流故障电流和子模块电容电压排序计算量过大、对传感器实时性要求高等问题。有文献针对低压领域MMC应用中存在的负...模块化多电平换流器高压直流输电(modular multilevel converter based HVDC,MMC-HVDC)面临半桥子模块拓扑不能清除直流故障电流和子模块电容电压排序计算量过大、对传感器实时性要求高等问题。有文献针对低压领域MMC应用中存在的负荷电流过大、电容电压监测困难等问题,提出并联全桥子模块(paralleled full bridge sub-module,P-FBSM)拓扑,对MMC-HVDC具有借鉴意义。该文从P-FBSM结构出发,分析其开关状态集,进而推导以开关函数表示的并联全桥MMC桥臂模型;基于最近电平逼近调制,设计P-FBSM动态分配均压控制策略,实现无需子模块电容电压的自均压控制,有效减小控制器计算量、降低传感器实时性要求。最后,在PSCAD/EMTDC中的电磁暂态仿真结果表明,采用所提均压控制策略可同时实现子模块电容电压的自均衡和直流故障电流快速清除。展开更多
文摘A general and efficient parallel approach is proposed for the first time to parallelize the hybrid finiteelement-boundary-integral-multi-level fast multipole algorithm (FE-BI-MLFMA). Among many algorithms of FE-BI-MLFMA, the decomposition algorithm (DA) is chosen as a basis for the parallelization of FE-BI-MLFMA because of its distinct numerical characteristics suitable for parallelization. On the basis of the DA, the parallelization of FE-BI-MLFMA is carried out by employing the parallelized multi-frontal method for the matrix from the finiteelement method and the parallelized MLFMA for the matrix from the boundary integral method respectively. The programming and numerical experiments of the proposed parallel approach are carried out in the high perfor- mance computing platform CEMS-Liuhui. Numerical experiments demonstrate that FE-BI-MLFMA is efficiently parallelized and its computational capacity is greatly improved without losing accuracy, efficiency, and generality.
基金supported by the National Basic Research Program of China (973 Program) (61320)
文摘The method of establishing data structures plays an important role in the efficiency of parallel multilevel fast multipole algorithm(PMLFMA).Considering the main complements of multilevel fast multipole algorithm(MLFMA) memory,a new parallelization strategy and a modified data octree construction scheme are proposed to further reduce communication in order to improve parallel efficiency.For far interaction,a new scheme called dynamic memory allocation is developed.To analyze the workload balancing performance of a parallel implementation,the original concept of workload balancing factor is introduced and verified by numerical examples.Numerical results show that the above measures improve the parallel efficiency and are suitable for the analysis of electrical large-scale scattering objects.
基金supported by the National Natural Science Foundation of China(Grant No.11772192).
文摘The strict and high-standard requirements for the safety and stability ofmajor engineering systems make it a tough challenge for large-scale finite element modal analysis.At the same time,realizing the systematic analysis of the entire large structure of these engineering systems is extremely meaningful in practice.This article proposes a multilevel hierarchical parallel algorithm for large-scale finite element modal analysis to reduce the parallel computational efficiency loss when using heterogeneous multicore distributed storage computers in solving large-scale finite element modal analysis.Based on two-level partitioning and four-transformation strategies,the proposed algorithm not only improves the memory access rate through the sparsely distributed storage of a large amount of data but also reduces the solution time by reducing the scale of the generalized characteristic equation(GCEs).Moreover,a multilevel hierarchical parallelization approach is introduced during the computational procedure to enable the separation of the communication of inter-nodes,intra-nodes,heterogeneous core groups(HCGs),and inside HCGs through mapping computing tasks to various hardware layers.This method can efficiently achieve load balancing at different layers and significantly improve the communication rate through hierarchical communication.Therefore,it can enhance the efficiency of parallel computing of large-scale finite element modal analysis by fully exploiting the architecture characteristics of heterogeneous multicore clusters.Finally,typical numerical experiments were used to validate the correctness and efficiency of the proposedmethod.Then a parallel modal analysis example of the cross-river tunnel with over ten million degrees of freedom(DOFs)was performed,and ten-thousand core processors were applied to verify the feasibility of the algorithm.
基金Project(51275414)supported by the National Natural Science Foundation of ChinaProject(2015JM5204)supported by the Natural Science Foundation of Shaanxi Province,China+1 种基金Project(Z2015064)supported by the Graduate Starting Seed Fund of the Northwestern Polytechnical University,ChinaProject(130-QP-2015)supported by the Research Fund of the State Key Laboratory of Solidification Processing(NWPU),China
文摘In order to study the work-ability and establish the optimum hot formation processing parameters for industrial 1060 pure aluminum, the compressive deformation behavior of pure aluminum was investigated at temperatures of 523?823 K and strain rates of 0.005?10 s?1 on a Gleeble?1500 thermo-simulation machine. The influence rule of processing parameters (strain, strain rate and temperature) on flow stress of pure aluminum was investigated. Nine analysis factors consisting of material parameters and according weights were optimized. Then, the constitutive equations of multilevel series rules, multilevel parallel rules and multilevel series ¶llel rules were established. The correlation coefficients (R) are 0.992, 0.988 and 0.990, respectively, and the average absolute relative errors (AAREs) are 6.77%, 8.70% and 7.63%, respectively, which proves that the constitutive equations of multilevel series rules can predict the flow stress of pure aluminum with good correlation and precision.
文摘模块化多电平换流器高压直流输电(modular multilevel converter based HVDC,MMC-HVDC)面临半桥子模块拓扑不能清除直流故障电流和子模块电容电压排序计算量过大、对传感器实时性要求高等问题。有文献针对低压领域MMC应用中存在的负荷电流过大、电容电压监测困难等问题,提出并联全桥子模块(paralleled full bridge sub-module,P-FBSM)拓扑,对MMC-HVDC具有借鉴意义。该文从P-FBSM结构出发,分析其开关状态集,进而推导以开关函数表示的并联全桥MMC桥臂模型;基于最近电平逼近调制,设计P-FBSM动态分配均压控制策略,实现无需子模块电容电压的自均压控制,有效减小控制器计算量、降低传感器实时性要求。最后,在PSCAD/EMTDC中的电磁暂态仿真结果表明,采用所提均压控制策略可同时实现子模块电容电压的自均衡和直流故障电流快速清除。