期刊文献+

NVIDIA GPU量子化学ab initio计算性能与并行效率研究

A study of the quantum chemistry ab initio calculation performance and parallel efficiency on the NVIDIA GPU platform
原文传递
导出
摘要 选择Valinomycin,Barnase等8个化学、生命科学领域的典型大分子体系,基函数数目从1160至12716范围,用NVIDIA K80 GPU高性能工作站,测试和分析了基于GPU特性设计的TeraChem量子化学程序的Hartree-Fock ab initio计算性能、并行效率,并与GAMESS程序在CPU典型集群上的计算精度和效率进行对比。结果表明,TeraChem在单一K80 GPU工作站的计算速度是16节点四核CPU集群的2至17倍;当GPU计算核心数增加4倍,GPU平台计算性能还能有2.5倍增长;若正确控制计算过程参数,GPU与CPU平台计算结果的计算精度完全一致。TeraChem在K80 GPU上的卓越计算性能和表现,可让化学家在普通化学实验室而不是超级计算中心,完成大规模的非经验量子化学计算,挑战计算化学极限。 This paper selected 8 representative molecules, including Valinomycin and Barnase, from the fields of chemistry and life science, and tested the Hartree-Fock ab initio calculation performance and efficiency of the TeraChem program on K80 GPU workstation using the number of basis sets from 1160-12716 as the benchmark. By comparing the performance of TeraChem and that of GAMESS, the results showed that the computing speed of TeraCbem based on a K80 GPU is 2 to 17 times faster than that of GAMESS based on a 16-node CPU PC-Cluster. TeraChem computing performance can be increased to 2.5 times more when the GPU core number increases by 4 times. Under correct parameter control, the results and precision are exactly the same as those based on GPU and CPU. Therefore, using GPU-based TeraChem software, chemists can perform large-scale ab initio quantum chemical calculation in a laboratory instead of a super-computing center.
作者 何禹 王一波
出处 《计算机与应用化学》 CAS 2016年第5期575-581,共7页 Computers and Applied Chemistry
基金 贵州省自然科学基金资助项目(20082116)
关键词 GPU Hartree-Fock从头计算 TeraChem GPU Hartree-Fock ab initio TeraChem
  • 相关文献

参考文献23

  • 1Lewars E G. Computational chemistry: introduction to the theory and applications of molecular and quantum mechanics, MA, USA, Kluwer Academic Publishers, 2003.
  • 2Elias R, Emanuel H R and Pawel Satek K S. Density functional theory electronic structure calculations with linearly scaling computational time and memory usage[J]. Journal of Chemical Theory and Computation, 2011, 7:340-350.
  • 3Weigend F. A fully direct RI-HF algorithm: Implementation, optimised auxiliary basis sets, demonstration of accuracy and efficiency[J]. Physical Chemistry Chemical Physics, 2002, 4(18): 4285-429.
  • 4Neese F, Wennmohs F, Hansen A, Becker U. Efficient, Approximate and parallel Hartree-Fock and hybrid DFT calculations. A 'chain-of-spheres' algorithm for the Hartree-Fock exchange[J]. Chemical Physics, 2009, 356:98-109.
  • 5US National Reserach Council. Impact of advances in computing and communications technologies on chemical science and technology[M]. Washington DC: National Academy Press, 1999.
  • 6Goldi M, Nisha K, Abhishek D, et al. Evaluation of rodinia codes on Intel Xeon Phi [D]//Proc of the 4th International Conference on Intelligent Systems, Modelling and Simulation, 2012:415-419.
  • 7http://ark.intel.com/products/84683/Intel-Xeon-Processor-E7-8880- v3-45M-Cache-2_30-GHz?wapkw=e7-8880+v3.
  • 8何禹,王一波.PC Linux集群系统量子化学从头计算性能研究[J].计算机与应用化学,2003,20(5):587-592. 被引量:4
  • 9http://www.nvidia.cn/object/tesla-servers-cn.html.
  • 10Hwu Wenme. GPU Computing Gems Emerald, Burlington, MA, Elsevier Inc, 2011.

二级参考文献68

  • 1..http ://www. topSO0, org/.,.
  • 2..http://www. beowulf. org/beowulf/projects. html.,.
  • 3.Linda参考手册[M].SCA公司出版,..
  • 4.Gaussian98参考手册[M].Gaussian公司出版,..
  • 5Case,D.A.; Darden,T.A.; Cheatham,T.E.,III.; Simmerling,C.L.;Wang,J.; Duke,R.E.; Luo,R.;Walker,R.C.; Zhang,W.; Merz,K.M.; Roberts,B.;Wang,B.; Hayik,S.; Roitberg,A.; Seabra,G.; Kolossváry,I.;Wong,K.F.; Paesani,F.; Vanicek,J.; Liu,J.;Wu,X.; Brozell,S.R.; Steinbrecher,T.; Gohlke,H.; Cai,Q.; Ye,X.;Wang,J.; Hsieh,M.J.; Cui,G.; Roe,D.R.; Mathews,D.H.; Seetin,M.G.; Sagui,C.; Babin,V.; Luchko,T.; Gusarov,S.; Kovalenko,A.; Kollman,P.A.AMBER 11; University of California: San Francisco.
  • 6G?tz,A.W.;W?lfle,T.;Walker,R.C.Annual Reports in Computational Chemistry; Elsevier: Amsterdam,2010; Vol.6,p 21.
  • 7Xu,D.;Williamson,M.J.;Walker,R.C.Annual Reports in Computational Chemistry; Elsevier: Amsterdam,2010; Vol.6,p 1.
  • 8NVIDIA CUDA.Compute Unified Device Architecture Programming Guide Version 3.0.http://www.nvidia.com/object/cuda_develop.html (accessed March 6,2010).
  • 9Comparison of Nvidia Graphics Processing Units.http://en.wikipedia.org/wiki/Comparison_of_Nvidia_graphics_processing_units (accessed March 6,2010).
  • 10Lengyel,J.; Reichert,M.; Donald,B.R.; Greenberg,D.P.Comput.Graph.1990,24,327.

共引文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部