期刊文献+

基于并行程序效率和通用性的实践与研究 被引量:3

Practice and Study of the Efficiency and the Portability Based on Parallel Program
下载PDF
导出
摘要 大型并行应用软件一般都是成本高昂的,但在实际有效速度和适应硬件环境变化两方面却往往不尽人意。因此,如何设计效率高且不依赖硬件环境的程序,使其能够持续保持价值,就成了人们追求的目标。从HPC中的问题规模、同步等待、消息传递延迟、数据分解与定位和指令级并行等不同角度出发,通过实例与实验分析,阐述了影响并行应用程序效率和通用性的主要因素,并提出了可用于参考的规则或注意事项。 Generally,a cosmic and complex parallel applied software is very costly. But it is not satisfactory in efficiency and in adapting itself to environment change of hardware, so it has become an object most programmer hanker for how to write a program of high efficiency and not depending on hardware so as to keep up its value. Starting from HPC's problem scale, synchronization waiting, message passing latency, data decomposition and localization and instruction-level parallelization, this paper expounded the main factors affecting program efficiency and portability and presented some rules or points for attention, by means of instances and experiment analysis.
出处 《计算机科学》 CSCD 北大核心 2009年第6期290-293,共4页 Computer Science
基金 国家自然科学基金(60075006) 河南省基础与前沿技术研究(082300410300)资助
关键词 数据依赖 同步等待 数据分解 Cache利用率 通用性 Data dependence, Synchronization waiting, Data decomposition, Cache using ratio, Portability
  • 相关文献

参考文献10

  • 1Mathur N , Dong Shaojing. Study of stochastic estimates of Qu - ark loops with unbiased subtraction[J]. Lattice2002, at Boston, Nucl. Phys. B,2003,119(Proc. Suppl. )
  • 2Hennessy J L,Patterson D A. Computer Architecture: A Quantitative Approach, Third Edition, (英文版·第3版)[M].北京:机械工业出版社,2004:172-224,678-778
  • 3Hwang K. Advanced Computer Architecture: Parallelism Scalability Programmability [ M]. New York: McGraw-Hill Inc. , 1993:179-188,484-494
  • 4Sohi G S. Instruction issue logic for high-performance, interupti- ble,multiple functional unit, piperlined computers[J]. IEEE Trans. on computers, 39(3) : 318-342
  • 5Lynch N A. Distributed Algorithms[M].北京:机械工业出版社,2004:239-247,546-560
  • 6William Group Skjellum E L A. Using MPI- Portable Parallel Programming with the Massage Interface. , Second Edition [M]. Cambridge Massachusetts, London, England. The MIT Press: 1999
  • 7Lewis H R,Papadimitriou C H. Elements of the theory of computation[M].北京:清华大学出版社,2000,7
  • 8Buyya R. High Performance Cluster Computing. Prentice Hill PTR, 1999:409-434,554-557
  • 9王文义,张行进.基本线性计算模型的CUR分析与并行程序有效速度的实用测定方法[J].计算机工程与应用,2002,38(17):135-137. 被引量:1
  • 10白中英,杨旭东,邝坚.并行机体系结构[M].北京:科学出版社.2006.

二级参考文献1

共引文献1

同被引文献20

  • 1于海泳,宋雨,刘伟达.软件复用机会的识别[J].微机发展,2004,14(12):114-116. 被引量:5
  • 2张启,陈欣.一种基于工控机的飞控系统模拟器[J].工业控制计算机,2005,18(3):33-34. 被引量:3
  • 3吴益明,卢京潮,魏莉莉,潘文平.无人机遥控遥测数据的实时处理研究[J].计算机测量与控制,2006,14(5):681-682. 被引量:20
  • 4Dagum L,Menon R.OpenMP:an industry standard API for shared-memory programming[J],IEEE Computational Science & Engineering,1998,5(1):46-55.
  • 5Gropp W,Lusk E,Skjellum A.Using MPI.portable parallel programming with the message-passing interface[M].2rd ed.Cambridge,Mass:MIT Press,1999.
  • 6Newton P,Browne J C.The CODE 2.0 graphical parallel programming language[C]//Proceedings of the 6th ACM International Conference on Supercomputing.New York:ACM,1992-167-177.
  • 7Szafron D,Schaeffer J.An experiment to measure the usability of parallel programming systems[J].Concurrency:Practice and Experience,1996,8(2):146-166.
  • 8Macdonald S,Anvik J,Bromling S,et al.From patterns to fra-meworks to parallel programs[J].Parallel Computing,2002,28 (12):1663-1683.
  • 9Reinders J.Intel Threading Building Blocks[M].Sebastopol CA:O' Reilly,2OO7.
  • 10Wu HuabeL Design-pattern based parallel programming model and system implementation[C]//Proceedings of the 4th International Conference on Wireless Communications,Networking and Mobile Computing.Piscataway:IEEE,2008,11(3):1-5.

引证文献3

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部