期刊文献+

一种等性能面积的并行计算可扩展性度量方法 被引量:1

A Novel Scalability Metric for Parallel Computing
下载PDF
导出
摘要 可扩展性是并行计算的一个重要性能指标,但传统可扩展性度量机制只是试图从某一个侧面反映并行计算可扩展情况,难以全面度量并行计算系统综合性能.深入和全面地研究可扩展性度量机制,从众多性能指标中精选一组关键参数,对其进行归一化处理,然后用Kiviat图的面积来刻画并行计算的综合性能,由此给出一种新的等性能面积的并行计算扩展性度量机制,并进一步分析新度量机制和传统机制的关系.最后,应用新度量机制分析矩阵相乘算法在LogP计算机模型上的扩展性,并在集群平台上实际运行程序进行扩展性实验,进一步验证新机制的有效性.新度量机制对于指导并行计算体系结构完善,以及并行算法设计和调优有重要意义. Scalability is a very important performance metric of parallel computing, but the traditional scalability metrics only try to reflect the parallel computing scalability from one side, which is difficult to fully measure its comprehensive performance. This paper studies scalability metrics deeply and fully. We choose a group of key metrics from lots of performance parameters and normalize them, then characterize the overall performance of parallel computing by the area of Kiviat graph which is posed by the group of key parameters. Thereby, we propose a novel scalability metric about iso-area of performance for parallel computing which measures the scalability of parallel computing by comparing the initial area of the Kiviat graph with the extended one. And the relationship between the new metric and the traditional ones is further analyzed. Finally, the novel metric is applied to address the scalability of the matrix multiplication algorithm under LogP model, and the experiments about extension are carried out on cluster platforms by running the program for the algorithm in order to further validate the effectiveness of the new metric. It is significant to improve parallel computing architecture and to tune parallel algorithm design.
出处 《计算机研究与发展》 EI CSCD 北大核心 2014年第11期2547-2558,共12页 Journal of Computer Research and Development
基金 国家“八六三”高技术研究发展计划基金项目(2009AA012201) 国家自然科学基金项目(61363041,61272107,61202173,61103068) NSFC-微软亚洲研究院联合资助项目(60970155) 上海市优秀学科带头人计划项目(10XD1404400) 教育部网络时代的科技论文快速共享专项研究课题基金项目(20110740001)
关键词 并行计算 可扩展性 等性能面积 度量机制 Kiviat图 parallel computing scalability iso-area of performance metrics Kiviat graph
  • 相关文献

参考文献19

  • 1Amdahl G M. Validity of the single-processor approach to achieving large scale computing capabilities [C] //Proc of the AFIPS Spring Joint Computer Conf. New York: ACM, 1967:483-485.
  • 2陈书明,陈胜刚,尹亚明.Amdahl定律在层次化片上多核处理器中的扩展[J].计算机研究与发展,2012,49(1):83-92. 被引量:7
  • 3Gustafson J L. Reevaluating amdahl's law [J]. Communications of the ACM, 1988, 31(5): 532-533.
  • 4Zorbas J R, Reble D J, Vankooten R E. Measuring the scalability of parallel computer systems [C] //Proc of Supercomputing. New York: ACM, 1989:832-841.
  • 5Nussbaum D, Agarwal A. Scalability of parallel machines [J]. Communications of the ACM, 1991, a4(3), 57-61.
  • 6Sun X H, Gustafson J L. Toward a better parallel performance metric [J]. Parallel Computing, 1991, 17(10) : 1093-1109.
  • 7Sun X H, Ni L M. Scalable problems and memory-bounded speedup [J]. Journal of Parallel and Distributed Computing, 1993, 19(1): 27-37.
  • 8Grama A, Gupta A, Kumar V. Iso-efficiency: Measuring the scalability of parallel algorithms and architectures [J]. IEEE Parallel & Distributed Technology, 1993,1(3): 12-21.
  • 9Sun X H, Rover D T. Scalability of parallel algorithm- machine combinations[J]. IEEE Trans on Parallel and Distributed Systems, 1994, 5(6): 599-613.
  • 10Zhang X D, Yan Y, He K. Latency metric: An experimental method for measuring and evaluating parallel program and architecture scalability[J]. Journal of Parallel and Distributed Computing, 1994, 22(3): 392-410.

二级参考文献54

  • 1谭明锋,龚正虎.基于ASIC实现的高速可扩展并行IP路由查找算法[J].电子学报,2005,33(2):209-213. 被引量:5
  • 2迟利华.大型稀疏线性方程组在分布式存储环境下的并行计算:博士学位论文[M].长沙:国防科学技术大学计算机学院,1998..
  • 3X.Sun.Scalability versus execution time in scalable system[J].Journal of Parallel and Distributed Computing,2002,62(2):173-192.
  • 4A.Gramma,A.Gupta,V.Kumar.Isoefficiensy function:a scalability metic for parallel algorithms and architecures[J].IEEE Parallel and Distributed Technology,1993,1(3):12-21.
  • 5X.Sun,D.Rover,Scalability of parallel algorithm-machine combinations[J].IEEE Transaction on Parallel Distributed Systems,1994,5:599-613.
  • 6Y Chen,X Sun,M Wu.Algorithm-system scalability of heterogeneous computing[J].Journal of Parallel and Distributed Computing,2008,68(11):1403-1412.
  • 7X.Sun,Y.Chen,M.Wu.Scalability of heterogeneous computing.Proceedings of the 34th International Conference on Parallel Processing.Los A lamitos:The IEEE Computer Society,2005.557-564.
  • 8A Danalis,G Marin,C Mccurdy,et al.The scalable heterogeneous computing(SHOC) benchmark suite.Proceedings of 10th GPGPU.New York:ACM,2010.1-12.
  • 9Agarwal A. Tiled multicore processors: The four stages of reality [keynote][C] //Proc of the 40th Annual IEEE/ACM Int Symp on Microarchitecture (MICRO'07). Los Alamitos, CA: IEEE Computer Society, 2008.
  • 10Sankaralingam K, Nagarajan R, Gratz P, et al. The distributed microarehitecture of the TRIPS prototype processor [C] //Proc of the 39th Int Symp on Microarch tecture (MICRO06). Los Alamitos, CA:IEEE Computer Society, 2006:480-491.

共引文献78

同被引文献6

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部