期刊文献+

一种基于推测代价评估的推测多线程并行粒度调节方法 被引量:4

A PARALLEL GRANULARITY TUNING APPROACH FOR SPECULATIVE MULTITHREADING BASED ON SPECULATIVE COST EVALUATION
下载PDF
导出
摘要 传统的推测多线程技术总是假定程序的并行粒度大小应该随着处理器核资源数目的增加而增大,未考虑不同数目的处理器核资源对程序自身并行性能的影响作用。针对这个问题,提出一种自适应的循环并行粒度调节方法用于优化处理器核资源的分配过程。以推测级为单位,通过动态收集循环中所有推测线程的性能量化分析结果,进行推测代价评估。并利用评估结果动态调整循环的并行粒度大小,优化所分配到的处理器核资源的数目,以减少不必要的推测代价。实验表明,该方法不但在SPEC CPU基准测试程序集上能取得较好的性能提升,而且进一步优化了推测时的能耗开销。 Traditional speculative multithreading always assumes that the size of program's parallel granularity should increase as the number of processor core resources increases. It doesn't consider the effect of different number of processor core resources on the parallel performance of a program. Therefore, we proposed a self-adaptive parallel granularity adjustment for loops to optimize the allocation of their processor core resources. This approach took the speculative level as the unit, and performed the speculative cost evaluation by mean of dynamically collecting the results of performance quantitative analysis for all speculative threads within a loop. The results of cost evaluation were used to dynamically adjust the size of loop's parallel granularity and optimize the number of their allocated processor core resources to reduce the unnecessary cost for speculation. The experimental results show that our approach not only achieves better performance on SPEC CPU benchmark assemblies, but also optimizes the power consumption for speculation.
作者 李美蓉 赵银亮 Li Meirong;Zhao Yinliang(Xi’an Aeronautical University, Xi’an 710077,Shaanxi, China;Xi’an Jiaotong University, Xi’an 710049,Shaanxi,China)
出处 《计算机应用与软件》 北大核心 2019年第4期29-36,90,共9页 Computer Applications and Software
基金 国家自然科学基金项目(61640219 61173040) 校级科研基金项目(2016KY1103)
关键词 推测多线程 代价评估 并行粒度 Speculative multithreading Cost evaluation Parallel granularity
  • 相关文献

参考文献4

二级参考文献57

  • 1Yang L and Zhai A. Dynamically dispatching speculative threads to improve sequential execution[J]. ACM Transactions on Architecture and Code Optimization, 2012, 9(3): 13:1-13:31.
  • 2Vijaykumar T N and Sohi G S. Task selection for a multiscalar processor[C]. Proceedings of the 31st Annual ACM/IEEE International Symposium on Microarchitecture, Dallas, 1998: 81-92.
  • 3Hammond L, Hubbert B A, Siu M, et al.. The stanford hydra cmp[J]. IEEE Micro, 2000, 20(2): 71-84.
  • 4Liu W, Tuck J, Ceze L, et al.. POSH: a TLS compiler that exploits program structure[C]. Proceedings of the 8th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, New York, 2006: 158-167.
  • 5Madriles C, García-Qui?ones C, Sánchez J, et al.. Mitosis: a speculative multithreaded processor based on precomputation slices[J]. IEEE Transactions on Parallel and Distributed Systems, 2008, 19(7): 914-925.
  • 6Jialu H, Jablin T B, Beard S R, et al.. Automatically exploiting cross-invocation parallelism using runtime information[C]. Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization, Shenzhen, 2013: 1-11.
  • 7Gao L, Li L, Xue J, et al.. SEED: a statically greedy and dynamically adaptive approach for speculative loop execution[J]. IEEE Transactions on Computers, 2013, 62(5): 1004-1016.
  • 8Sharafeddine M, Jothi K, and Akkary H. Disjoint out-of-order execution processor[J]. ACM Transactions on Architecture and Code Optimization, 2012, 9(3): 19:1-19:32.
  • 9Wang S Y, Yew P C, and Zhai A. Code transformations for enhancing the performance of speculatively parallel threads[J]. Journal of Circuits, Systems and Computers, 2012, 21(2): 1-23.
  • 10Du Z H, Lim C C, Li X F, et al.. A cost-driven compilation framework for speculative parallelization of sequential programs[J]. Association for Computing Machinery Special Interest Group Programming Languages Notices, 2004, 39(6): 71-81.

共引文献8

同被引文献31

引证文献4

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部