摘要
线程级推测技术使在多核上加速传统上难以手工或自动并行化的串行程序成为可能,它不仅需要合理地选择线程的划分策略,而且需要合理地选择适合推测执行的应用.已有的大量研究主要集中在如SPEC CPU这样的桌面应用领域,为了全面地认识TLS技术的应用适用性,本文探讨TLS技术对科学计算应用的性能提升潜力,提出一套TLS适用性的基本判定准则,实验结果表明采用该技术加速SPLASH2中的多数应用可以有效利用16核及以上的计算资源.
Thread-level speculation (TLS) provides an opportunity for speeding up the serial codes on the multicore chips which are traditionally hard to parallel manually or automatically. It requires not only reasonable thread partition strategy,but also the suitable applications. Up to now,lots of researches have been mainly concentrated in the desktop applications,such as the SPEC CPU benchmark suite. In order to fully understand the TLS applicability,this paper explores the TLS potential parallelism in the scientific field,analyzes its impacting factors,and proposes a criterion for selecting the region to be speculatively executed. The experimental results show that the majority of SPLASH2 applications can make an effective use of more than 16 cores' computing resources in the TLS technology.
出处
《小型微型计算机系统》
CSCD
北大核心
2010年第2期264-270,共7页
Journal of Chinese Computer Systems
基金
国家自然科学基金重点项目(60633040)资助
国家"九七三"重点基础研究发展规划基金项目(2005CB321601)资助
国家"八六三"高技术研究发展计划重大项目(2006AA01A102-5-2)资助
教育部-英特尔信息技术专项科研基金项目(MOE-INTEL-08-07)资助
关键词
多核
线程级推测
线程划分
动态剖析
multicore
thread-level speculation
thread partition
dynamic profiling