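Abstract
To address the impact of memory access latency on modern processor performance, this paper analyzes and tests the helper-thread data prefetching performance of memory-intensive applications on a chip multiprocessor (CMP). The results show that the computation/access latency ratio of hotspots has a major influence on helper-thread prefetching performance. When the split of memory access tasks between the helper thread and the main thread is set according to this ratio, helper-thread prefetching is optimized and yields a larger performance gain. Experiments on benchmark programs show that when the computation in a hotspot is small enough to be negligible, helper-thread prefetching achieves its best performance gain when the memory access task ratio of the helper thread to the main thread is close to 1.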
Memory latency has become a critical bottleneck in achieving high performance on modern processors. Thread-based prefetching on chip multiprocessors (CMPs) is a well-known approach to reducing memory latency and has been explored in different applications. In this paper, we analyze the performance of prefetching threads for memory-intensive applications. The analysis and experimental results show that the computation/access latency ratio (CALR) of hotspots has an important effect on prefetching performance. When the memory access ratio between the main thread and the prefetching thread is close to (1-CALR)/(1+CALR), the prefetching thread achieves better performance. The thread prefetching performance of several benchmarks from the Olden and SPEC2006 benchmark suites is tested, and the experimental results reflect the impact of different memory access ratios between the prefetching thread and the main thread.
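The relation between CALR and the access split can be illustrated with a small calculation. The sketch below is not the authors' code. It takes the target main-thread to prefetching-thread ratio to be (1 - CALR)/(1 + CALR); the subtraction is an assumption, chosen because the abstract states that the best ratio approaches 1 when hotspot computation is negligible. The function names and the restriction to 0 <= CALR < 1 are also hypothetical choices made for illustration.

```python
def target_access_ratio(calr: float) -> float:
    """Main-thread : prefetching-thread memory access ratio.

    Assumes the target is (1 - CALR) / (1 + CALR); the minus sign is an
    assumption, not confirmed by the source text.
    """
    if not 0.0 <= calr < 1.0:
        # Outside this range the assumed formula gives a non-positive ratio,
        # which this sketch does not try to interpret.
        raise ValueError("this sketch only handles 0 <= CALR < 1")
    return (1.0 - calr) / (1.0 + calr)


def split_accesses(total: int, calr: float) -> tuple[int, int]:
    """Split `total` hotspot accesses into (main, prefetching) counts."""
    ratio = target_access_ratio(calr)
    # ratio = main / helper and main + helper = total,
    # so helper = total / (1 + ratio).
    helper = round(total / (1.0 + ratio))
    return total - helper, helper


if __name__ == "__main__":
    for calr in (0.0, 0.25, 0.5):
        main, helper = split_accesses(1000, calr)
        print(f"CALR={calr}: main={main}, prefetching={helper}")
```

Under these assumptions, a hotspot with negligible computation (CALR = 0) splits its accesses evenly between the two threads, and the main thread's share shrinks as CALR grows.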
Source
《电子科技大学学报》
EI
CAS
CSCD
Peking University Core Journals (北大核心)
2012, No. 1, pp. 85-91 (7 pages)
Journal of University of Electronic Science and Technology of China
Funding
Ministry of Education-Intel Special Research Fund for Information Technology (MOE-INTEL-08-10)
Beijing Key Discipline Construction Project