期刊文献+

基于取指执行时序范畴的多核共享Cache干扰分析 被引量:4

Analysis of Inter-Thread Interference on Shared Cache Multi-Core Architectures Based on Instruction Fetch Timing Frame
下载PDF
导出
摘要 在多核结构中,获得并行应用线程的安全、精确的最坏情况执行时间(worst case execution time,WCET)的最大挑战之一在于共享资源的竞争冲突检测.在共享Cache的多核处理器中,线程在共享Cache中的指令可能被其他并行线程的指令替换,从而导致了线程间在共享Cache上的干扰,因此多核结构下线程WCET需要考虑并行线程间在共享Cache上的干扰.在现有的简单地址映射干扰分析基础上,考虑了指令取指执行时序因素对干扰的影响,提出了非干扰状态的充分不必要条件,根据指令的取指执行时序范畴判断线程在共享Cache上的干扰状态.通过排除非干扰状态,可以进一步精确多核结构中线程的WCET估值.理论分析证明了该方法的有效性.实验结果表明,与当前现有的考虑执行周期和基于逻辑访问先后顺序的方法相比,基于时序方法下的WCET估值分别可以提高12%和7%的精确度. In real-time systems, designers need to obtain a safe and tight worst-case execution time (WCET) of applications. It is very challenging for multi core processors due to the possible inter- thread interference caused by shared resources. For multi-core platforms with shared caches, instructions may be evicted by another co-running thread, which results in inter-thread interference in shared cache. In this case the execution time of applications can not be analyzed independently. The inter-thread interference should be taken into consideration in WCET analysis on multi-core platforms. This paper proposes a sufficient and unnecessary condition of non-interference to analyze the worst-case inter-thread interference by considering the timing frame of instruction fetch. Our approach introduces the consideration of timing frames into the conservative address analysis. Therefore, the worst-case shared cache misses can be reasonably estimated to determine the latency of inter-thread interference in shared cache. Our experiments indicate that the proposed approach improves the tightness of WCET estimation by 12~//00 as compared with the representative related work which proposes an lifetime-based method, and by 7 ~ as compared with another representative related work which is based on the structure order of shared cache accesses.
出处 《计算机研究与发展》 EI CSCD 北大核心 2013年第1期206-217,共12页 Journal of Computer Research and Development
基金 国家自然科学基金项目(61272097) 高等学校博士学科点专项科研基金项目(20104307110005) 国防科学技术大学优秀研究生创新资助项目(B100601) 湖南省研究生科研创新资助项目(CX2010B026)
关键词 多核体系结构 共享CACHE 干扰 取指执行时序 最坏情况下执行时间 multi-core architecture shared cache interference instruction fetch timing worst case execution time (WCET)
  • 相关文献

参考文献21

  • 1International technology roadmap for semiconductors 2008 update [EB/OL]. 2007 [2010-12-01]. http://public, itrs. net.
  • 2Davis R I, Burns A. A survey of hard real-time scheduling algorithms and schedulability analysis techniques for multiprocessor systems, YCS-2009-443 [R]. York: University of York, Department of Computer Science Technical, 2009.
  • 3Calandrino J, Anderson J, Baumberger D. A hybrid real- time scheduling approach for large-scale multi-core platforms [C] //Proc of the 19th Euromicro Conf on Real-Time Systems. Piscataway, NJ: IEEE, 2007:247-258.
  • 4Puschner P, Koza C. Calculating the maximum execution time of real-time program [J]. Real-Time Systems, 1989, 1 (2) : 159-176.
  • 5Mok A, Amerasinghe P, Tantisirivat K. Evaluating tight execution time hounds of programs by annotations [C] //Proc of the 6th IEEE Workshop on Real-Time Operating System and Software. Piscataway, NJ: IEEE, 1989:74,-80.
  • 6Liu C, Sivasubramaniam A, Kandemir T. Organizing the last line of defense before hitting the memory wall for CMP [C]// Proc of HPCA. Piscataway, NJ: IEEE, 2004: 176.
  • 7Chang J, Sohi G. Cooperative caching for chip multiprocessors[C] //Proc of the 33rd Annual Int Symp on Computer Architecture. Piscataway, NJ: 1EEE, 2006:264- 276.
  • 8Chen Shimin, Gibbons P B, Kozuch M. Scheduling threads for constructive cache sharing on CMPs[C] //Proc of the 19th Annual ACM Symp on Parallel Algorithms and Architectures. New York: ACM, 2007:105-115.
  • 9Suhendra V, Mitra T. Exploring locking and partitioning for predictable shared caches on multi cores [C] //Proc of DAC. New York: ACM, 2008.
  • 10Fedorova A, Seltzer M, Small C, et al. Throughput oriented scheduling on chip multithreading systems, TR-17-04 [R]. Cambridge: Harvard University, 2005.

同被引文献44

  • 1邸楠,王韬,李晓明.LilyTask任务并行环境中基于任务关系的初始任务分配算法[J].计算机学报,2005,28(5):892-899. 被引量:6
  • 2何琨,赵勇,陈阳.分布式环境下多任务调度问题的分析与求解[J].系统工程理论与实践,2007,27(5):119-125. 被引量:12
  • 3Quirk,William J.Verfication and Validation of Real Time Software[M].New York:Springer-Verlag,1985.
  • 4Candra D,Guo Fei,Kim S,et al.Predicting inter-Thread CacheContention on a Chip Multi Processor Architecture[C]//Proceeding of the 11th International Symposium on High Performance Computer Architecture.2005:340-351.
  • 5Subramanian R,Smaragdakis Y,Loh G H,Adaptive caches:Effective shaping of cache behavior to workload[C]//Proc of the 39th Annual IEEE/ACM into Symposium on Microarchitecture.2006:118-131.
  • 6Cousot P,Cousot R.Interpretation:a Unified Lattice Model for Static Analysis of Programs by Construction or Approximation of Fix points[C]//the Proceedings of ACM Symposium on Principles of Programming Language.1997:1-25.
  • 7Baier C,Katoen J P.Principles of Model Checking[M].Cambridge,Massaehusetts:The MIT Press,2007.
  • 8Park C,Shaw A C.Experiment switches a Program Timing Tool Based on Source Level Timing Schema[J].IEEE Transactions on Computers,1991,24 (5):48-57.
  • 9Tsun Y,Li S,Malik S.Performance Analysis of Embedded Software Using Implicit Path Enumeration[C]//Workshop on Languages Compilers and Tools for Real Time Systems.1995:456-461.
  • 10Stappert F,Ermedahl A,Engblom J.Efficient Longest Executable Path Search for Programs with Complex Flows and Pipeline Effects[R].UPPsalaUniversity,2001.

引证文献4

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部