期刊文献+

面向多核处理器系统的Cache感知调度算法

A Cache-aware Scheduling Algorithm on Multicore Systems
下载PDF
导出
摘要 Cache空间的不公平使用和争用直接影响系统的整体性能,现有Linux操作系统的默认调度算法不能感知程序的行为,包括访问cache的失效次数,不了解线程之间访存模式和频度上可能存在的差异,因而无法做出更加合理的调度.本文提出并在Linux环境下实现了一种Cache感知的调度算法CAS,通过监测每个任务每千条指令的共享cache失效次数,把cache失效次数相近的任务聚合到同一个核上,使得cache失效次数差异较大的任务运行在不同的核上,避免了cache失效次数都很大的任务在不同的核上同时运行,从而减小了cache空间的不公平使用和争用.实验表明,CAS算法在大多数情况下,减少了整个负载的共享cache失效次数,提高系统的平均吞吐量约5%左右. The unfair utilization and contention on cache space impose a significant influence on system performance. Current default Linux operating system scheduling algorithm lacks the awareness of program behavior including cache miss rate, instructions retired etc. Therefore it can not learn about the differences of memory access pattern and frequency between different tasks. This paper pro- poses and implements a cache-aware scheduling algorithm called CAS. CAS analyzes the cache miss rate of different tasks and desig- nated tasks with similar cache miss rate to the same core. Tasks with different cache miss rate will run on different cores and hence it is impossible to run simultaneously for two tasks with same cache miss rate, thereby reducing cache space unfair utilization and cont- ention. Experimental results show the algorithm CAS can reduce total cache miss of the whole workload and improve system through- put up to 5 % on the average in most cases.
出处 《小型微型计算机系统》 CSCD 北大核心 2013年第2期365-369,共5页 Journal of Chinese Computer Systems
基金 国家"九七三"重点基础研究发展计划项目(2011CB302501)资助 "核高基"国家科技重大专项项目(2011ZX01028-001-002)资助 北京市教委科技计划面上项目(KM201210028004)资助 计算机体系结构国家重点实验室开放课题项目(CARCH201203)资助
关键词 任务调度 多核处理器 Cache感知 硬件性能计数器 task scheduling multicore cache-aware hardware pert-ormance counter
  • 相关文献

参考文献2

二级参考文献26

  • 1段然,樊晓桠,荆元利,沈戈,梁婕.面向DSP应用的可重构计算(英文)[J].微电子学与计算机,2005,22(4):45-50. 被引量:2
  • 2Alexandra Fedorova, Margo Seltzer, Christopher Small. Performance of multithreaded chip mulfiprocessors and implications for operating system design[ A]. Proceedings of USENIX 2005 Annual Technical Conference[C]. California: USENIX Association, 20(0. 395 - 398.
  • 3Andrew Tucker. Efficient Scheduling on Multiprogrammed Shared-Memory Mulitprocessors [D]. California: Stanford University, 1993.
  • 4Mark S. Squillante, Edward D. Lazowska. Using processorcache affnity in shared-memory multiprocessor scheduling[J]. IEEE Transactions on Parallel and Distributed Systems., 1993, 4(2) : 131 - 143.
  • 5Andrew Tucker, Anoop Gupta. Process control and scheduling issues for mulfipro-grammed shared-memory multiprocessors [A]. Proceedings of the 12th ACM Symposium on Operating Systems Principles[C]. New York: ACM Press, 1989. 159 - 166.
  • 6A. Snavely, D. Tullsen. Symbiotic jobscheduling for a simultaneous mulfithreading architecture[A]. 9th International Conference on Architectural Support for Programming Languages and Operating Systems [C]. New York: ACM Press, 2000. 234 - 244.
  • 7S Parekh, S Eggers, H. Levy. Thread-sensitive Scheduling for SMT Processors[ R ]. Washington: University of Washington, 2000.
  • 8C Cascaval, L DeRose, D A. Padua et al. Compile-time based performance prediction[A]. Proceedings of the 12th International Workshop on Languages and Compilers for Parallel Compuling[C]. California: Springer, 1999.365 - 379.
  • 9Erik Berg,Efik Hagersten. Statcache: a probabilisfic approach to efficient and accurate data locality analysis[ A]. Proceedings of the International Symposium on Performance Analysis of Systems and Software[C]. Texas: ISPASS,2004.1 - 8.
  • 10Alexandra Fedorova. Operating System Scheduling for Chip Multithreaded Processors [ D ]. Massachusetts: Harvard University, 2006.

共引文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部