
A Separate Stack Model in Speculative Multithreading Based on Chip Multi-Processor
Abstract: In speculative multithreaded parallel execution, function calls that rely on a shared stack require mutually exclusive access to that stack across threads. To address this, a separate-stack function call model is proposed for speculative multithreading on a chip multiprocessor (CMP). Cores and stacks are paired one to one, and the stacks cooperate to support function calls under speculative multithreading. To keep programs running correctly, the compiler adjusts the code it generates, and two instructions, get and update, are added to the simulator to manage the corresponding stack space, which removes the locking required by the shared stack model. Because the stack lock is eliminated, the number of successfully spawned threads increases to varying degrees, which raises the parallel speedup of the program. Tests on the Olden benchmark suite show that the separate stack model improves the average parallel speedup by 3.85% relative to the shared stack model. However, because the factors that affect speculative parallel speedup are complex, some test programs achieve a lower speedup with separate stacks than with the shared stack.
Source: Journal of Xi'an Jiaotong University (indexed in EI, CAS, CSCD, PKU Core), 2010, No. 12, pp. 10-15 (6 pages)
Funding: National High Technology Research and Development Program of China (2008AA01Z136)
Keywords: speculative multithreading; function call stack; code generation; memory management
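
The one-to-one pairing of cores and stacks described in the abstract can be illustrated with a small model. This is only a sketch under stated assumptions: the abstract names the get and update instructions but does not define their semantics, so the sketch assumes that get reads a core's stack pointer and update writes it. The class name, address layout, and frame sizes are hypothetical and are not taken from the paper or its simulator.

```python
class SeparateStacks:
    """Toy model: each core owns a private stack region, so no lock is needed.

    Assumption (not from the paper): `get` returns a core's stack pointer
    and `update` stores a new one. Stacks grow downward.
    """

    def __init__(self, num_cores, stack_size, base=0x1000):
        self.stack_size = stack_size
        # Lowest valid address of each core's private region.
        self.limit = [base + i * stack_size for i in range(num_cores)]
        # Each stack pointer starts at the top of its own region.
        self.sp = [low + stack_size for low in self.limit]

    def get(self, core_id):
        """Hypothetical `get`: read the stack pointer of one core."""
        return self.sp[core_id]

    def update(self, core_id, new_sp):
        """Hypothetical `update`: write the stack pointer of one core."""
        low = self.limit[core_id]
        if not (low <= new_sp <= low + self.stack_size):
            raise ValueError("stack pointer outside this core's region")
        self.sp[core_id] = new_sp

    def call(self, core_id, frame_bytes):
        """Allocate a frame for a function call on the core's own stack."""
        new_sp = self.get(core_id) - frame_bytes
        self.update(core_id, new_sp)
        return new_sp

    def ret(self, core_id, frame_bytes):
        """Release the frame when the function returns."""
        self.update(core_id, self.get(core_id) + frame_bytes)


stacks = SeparateStacks(num_cores=2, stack_size=256)
stacks.call(0, 64)           # core 0 enters a function
untouched = stacks.get(1)    # core 1's stack pointer is unaffected
```

Because a call on one core only touches that core's entry, no shared lock is involved, which is the property the abstract attributes to the separate stack model.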