期刊文献+

针对多线程应用程序的片上网络优化设计

On-chip network optimized design for multithreaded applications
下载PDF
导出
摘要 在多线程程序执行过程中,遇到同步屏障(Barrier)时,进度快的线程需要等待进度慢的线程,导致执行进度快的处理器处于等待状态,不仅影响性能且浪费功耗。对基于片上网络的多核处理器来说,数据包在网络中的路由过程所耗费的时间占据了较多的线程执行时间。为了均衡多线程的执行进度提升性能,本文提出了线程感知的虚拟通道分配和片上网络路由方案:一种动态的为执行进度慢的线程优先分配虚拟通道的机制AVCA和关键线程拥塞避免的路由策略CTCAR。实验结果表明,与传统没有线程感知路由算法相比,本文所提出的组合方案可以有效降低多线程应用程序数据包在片上网络传输时的平均延迟。 During the execution of a multithreaded application, when encountering a barrier, the thread with fast progress needs to wait for the thread with slow progress, therefore causes the processor with fast execution progress to be in a waiting state, which not only affects the performance, but also wastes power consumption. For multi-core processors based on the on-chip network, the routing process of packets in the network occupies more thread execution time. In order to balance the progress of multiple threads and improve the performance, this paper proposes a thread-aware virtual channel allocation and on-chip network routing scheme: a dynamic mechanism AVCA for assigning virtual channels to threads with slow execution progress in priority, and routing for key thread congestion avoidance strategy CTCAR. The experimental results show that, compared with the traditional thread-unaware routing algorithm, the combined scheme proposed in this paper can effectively reduce the average delay of multi-threaded application packets in the on-chip network transmission.
作者 胡海洋 李悦瑶 李阳 赵玉来 李建华 HU Haiyang;LI Yueyao;LI Yang;ZHAO Yulai;LI Jianhua(School of Computer Science and Information Engineering,Hefei University of Technology,Hefei 230601,China;Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine,Hefei University of Technology,Hefei 230601,China)
出处 《智能计算机与应用》 2022年第12期16-29,共14页 Intelligent Computer and Applications
基金 安徽省重点研究与开发计划(202004d07020004) 安徽省自然科学基金项目(2108085MF203)。
关键词 片上网络 多线程应用程序 拥塞避免 路由算法 虚拟通道 on-chip network multithreaded applications congestion avoidance routing algorithm virtual channel
  • 相关文献

参考文献9

二级参考文献82

  • 1张磊,李华伟,李晓维.用于片上网络的容错通信算法[J].计算机辅助设计与图形学学报,2007,19(4):508-514. 被引量:18
  • 2Kumar R,Farkas K I,Jouppi N P,et al.Single-ISA heterogeneous multi-core architectures:the potential for processor power reduction[C]//Proceedings of the 36th Annual IEEE/ACM International Symposium on Microarchitecture.Los Alamitos:IEEE Computer Society Press,2003:81-92.
  • 3Dally W J,Towles B.Principles and practices of interconnection networks[M].San Francisco:Morgan Kaufmann Publishers,2004:1-2.
  • 4Zhang L,Han Y H,Xu Q,et al.Defect tolerance in homogeneous manycore processors using core-level redundancy with unified topology[C]//Proceedings of Design,Automation and Test in Europe Conference & Exhibition.Los Alamitos:IEEE Computer Society Press,2008:891-896.
  • 5Zhang L,Han Y H,Li H W.Fault tolerance mechanism in chip many-core processors[J].Journal of Tsinghua Science and Technology,2007,12(Suppl1):169-174.
  • 6Burger D,Goodman J R.Billion-transistor architectures:there and back again[J].Computer,2004,37(3):22-28.
  • 7Chalasani S,Boppana R V.Fault-tolerance with multimodule routers[C]//Proceedings of the 2nd International Symposium on High-Performance Computer Architecture.Los Alamitos:IEEE Computer Society Press,1996:201-210.
  • 8Linder D H,Harden J C.An adaptive and fault tolerant wormhole routing strategy for k-ary n-cubes[J].IEEE Transactions on Computers,1991,40(1):2-12.
  • 9Ho C T,Stockmeyer L.A new approach to fault-tolerant wormhole routing for mesh-connected parallel computers[J].IEEE Transactions on Computers,2004,53(4):427-438.
  • 10Gómez M E,Nordbotten N A,Flich J,et al.A routing methodology for achieving fault tolerance in direct networks[J].IEEE Transactions on Computers,2006,55(4):400-415.

共引文献49

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部