期刊文献+

改进的阵列处理器数据Cache实时动态迁移机制

Improved Real-Time Dynamic Migration Mechanism of Array Processor Data Cache
下载PDF
导出
摘要 片上分布式存储结构满足了阵列处理器对访存提出的高并行性要求,一定程度上缓解了“存储墙”问题。但是,在远程访问情况下,分布式存储结构存在的长延迟问题仍然十分突出。针对该问题,设计了一种改进的基于分布式数据Cache的实时动态迁移机制,采用四级全互连和迁移互连,以数据访问频率为依据对远程数据进行动态调度,有效降低了远程访存的延迟。并基于阵列处理器分布式Cache结构,通过运动补偿等典型算法的并行实现,对所提出的实时动态迁移机制进行全面验证测试。实验结果表明,采用实时动态迁移机制的分布式Cache在166.9 MHz的工作频率下,最高可提供10.68 GB/s的访存带宽。与同类结构相比,远程访问延迟降低了46.5%。 The on-chip distributed storage structure satisfies the high parallelism requirements of the array processor for memory access,and alleviates the problem of memory wall to some extent.However,in the case of remote access,the long latency problem of distributed storage structure is still very severe.Aiming at this problem,an improved real-time dynamic migration mechanism based on distributed data Cache is designed.It uses four-level fully interconnection and migration interconnection to dynamically schedule remote data based on data access frequency,effectively reducing the delay of remote access.Based on the distributed Cache structure of the array processor,the proposed real-time dynamic migration mechanism is verified by parallel implementation of typical algorithms such as motion compensation.The experimental results show that the distributed Cache with the real-time dynamic migration mechanism can provide data access bandwidth up to 10.68 GB/s at the operating frequency of 166.9 MHz.Compared to similar architectures,remote access latency is reduced by 46.5%.
作者 冯雅妮 蒋林 山蕊 刘阳 张园 FENG Yani;JIANG Lin;SHAN Rui;LIU Yang;ZHANG Yuan(School of Electronic Engineering,Xi'an University of Posts&Telecommunications,Xi'an 710121,China;Laboratory of Integrated Circuit,Xi'an University of Science and Technology,Xi'an 710054,China;School of Computer,Xi'an University of Posts&Telecommunications,Xi'an 710121,China)
出处 《计算机科学与探索》 CSCD 北大核心 2020年第12期2028-2038,共11页 Journal of Frontiers of Computer Science and Technology
基金 国家自然科学基金,Nos.61834005,61772417,61802304,61602377,61634004 陕西省重点研发计划,No.2017GY-060。
关键词 阵列处理器 分布式Cache 动态迁移 CACHE一致性 array processor distributed Cache dynamic migration Cache consistency
  • 相关文献

参考文献4

二级参考文献52

  • 1李浩,谢伦国.片上多处理器末级Cache优化技术研究[J].计算机研究与发展,2012,49(S1):172-179. 被引量:5
  • 2朱小虎,曹阳,王力纬.多级拥塞控制的NOC路由算法[J].北京邮电大学学报,2007,30(5):91-94. 被引量:10
  • 3Kim C,Burger D,Keckler S W.An adaptive,non-uniform cache structure for wire-delay dominated on-chip caches. Proc of Int Conf on Architectural Support for Programming Languages and Operating Systems . 2002
  • 4Chishti Z,Powell M D,Vijaykumar T N.Optimizing replication,communication,and capacity allocation in cmps. Proc of the32nd Annual Int Symp on Computer Architecture . 2005
  • 5Chishti Z,Powell M D,Vijaykumar T N.Distance associativity for high-performance energy-efficient non-uniform cache architectures. Proc of the36th Int Symp on Microarchitecture . 2003
  • 6Bell S,Edwards B,Amann J,et al.Tile64processor:A64-core soc with mesh interconnect. Proc of Int Solid-State Conference . 2008
  • 7Ros A,Acacio M E,Garcia J M.Scalable directory organization for tiled cmp architectures. Proc of Int Conf on Computer Design . 2008
  • 8Guz Z,Keidar I,Kolodny A,et al.Nahalal:Cache organization for chip multiprocessors. IEEE Computer Architecture Letters . 2007
  • 9Cho S Y,Jin L.Managing distributed,shared l2caches through os-level page allocation. Proc of the39th Annual IEEE/ACM Int Symp on Microarchitecture . 2006
  • 10Eisley N,Peh L S,Shang L.Leveraging on-chip networks for data cache migration in chip multiprocessors. Proc of Int Conf on Parallel Architectures and Compilation Techniques . 2008

共引文献30

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部