期刊文献+

基于存储计算的可重构加速架构设计

Design of reconfigurable accelerator based on in-memory computing
下载PDF
导出
摘要 为降低存储墙以及传统的冯诺依曼瓶颈对计算系统高性能和低功耗设计带来的影响,提出一种基于存储计算的硬件加速架构。将排列规则的存储阵列转化为可重构的计算源,在保证原来存储功能的情况下,完成特定运算,实现存储和计算的双重功能;采用后台数据传输机制隐藏处理器和片外存储计算逻辑通信的延时,充分利用存储器的块状组织结构,以高带宽实现不同任务的并行计算,提高系统性能。实验结果表明,相对于传统的加速结构,采用该架构可以使系统以低于2%的硬件开销,提升至少2倍性能。 To address the problem brought by the memory wall and Von Neumann bottleneck features including high energy,high latency and low bandwidth,a hardware accelerator framework based on in-memory computing was proposed.Regular memory array was transformed into configurable computing resources to accelerate variety of tasks-both data and compute intensive,serving dual purpose of storage and computing.Background data transfer was utilized to hide the high communication latency between processors and off-chip mem-computing logic,and the bank-based memory was connected with lightweight processing elements,realizing parallel computing of different tasks with high bandwidth,the performance was improved greatly.Experimental results show that the system can improve the performance by two times at the cost of chip area overhead of 2%,relative to the traditional accelerator framework.
出处 《计算机工程与设计》 北大核心 2016年第4期1071-1075,共5页 Computer Engineering and Design
基金 国家自然科学基金项目(61103008) 上海市科委专项基金项目(12511503700) 三星公司-复旦大学合作基金项目(SLSI-201403DD013)
关键词 存储计算 处理器 后台数据传输 加速器 并行计算 in-memory computing processor background data transfer accelerator parallel computing
  • 相关文献

参考文献13

  • 1Borkar S,Chien AA.The future of microprocessors[J].Communications of the ACM,2011,54(5):67-77.
  • 2王强,陈岚,郝晓冉.一种基于访存行为地址映射机制的混合内存系统[J].小型微型计算机系统,2014,35(6):1201-1206. 被引量:4
  • 3Zhang K.ISSCC 2013tech trends[EB/OL].[2013].http://isscc.org/trends/
  • 4谢佩博,顾华玺,贾林.片上网络路由算法的研究[J].计算机工程与设计,2009,30(13):3078-3081. 被引量:5
  • 5翁启源,杨洪斌,吴悦.层次总线型多核SoC结构系统级模拟[J].计算机工程与设计,2009,30(23):5355-5357. 被引量:1
  • 6Shi Y,Chen HM.Memcomputing:The cape of good hope[C]//Design,Automation and Test in Europe Conference and Exhibition,2014:1-3.
  • 7Paul S,Krishna A,Qian W,et al.MAHA:An energy-efficient malleable hardware accelerator for data-intensive applications[J].IEEE Transactions on Very Large Scale Integration Systems,2014,23(6):1005-1016.
  • 8Zhu Q,Vaidyanathan K,Shacham O,et al.Design automation framework for application-specific logic-in-memory blocks[C]//23rd International Conference on Application-Specific Systems,Architectures and Processors,2012:125-132.
  • 9Morad A,Yavits L,Ginosar R.GP-SIMD processing-inmemory[J].ACM Transactions on Architecture and Code Optimization,2014,11(4):53-77.
  • 10Hussain W,Airoldi R,Hoffmann H,et al.Design of an accelerator-rich architecture by integrating multiple heterogeneous coarse grain reconfigurable arrays over a network-on-chip[C]//25th International Conference on Application-Specific Systems,Architectures and Processors,2014:131-138.

二级参考文献63

  • 1王国明.高性能集群计算系统的结构与设计探讨[J].电脑知识与技术,2005(12):158-159. 被引量:3
  • 2杨庚,王江涛,程宏兵,容淳铭.基于身份加密的无线传感器网络密钥分配方法[J].电子学报,2007,35(1):180-184. 被引量:60
  • 3Dally W J,Towles B.Route packets,not wires:on-chip interconnection networks[C].proceeding of Design Automation Conference.Las Vegas,Nevada,United States:ACM Press,2001:684-689.
  • 4Ahmed H A.Network on a chip:architecture for billion transistor era[C].Proceeding of the IEEE NorChip Conference,2002:166-173.
  • 5Davide BIertozz,Shashi Kumar A P.Networks-on-chip:emerging research topics and novel ideas[Z].VLSI Design,2007.
  • 6Kariniemi H,Nurmi J.Arbitration and routing schemes for onchip packet networks[C].Interconnect-Centric Design for Advanced SoC and NoC,2005:253-282.
  • 7Viller T.Network on chip routing algorithms[R].TUCS Technical Report,2006.
  • 8Yun H,Zincir-Heywond A N.Intelligent ants for adaptive network routing[C].Proceedings of Second Annual Conference on Communication Networks and Services Research.IEEE Computer Society,2004:255-261.
  • 9Towles B,Dally W J,Boyd S.Throughput-centric routing algorithm design[C].Proceedings of the Fifteenth Annual ACM Symposium on Parallel algorithms and Architectures.San Diego,Califomia,USA:ACM Press,2003:200-209.
  • 10Ge-Ming C.The odd-even turn model for adaptive routing[J].IEEE Transactions on Parallel and Distributed Systems,2000,11 (7):729-738.

共引文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部