Server-Based Data Push Architecture for Multi-Processor Environments 被引量：3

Server-Based Data Push Architecture for Multi-Processor Environments

导出

摘要 Data access delay is a major bottleneck in utilizing current high-end computing （HEC） machines. Prefetching, where data is fetched before CPU demands for it, has been considered as an effective solution to masking data access delay. However, current client-initiated prefetching strategies, where a computing processor initiates prefetching instructions, have many limitations. They do not work well for applications with complex, non-contiguous data access patterns. While technology advances continue to increase the gap between computing and data access performance, trading computing power for reducing data access delay has become a natural choice. In this paper, we present a serverbased data-push approach and discuss its associated implementation mechanisms. In the server-push architecture, a dedicated server called Data Push Server （DPS） initiates and proactively pushes data closer to the client in time. Issues, such as what data to fetch, when to fetch, and how to push are studied. The SimpleScalar simulator is modified with a dedicated prefetching engine that pushes data for another processor to test DPS based prefetching. Simulation results show that L1 Cache miss rate can be reduced by up to 97% （71% on average） over a superscalar processor for SPEC CPU2000 benchmarks that have high cache miss rates. Data access delay is a major bottleneck in utilizing current high-end computing （HEC） machines. Prefetching, where data is fetched before CPU demands for it, has been considered as an effective solution to masking data access delay. However, current client-initiated prefetching strategies, where a computing processor initiates prefetching instructions, have many limitations. They do not work well for applications with complex, non-contiguous data access patterns. While technology advances continue to increase the gap between computing and data access performance, trading computing power for reducing data access delay has become a natural choice. In this paper, we present a serverbased data-push approach and discuss its associated implementation mechanisms. In the server-push architecture, a dedicated server called Data Push Server （DPS） initiates and proactively pushes data closer to the client in time. Issues, such as what data to fetch, when to fetch, and how to push are studied. The SimpleScalar simulator is modified with a dedicated prefetching engine that pushes data for another processor to test DPS based prefetching. Simulation results show that L1 Cache miss rate can be reduced by up to 97% （71% on average） over a superscalar processor for SPEC CPU2000 benchmarks that have high cache miss rates.

作者孙贤和 Surendra Byna 陈勇

机构地区 Department of Computer Science Illinois Institute of Technology Department of Computer Science Illinois Institute of Technology

出处《Journal of Computer Science & Technology》 SCIE EI CSCD 2007年第5期641-652,共12页 计算机科学技术学报（英文版）

基金 This research was supported in part by the National Science Foundation of U.S.A.under NSF Grant Nos. EIA-0224377,CNS-0406328,CNS-0509118,and CCF-0621435.

关键词 performance measurement evaluation MODELING simulation of multiple-processor system cache memory performance measurement, evaluation, modeling, simulation of multiple-processor system, cache memory

分类号 TP332 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献42

1DARPA. High productivity computing systems (HPCS), vision: Focus on the lost dimension of HPC “User &: system efficiency and productivity”. http://www.darpa.mil/ipto/programs/hpcs/vision.htm.
2John Hennessy, David Patterson. Computer Architecture: A Quantitative Approach. Fourth edition, Morgan Kaufmann, ISBN: 0123704901, 2006.
3Wm A Wulf, Sally A McKee. Hitting the memory wall: Implications of the obvious. ACM SIGARPH Computer Architecture News, March 1995, 23(1): 20-24.
4Chen T F, Baer J L. Effective hardware-based data prefetching for high performance processors. IEEE Transactions on Computers, 1995, 44(5): 609-623.
5Dahlgren F, Dubois M, Stenstrom P. Fixed and adaptive sequential prefetching in shared-memory multiprocessors. In Proc. International Conference on Parallel Processing (ICPP), Los Alamitos, CA, USA, CRC Press, 1993, Vol.1, pp.56--63.
6Fu J, Patel J H. Data prefetching in multiprocessor vector cache memories. In Proc. the 17th Annual International Symposium on Computer Architecture, Toronto, Canada, 1991, pp.54--63.
7Joseph D, Grunwald D. Prefetching using Markov predictors. In Proc. the 24th International Symposium on Computer Architecture, Denver-Colorado, 1997, pp.252-263.
8Gokul Kandiraju, Anand Sivasubramaniam. Going the distance for TLB prefetching: An application-driven study. In Proc. the International Symposium on Computer Architecture, Anchorage, Alaska, 2002, p.195.
9Alexander T, Kedem G. Distributed predictive cache design for high performance memory system. In Proc. the 2nd International Symposium on High Performance Computer Architecture (HPCA), San Jose, CA, 1996, pp.254-263.
10Collins J, Tullsen D, Wang H, Shen J. Dynamic speculative precomputation. In Proc. the 34th International Symposium on Microarvhitecture, Austin, Texas, 2001, pp.306-317.

同被引文献13

1王卷乐,游松财,谢传节.地学数据共享中的元数据标准结构分析与设计[J].地理与地理信息科学,2005,21(1):16-18. 被引量：24
2汪红兵,佘春东,范植华,李磊,徐帆江.基于JMS的数据推送系统的设计与实现[J].计算机应用,2005,25(B12):366-368. 被引量：6
3廖一兰,王劲峰,孟斌,李新虎.人口统计数据空间化的一种方法[J].地理学报,2007,62(10):1110-1119. 被引量：50
4FRANKLIN M,ZDONIK S."Data in your face":Push technology in perspective:ACM SIGMOD Record,1998[C].ACM.1998.
5BESSIS N.Model architecture for a user tailored data push service in data grids[A].Grid Technology for Maximizing Collaborative Decision Management and Support:Advancing Effective Virtual Organizations[C].2009.235-255.
6DUKE C,STEELE J.Geology and lithic procurement in Upper Palaeolithic Europe:A weights-of-evidence based GIS model of lithic resource potential[J].Journal of Archaeological Science,2010,37(4):813-824.
7孙君曼,方华京,孙慧君,张梅凤.基于推技术的网络化监控报警系统[J].计算机工程,2008,34(7):269-271. 被引量：9
8李新,南卓铜,吴立宗,冉有华,王建,潘小多,王亮绪,李红星,祝忠明.中国西部环境与生态科学数据中心：面向西部环境与生态科学的数据集成与共享[J].地球科学进展,2008,23(6):628-637. 被引量：33
9王大力.数字化地图制图要素分类编码[J].地球信息科学,2008,10(6):736-740. 被引量：3
10诸云强,冯敏,宋佳,刘润达.基于SOA的地球系统科学数据共享平台架构设计与实现[J].地球信息科学,2009,11(1):1-9. 被引量：30

引证文献3

1Surendra Byna,陈勇,孙贤和.Taxonomy of Data Prefetching for Multicore Processors[J].Journal of Computer Science & Technology,2009,24(3):405-417. 被引量：1
2朱晓林,邹宇,易琳,俞肇元.基于模型需求模板匹配的多源地理数据推送方法研究[J].地理与地理信息科学,2016,32(1):24-28. 被引量：9
3Anthony Kougkas,Hariharan Devarajan,Xian-He Sun.I/O Acceleration via Multi-Tiered Data Buffering and Prefetching[J].Journal of Computer Science & Technology,2020,35(1):92-120. 被引量：2

二级引证文献12

1张建勋,古志民.帮助线程预取技术研究综述[J].计算机科学,2013,40(7):19-23. 被引量：3
2马振峰.关于移动网络数字信息个性化定位传输仿真[J].计算机仿真,2018,35(3):132-135. 被引量：2
3朱杰,游雄,夏青.基于任务驱动的战场环境分析数据映射模型设计与实现[J].地理信息世界,2017,24(4):80-85. 被引量：5
4沈军彩.用户行为数据分析下的信息推送系统的设计[J].现代电子技术,2017,40(17):158-161. 被引量：12
5蒋科.基于数据特征矩阵的海量医疗信息特征推送研究[J].机械设计与制造工程,2019,48(3):59-63. 被引量：1
6魏世红,贾建军,姚娜.输电线路路径优选新方法研究[J].中国新技术新产品,2019,0(17):31-32.
7罗琼,林若钦.用于高效检索的数据结构模式快速匹配仿真[J].计算机仿真,2020,37(1):394-397. 被引量：1
8贲敏,刘朝斌,孙雪,刘剑,刘珊,沙琨.医学领域中的智能化图像识别技术应用[J].解放军医院管理杂志,2021,28(S01):63-65.
9何晓斌,高洁,肖伟,陈起,刘鑫,陈左宁.应用透明的超算多层存储加速技术研究[J].计算机工程,2022,48(12):1-8.
10唐彬,张海林,宋孝燕,何禹睿.基于模板化的CTC/TDCS系统工程数据配置方案[J].铁道通信信号,2023,59(9):79-84. 被引量：1

1黄镇谨.协同设计中数据库接口的设计与优化[J].广西工学院学报,2006,17(3):82-85. 被引量：2
2季冬.高性能计算处理器进展[J].中国教育网络,2013(11):33-34.
3郑飞,陆鑫达.新一代RISC微处理器的技术特征与趋向[J].小型微型计算机系统,1995,16(9):56-60. 被引量：1
4战一波.浅谈WMS内置模块:HEC系列模块[J].山东工业技术,2016(18):118-118.
5郑飞.超级流水线处理器MIPS R4000的结构设计及其特征[J].微处理机,1993,14(1):11-15.
6张晓明,赵科,张民选.基于遗传算法的网络处理器异构资源映射方法研究[J].小型微型计算机系统,2007,28(2):341-345.
7章立生,丁丹.Novel Voltage Scaling Algorithm Through Ant Colony Optimization for Embedded Distributed Systems[J].Journal of Beijing Institute of Technology,2007,16(4):430-436.
8张红海,范双景,周立民.超级流水技术研究[J].齐齐哈尔大学学报（自然科学版）,1999,15(3):57-60. 被引量：1
9张建勋,古志民.帮助线程预取技术研究综述[J].计算机科学,2013,40(7):19-23. 被引量：3
10施耐德电气Unity有奖问答获奖名单揭晓[J].软件,2006,27(4):11-11.

Journal of Computer Science & Technology

2007年第5期

浏览历史

内容加载中请稍等...

Server-Based Data Push Architecture for Multi-Processor Environments 被引量：3

参考文献42

同被引文献13

引证文献3

二级引证文献12

相关作者

相关机构

相关主题

浏览历史