期刊文献+

数据网格中请求呈现分组特性的副本管理策略研究 被引量:3

Replication Strategies in Data Grid Systems with Clustered Demands
下载PDF
导出
摘要 在数据网格中,数据使用模式将影响系统性能.根据一些实际系统的测试结果,数据请求呈现出分组特性.为研究当数据请求呈现分组特性时请求分布与副本分布的关系,首先定义了数据网格中副本复制策略的模型,然后研究在数据请求呈现分组特性时平均访问延迟最小的最优策略.采用拉格朗日乘子法以及二分法对上述模型进行求解,得到了一个在请求分组模式下的最优下载副本策略.通过模拟实验对最优策略以及均匀复制策略、比例复制策略、平方根复制策略、LRU缓存策略的性能进行了比较.结果表明,最优策略所需广域网带宽最少,平均访问延迟最小. In data grid systems, data usage pattern plays an important role in system performance. According to some recent traces about real systems, data request and replica distribution exhibit clustering properties. Considered in this paper is the relationship between request distribution and replica distribution in data grid where request exhibits clustering properties. First the formal model of replication strategies in federated data grid system is given. The performance metrics include cumulative hit ratios and average access latency. Then investigated is what is the optimal way to replicate data with the objective of minimizing average access latency when request exhibits clustering properties. In the sense of minimizing average access latency, it is found that the more popular a file in a subgrid, the more replicas should be created in this subgrid furthermore, when requests distribute uniformly in system, replicas should be uniformly distributed in system too. The optimization model is solved by means of Lagrange multiplier method and bisection method. Then, an optimization downloading replication strategy for clustering demands is obtained. The performance of this strategy is compared with that of uniform replication strategy, proportional replication strategy, square root replication strategy and LRU caching strategy through simulation. Simulation results validate the effectiveness of optimal strategy. Compared with these popular strategies, the optimal strategy has advantages of least wide area network bandwidth requirement and least average access latency.
出处 《计算机研究与发展》 EI CSCD 北大核心 2009年第2期186-193,共8页 Journal of Computer Research and Development
基金 国家自然科学基金项目(90412006 90412011 60573110 90612016 60673152) 国家"九七三"重点基础研究发展计划基金项目(2004CB318000 2003CB317007) 国家"八六三"高技术研究发展计划基金项目(2006AA01A101 2006AA01A108 2006AA01A111)~~
关键词 数据网格 分组 副本策略 访问延迟 最优分布 data grid clustering replication strategy access latency optimized distribution
  • 相关文献

参考文献14

  • 1王意洁,肖侬,任浩,卢锡城.数据网格及其关键技术研究[J].计算机研究与发展,2002,39(8):943-947. 被引量:108
  • 2Foster I, Kesselman C. The Grid: Blueprint for a New Computing Infrastructure [M]. Beijing: China Machine Press, 2005:391-429
  • 3Bell W H, Cameron D G, Carvajal-Schiaffino R, et al. Evaluation of an economy-based file replication strategy for a data grid [C] //Proc of the 3rd IEEE/ACM int Syrup on Cluster Computing and the Grid. Los Alamitos, CA: IEEE Computer Society, 2003
  • 4Ekow O, Doron R, Alexandru R. Optimal file-bundle caching algorithms for data-grids [C] //Proc of the 2004 ACM/IEEE Conf on Supercomputing. Los Alamitos, CA: IEEE Computer Society, 2004
  • 5Iamnitchi A, Doraimani S, Garzoglio G. Filecules in high- energy physics: Characteristics and impact on resource management [C] //Proc of the 15th IEEE Int Syrup on High Performance Distributed Computing. Los Alamitos, CA: IEEE Computer Society, 2006 : 69-80
  • 6Cohen E, Shenker S. Replication strategies in unstructured peer-to-peer networks [C] //Proc of the Conf on Applications, Technologies, Architectures, and Protocols for Computer Communications. New York: ACM, 2002:177- 190
  • 7Lv Q, Cao P, Cohen E, et al. Search and replication in unstructured peer-to-peer networks [C] // Proc of the 16th Int Conf on Supercomputing. New York: ACM, 2002: 84- 95
  • 8Tewari S, Kleinrock L. Proportional replication in peer-to- peer network [C]//Proc of the 25th IEEE Int Conf on Computer Communications. Los Alamitos, CA: IEEE Computer Society, 2006
  • 9Tewari S, Kleinrock L. Optimal search performance in unstructured peer-to-peer networks with clustered demands [J]. IEEE Journal on Selected Areas in Communications, 2007, 25(1): 84-95
  • 10Iamnitchi A, Ripeanu M, Foster I. Srnall-world file-sharing communities [C] //Proc of the 23rd IEEE Computer Communications. Los Alamitos, Computer Society, 2004 Int Conf on CA : IEEE

二级参考文献6

  • 1[1]I Foster, C Kesselman. The Grid: Blueprint for a Future Computing Infrastructure. San Francisco, USA: Morgan Kaufmann Publishers, 1999
  • 2[2]Wolfgang Hoschek, Javier Jaen-Martinez. Data management in an international data grid project. In: ACM Int'l Workshop on Grid Computing (Grid'2000). Bangalore, India, 2000. 17~20
  • 3[3]W Allcock, A Chervenak, I Foster et al. The data grid: Towards an architecture for the distributed management and analysis of large scientific datasets. Journal of Network and Computer Applications, 2000, 23(3): 187~200
  • 4[4]Bill Allcock, Ian Foster, Veronika Nefedova. High-performance remote access to climate simulation data: A challenge problem for data grid technologies. In: The Conf of High-Performance Networking and Computing, Denver SC'2001. USA, 2001. 283~297
  • 5[5]M Baldonado, C Chang, L Gravano et al. The stanford digital library metadata architecture. International Journal Digital Libraries, 1997, 1(2): 108~121
  • 6[6]Chaitanya Baru, Reagan Moore, Arcot Rajasekar et al. The SDSC storage resource broker. In: Proc CASCON'98 Conference. Toronto, Canada, 1998

共引文献107

同被引文献19

  • 1庞丽萍,陈勇.网格环境下数据副本创建策略[J].计算机工程与科学,2005,27(2):1-2. 被引量:7
  • 2FOSTER I,KESSELMAN C.The grid:blueprint for a new computing infrastructure[M].Beijing:China Machine Press,2005:391-429.
  • 3FEITELSON D G,RUDOLPH L.Metrics and benchmarking for parallel job scheduling[C]//Proc of the Workshop on Job Scheduling Strategies for Parallel Processing.Berlin:Springer,1998:1-24.
  • 4BELL W H,CAMERON D G,CAPOZZA L.OptorSim:a grid simulator for studying dynamic data replication strategies[J].High Performance Computing Applications,2003,17(4):403-416.
  • 5TANG Ming,LEE B S,YEO C K.Dynamic replication algorithms for the multi-tier data grid[J].Future Generation Computer Systems,2005(4):775-790.
  • 6CAMERON D G,CARVAJAL-SCHIAFFINO R.OptorSim v2.1:installation and user guide[EB/OL].(2008-03-19)[2008-06-15].http://sourceforge.net/projects/optorsim.
  • 7QaisarRas001.数据网格中数据复制的研究[D].哈尔滨:哈尔滨工业大学,2008.
  • 8Andrea Domenici, Flavia Donno, Gianni Pucciani, et al. Replica consistency in a data grid[J]. Nuclear Instrument and Methods in Physics Research, 2004, 534:24-28.
  • 9Atakan Do, an. A study on performance of dynamic file replication algorithms for real-time fileaccess in data grids [ J ]. Future Generation Computer Systems, 2009, 25 (8) :829-839.
  • 10XIONG Jin, WU Sining, MENG Dan, et al. Design and performance of the dawning cluster file system[ C ]//Pro- ceedings of IEEE International Conference on Cluster Computing. Washington: IEEE Computer Society, 2003 : 232 -239.

引证文献3

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部