期刊文献+

HDFS数据动态分布设计与实现

Design and Realization of Data Dynamic Distribution Algorithm for HDFS
下载PDF
导出
摘要 指出了由于SSD(Hard Disk Driver,HDD)的不对称I/O特性和价格因素,在分布式系统Hadoop中,由SSD和HDD组成混合存储系统是一种有效的解决方案。HDFS是针对同构集群设计的,对存储介质的类型未加以区分,在为数据块分配存储空间和保存数据块的过程中,不考虑存储介质之间的性能差异,导致新型存储介质的优势无法完全地发挥出来。为此,提出了一种动态的数据分布算法,将写频率较高的数据块存储在写性能较好的存储介质上,将读频率较高的数据块存储在读性能较好的存储介质上,以提高Hadoop系统的数据存取速度。测试结果表明:数据调度线程能够根据系统中数据块的读写情况,自适应地选择数据块的存放位置,提高了系统的数据存取速度。 Due to SSD’s I/O performance is asymmetrical and the high price,SSD can’t completely replace the traditional hard disk drive.Thus hybrid storage systems composed by the SSD and HDD gradually become the main way.HDFS is designed for the homogeneous cluster.There is no distinction between the types of storage media.In the process of allocating storage space for data blocks and saving data blocks,the performance differences between storage media are not taken into account,which inhibits the advantage of the storage medium.A dynamic data distribution algorithm is proposed to store data blocks with higher write frequency on storage media with better write performance and store data blocks with higher read frequency on storage media with better read performance to improve HDFS data access speed..The test results show that the block scheduling threads can adaptively select the storage location of block in the system to improve the data access speed of the system.
作者 蔡宇昂 张鑫䶮 Cai Yuang;Zhang Xinyan(Hubei University of Police,Wuhan,Hubei,430034,China;Huazhong Universily of Science and Technology,Wuhan,Hubei,430074,China)
出处 《绿色科技》 2020年第6期222-225,共4页 Journal of Green Science and Technology
关键词 HDFS分布式文件系统 数据分布 数据块调度 HDFS data distribution optimization data block scheduling
  • 相关文献

参考文献1

二级参考文献14

  • 1Lee S W, Park D J, Chung T S, Lee D H, Park S, Song H J. A log buffer-based flash translation layer using fully asso- ciative sector translation. ACM Transactions on Embedded Computing Systems (TECS), 2007, 6(3): 18.
  • 2Park D, Debnath B, Du D. CFTL: A convertible flash translation layer adaptive to data access patterns//Proceedings of the ACM SIGMETRICS International Conference on Meas- urement and Modeling of Computer Systems. New York, USA, 2010:365-366.
  • 3Ma D, Feng J, Li G. Lazyftl: A page-level flash translation layer optimized for nand flash memory//Proceedings of the 2011 International Conference on Management of Data (SIGMODll). Athens, Greece, 2011:1-121.
  • 4Graefe G. The five-minute rule twenty years later, and how flash memory changes the rules//Proceedings of the 3rd In- ternational Workshop on Data Management on New Hard- ware(DaMoN'07). New York, USA, 2007: 6.
  • 5Kgil T, Roberts D, Mudge T. Improving NAND flash based disk caches//Proceedings of the 35tb International Symposi um on Computer Architecture (ISCA ' 08). Beijing, China, 2008:327-338.
  • 6Kgil T, Mudge T. Flashcache: A NAND flash memory file cache for low power web servers//Proceedings of the 2006 International Conference on Compilers, Architecture and Synthesis for Embedded Systems(CASES'06). New York,USA, 2006:103-112.
  • 7Koltsidas I, Viglas S D. Flashing up the storage layer. Pro- ceedings of the VLDB Endowment, 2008, 1(1): 514-525.
  • 8Jung H, Shim H, Park S, Kang S, Cha J. Lruwsr: Integra- tion of LRU and writes sequence reordering for flash memory. IEEE Transactions on Consumer Electronics, 2008, 54(3): 1215-1223.
  • 9Ou Y, HOrder T, Jin P. CFDC: A flash-aware replacement policy for database buffer management//Proceedings of the 5th International Workshop on Data Management on New Hardware(DaMoN'09). New York, USA, 2009: 15-20.
  • 10Li Z, Jin P, Su X, Cui K, Yue L. CCF-LRU: A new buffer replacement algorithm for flash memory. IEEE Transactions on Consumer Electronics, 2009, 55(3) : 1351-1359.

共引文献16

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部