
Design and Development of the Mass Image Storage Platform Based on Hadoop

Cited by: 2
Abstract: With the rapid development and widespread use of the Internet, storing and retrieving massive volumes of image data has become an increasingly prominent problem. Traditional storage architectures suffer from low management efficiency, insufficient capacity, and high cost. Hadoop offers a new approach, since it can harness a cluster for high-speed computation and storage. However, when there are too many small files, the memory of the Hadoop NameNode becomes a bottleneck and system efficiency drops sharply. This paper proposes a Hadoop-based storage architecture that handles massive numbers of image files efficiently. By applying a classification algorithm in a preprocessing module and introducing an extended first-level index mechanism, the architecture addresses the processing of massive image collections and avoids the NameNode memory bottleneck. Experiments show that the system is easy to maintain and scales well, with considerable improvements in stability, security, and concurrency.
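The abstract does not describe the classification algorithm or the index layout in detail. Purely as an illustration of the general idea (packing many small images into a few large containers and locating each image through an index, so that the NameNode tracks a handful of large files instead of one entry per image), a minimal sketch might look like the following. The function names `pack_images` and `read_image`, the group-by-extension rule, and the in-memory containers are all assumptions made for this example, not the paper's method.

```python
from collections import defaultdict
from pathlib import PurePath


def pack_images(images):
    """Group small images and pack each group into one container.

    `images` maps a file name to its raw bytes. Returns (containers, index):
    `containers` maps a group key to one concatenated bytes object, and
    `index` maps each file name to (group key, offset, length).
    Grouping by file extension is a hypothetical stand-in for the paper's
    classification step, which the abstract does not specify.
    """
    buffers = defaultdict(bytearray)
    index = {}
    for name, data in sorted(images.items()):
        group = PurePath(name).suffix.lower() or "unknown"
        offset = len(buffers[group])          # where this image starts
        buffers[group].extend(data)           # append to the group's container
        index[name] = (group, offset, len(data))
    containers = {group: bytes(buf) for group, buf in buffers.items()}
    return containers, index


def read_image(containers, index, name):
    """Look up one image in the index and slice it out of its container."""
    group, offset, length = index[name]
    return containers[group][offset:offset + length]


if __name__ == "__main__":
    sample = {"a.jpg": b"\xff\xd8AAA", "b.jpg": b"\xff\xd8BB", "c.png": b"\x89PNGCC"}
    containers, index = pack_images(sample)
    assert read_image(containers, index, "b.jpg") == sample["b.jpg"]
    print(len(containers), "containers for", len(sample), "images")
```

In an actual HDFS deployment each container would presumably be written as one large file (a Hadoop SequenceFile is one common choice) and the index stored separately; the sketch keeps everything in memory so that it runs without a cluster.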
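Source: Computer Knowledge and Technology (《电脑知识与技术》), 2018, No. 6Z, pp. 135-137 (3 pages)
Funding: China West Normal University (西华师范大学) Talent Research Fund project (No. 17YC178); China West Normal University Research and Innovation Team project (No. CXTD2017-6); Sichuan Provincial Department of Science and Technology project (No. 2018ZR0235)
Keywords: massive images; Hadoop; distributed computing; storage architecture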
