期刊文献+

基于Hadoop云计算平台的文本处理算法的研究与改进 被引量:3

A Hadoop Cloud Platform Based Text Processing Algorithm:Research and Improvement
下载PDF
导出
摘要 Hadoop是Apache基金会下的一个开源分布式计算平台,以分布式文件系统HDFS(Hadoop Distributed File System)和Map Reduce分布式计算框架为核心,为用户提供了底层细节透明的云分布式基础设施。在对Hadoop进行深入分析和研究的基础上,搭建基于Hadoop的云计算平台,并完成分布式文本文件处理任务以及对文件文本内容处理算法的改进和实现。 Hadoop is an open source distributed computing platform under Apache Foundation. Taking HDFS (Hadoop Distributed File System)and MapReduce distributed computing framework as the core, it provides users with details of transparent distributed cloud infrastructure of the lower tier. Based on an in-depth analysis and study of Hadoop, a Hadoop- based cloud computing platform was established and distributed text file processing tasks and algorithms were completed.
作者 陈静
出处 《天津科技》 2016年第1期52-55,共4页 Tianjin Science & Technology
关键词 云计算 HADOOP 数据去重算法 HDFS MAPREDUCE cloud computing Hadoop text processing data deduplication algorithm HDFS MapReduce
  • 相关文献

参考文献5

  • 1陈康,郑纬民.云计算:系统实例与研究现状[J].软件学报,2009,20(5):1337-1348. 被引量:1312
  • 2林清滢.基于Hadoop的云计算模型[J].现代计算机,2010,16(7):114-116. 被引量:27
  • 3HDFS [EB/OL]. http: //hadoop.Apache.org/common/ does/r0. 20. 2/hdfs_user_guide. Html. Wikipedia.
  • 4Cloud Computing[EB/OL]. http: //en. wikipedia, org/wiki/Cloud_computing.
  • 5Wang Y D, Que X Y, Yu W K, et al. DhirajSehgal- Hadoop acceleration through network levitated mer- ge [C]. Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, 2011.

二级参考文献33

  • 1Sims K. IBM introduces ready-to-use cloud computing collaboration services get clients started with cloud computing. 2007. http://www-03.ibm.com/press/us/en/pressrelease/22613.wss
  • 2Boss G, Malladi P, Quan D, Legregni L, Hall H. Cloud computing. IBM White Paper, 2007. http://download.boulder.ibm.com/ ibmdl/pub/software/dw/wes/hipods/Cloud_computing_wp_final_8Oct.pdf
  • 3Zhang YX, Zhou YZ. 4VP+: A novel meta OS approach for streaming programs in ubiquitous computing. In: Proc. of IEEE the 21st Int'l Conf. on Advanced Information Networking and Applications (AINA 2007). Los Alamitos: IEEE Computer Society, 2007. 394-403.
  • 4Zhang YX, Zhou YZ. Transparent Computing: A new paradigm for pervasive computing. In: Ma JH, Jin H, Yang LT, Tsai JJP, eds. Proc. of the 3rd Int'l Conf. on Ubiquitous Intelligence and Computing (UIC 2006). Berlin, Heidelberg: Springer-Verlag, 2006. 1-11.
  • 5Barroso LA, Dean J, Holzle U. Web search for a planet: The Google cluster architecture. IEEE Micro, 2003,23(2):22-28.
  • 6Brin S, Page L. The anatomy of a large-scale hypertextual Web search engine. Computer Networks, 1998,30(1-7): 107-117.
  • 7Ghemawat S, Gobioff H, Leung ST. The Google file system. In: Proc. of the 19th ACM Symp. on Operating Systems Principles. New York: ACM Press, 2003.29-43.
  • 8Dean J, Ghemawat S. MapReduce: Simplified data processing on large clusters. In: Proc. of the 6th Symp. on Operating System Design and Implementation. Berkeley: USENIX Association, 2004. 137-150.
  • 9Burrows M. The chubby lock service for loosely-coupled distributed systems. In: Proc. of the 7th USENIX Symp. on Operating Systems Design and Implementation. Berkeley: USENIX Association, 2006. 335-350.
  • 10Chang F, Dean J, Ghemawat S, Hsieh WC, Wallach DA, Burrows M, Chandra T, Fikes A, Gruber RE. Bigtable: A distributed storage system for structured data. In: Proc. of the 7th USENIX Symp. on Operating Systems Design and Implementation. Berkeley: USENIX Association, 2006. 205-218.

共引文献1336

同被引文献11

引证文献3

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部