期刊文献+

MapReduce编程模型中key值二次分类算法 被引量:1

Two times classification algorithm of Key value in Map Reduce programming model
下载PDF
导出
摘要 Map Reduce编程模型是分布式计算中最常用的编程模型,其主要目的是将单个巨大计算任务分割成多个小计算任务,并分别交由不同的计算机去处理。Map Reduce将任务分成map阶段和reduce阶段,每个阶段都是用key/value键值对作为输入和输出。针对Map Reduce中Map数量少,Reduce数量多的情况,文章将Map阶段任务中的Key值进行二次划分,提出一种Map Reduce编程模型中Key二次分类的方法。实验,证明该方法能够在原有基础上提高数据处理效率。 MapReduce programming model is the most commonly used programming model in the distributed computing.It divides a single huge computing task into multiple small computing tasks,which are processed by different computers respectively.MapReduce divides the task into the Map phase and the Reduce phase,each of which is used as input and output with the key/value key value pair.In view of the fact that the number of Map in MapReduce is small and the number of Reduce is large,the Key value of Mapphase task is divided in two times,and a method of two times classification of Key value in MapReduce programming model is proposed.Experiments show that the method can improve the efficiency of data processing on the original basis.
作者 刘帅 Liu Shuai(Department of Computer Application, Xinzhou Vocational and Technical College, Xinzhou, Shanxi 034000, China)
出处 《计算机时代》 2018年第3期58-59,62,共3页 Computer Era
关键词 MAPREDUCE Key/value 二次分类 MapReduce Key/Value two times classification
  • 相关文献

参考文献6

二级参考文献31

  • 1周锋,李旭伟.一种改进的MapReduce并行编程模型[J].科协论坛(下半月),2009(2):65-66. 被引量:14
  • 2刘远超,王晓龙,刘秉权.一种改进的k-means文档聚类初值选择算法[J].高技术通讯,2006,16(1):11-15. 被引量:23
  • 3吴宝贵,丁振国.基于Map/Reduce的分布式搜索引擎研究[J].现代图书情报技术,2007(8):52-55. 被引量:9
  • 4孙广中,肖锋,熊曦.MapReduce模型的调度及容错机制研究[J].微电子学与计算机,2007,24(9):178-180. 被引量:26
  • 5Han Jiawei,Kamber M.Data mining:concepts and tech- niques[M].San Francisco:Morgan Kaufmann Publishers, 2000.
  • 6Januzaj E, Kriegel H P, Pfeifle M.DBDC : Density-Based Distributed Clustering[C]//Proceedings of 9th International Conference on Extending Database Technology(EDBT). Oakland: IEEE Computer Press, 2004 : 88-105.
  • 7Samatova N F, Ostrouchov G.RACHET : an efficient cov- er-based merging of clustering hierarchies from distribut- ed datasets[J].Distributed and Parallel Databases,2002, 11 (2) : 157-180.
  • 8Johoson E, KarguPta H.Collective, hierarchical clustering from distributed, heterogeneous data[C]//Lecture Notes in Computer Science.Berlin: Springer, 2000 : 221-244.
  • 9Kargupta H.Sclable, distributed data mining using an agent based architecture[C]//Proceedings of 3rd Interna- tional Conference on Knowledge Discovery and Data Mining.Oakland .. AAAI Press, 1997 .. 211-214.
  • 10Hearst M A.Texttiling: segmenting text into multi-para- graph subtopic passages[J].Computational Linguistics, 1997,23(1) :33-64.

共引文献192

同被引文献12

引证文献1

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部