期刊文献+

基于Hadoop的海量共现矩阵生成方法 被引量:13

A Method for Generating Co-occurrence Matrix of Mass Data Based on Hadoop
下载PDF
导出
摘要 海量数据的处理分析是当前信息处理技术的热点之一,介绍开源并行系统Hadoop的体系结构以及基于Hadoop的MapReduce编程框架,并在Hadoop基础上提出一种通过多重MapReduce操作,实现海量共现矩阵的生成方法。 Mass data processing is a focal point of information techniques. This paper introduces architecture of open source parallel system - Hadoop, analyzes the MapReduce programming framework based on Hadoop, and proposes a method for generating co - occurrence matrix of mass data through multiple MapReduce operations.
出处 《现代图书情报技术》 CSSCI 北大核心 2009年第4期23-26,共4页 New Technology of Library and Information Service
基金 国家"十一五"科技支撑计划子课题"网络科技信息监测与评价"(项目编号:2006BAH03B05)的研究成果之一
关键词 HADOOP MAPREDUCE 共现矩阵 开源软件 Hadoop MapReduce Co - occurrence matrix Open - source - software
  • 相关文献

参考文献8

  • 1HDFS Architecture [ EB/OL ]. [ 2008 - 12 - 10 ]. http ://hadoop. apache. org/core/docs/current/hdfs_design. html.
  • 2Hadoop Cluster Setup [ EB/OL]. [ 2008 - 12 - 15 ]. http://hadoop. apache. org/core/docs/current/clustcr_setup. html.
  • 3HadoopMapReduce [ EB/OL]. [ 2008 - 12 - 16 ]. http://wiki. apache. org/hadoop/HadoopMapReduce.
  • 4Distributed Computing with Linux and Hadoop. [ EB/OL]. [2009 - 01 -101. http ://www. ibm. com/developerworks/linux/library/l - hadoop/index. html.
  • 5Hbase [ EB/OL ]. [ 2009 - 01 - 10 ]. http ://hadoop. apache. org/ hbase/.
  • 6Hive[ EB/OL]. [2009 -01 - 15 ]. http://hadoop. apache. org/hive/.
  • 7Pig [ EB/OL ]. [ 2009 - 01 - 15 ]. http ://hadoop. apache. org/pig/.
  • 8CloudBase [ EB/OL ]. [ 2009 - 01 - 16 ]. http ://sourceforge. net/ projects/cloudbase/.

同被引文献120

引证文献13

二级引证文献310

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部