
Research of Distributed Search Engine Based on Map/Reduce (cited by: 9)
Abstract: Building on an analysis of the Map/Reduce algorithm, this paper uses the open-source Hadoop software to design a highly fault-tolerant, high-performance distributed search engine, addressing the problems search engines face in processing and storing massive volumes of data.
Source: New Technology of Library and Information Service (《现代图书情报技术》), indexed in CSSCI and the Peking University Core Journals list, 2007, No. 8, pp. 52-55 (4 pages)
Keywords: Map/Reduce; distributed search engine; Hadoop
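The abstract names Map/Reduce as the basis for the search engine but, being a summary, does not show the mechanism. As a rough illustration only, the sketch below simulates the map, shuffle, and reduce steps in memory to build an inverted index, a typical search-engine workload. It is not the paper's design and does not use the Hadoop API; the function names (`map_phase`, `shuffle`, `reduce_phase`, `build_inverted_index`) and the sample documents are assumptions made up for this example.

```python
from collections import defaultdict


def map_phase(doc_id, text):
    """Map: emit one (term, doc_id) pair for every word in a document."""
    for word in text.lower().split():
        yield word, doc_id


def shuffle(pairs):
    """Shuffle: group emitted values by key, as a framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(term, doc_ids):
    """Reduce: collapse all document ids for a term into one sorted posting list."""
    return term, sorted(set(doc_ids))


def build_inverted_index(docs):
    """Run map, shuffle, and reduce sequentially over a dict of {doc_id: text}."""
    pairs = (
        pair
        for doc_id, text in docs.items()
        for pair in map_phase(doc_id, text)
    )
    return dict(reduce_phase(term, ids) for term, ids in shuffle(pairs).items())


# Hypothetical sample documents, for illustration only.
docs = {
    1: "hadoop distributed search",
    2: "distributed file system",
    3: "search engine",
}
index = build_inverted_index(docs)
```

In a real cluster the map and reduce calls would run on different nodes and the shuffle would move data over the network; here everything runs in one process so that the data flow is easy to follow.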

