Research on Parallel Indexing for the Lucene-Based "Nongsou" (农搜) Agricultural Search Engine (cited by: 3)

Research on Lucene Parallel Index Solution Based on Agricultural Search Engine
Abstract: As a highly optimized inverted-index search engine, Lucene makes it feasible to build vertical and industry-specific search services and lowers the high technical barrier to search. In practice, however, two main problems arise: (1) as the number of indexed files grows, indexing time grows linearly, so index building degrades the search experience; (2) the hardware requirements of the search server make distributed indexing impractical. This paper builds indexes on several PCs simultaneously and then merges them, yielding a scalable search engine solution that greatly alleviates the impact of index building on search.
Source: Agriculture Network Information (《农业网络信息》), 2009, No. 8, pp. 30-31, 50 (3 pages)
Funding: National Science and Technology Support Program of the 11th Five-Year Plan (2006BAD10A05)
Keywords: Lucene; parallel index; search engine
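
The approach described in the abstract (build indexes on several machines at once, then merge them) can be illustrated with a minimal sketch. This is not the paper's implementation and does not use Lucene: worker threads stand in for the separate PCs, a plain dictionary stands in for a Lucene index, and the names `build_partial_index`, `merge_indexes`, and `parallel_index` are hypothetical. In Lucene itself, merging existing indexes is typically done with `IndexWriter.addIndexes`; whether the paper uses that call is not stated here.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def build_partial_index(docs):
    """Build an inverted index (term -> set of doc ids) for one shard.

    `docs` is a list of (doc_id, text) pairs. Tokenization is a naive
    lowercase whitespace split, standing in for a real analyzer.
    """
    index = defaultdict(set)
    for doc_id, text in docs:
        for term in text.lower().split():
            index[term].add(doc_id)
    return index


def merge_indexes(partials):
    """Merge partial indexes by taking the union of each term's postings."""
    merged = defaultdict(set)
    for partial in partials:
        for term, postings in partial.items():
            merged[term] |= postings
    return {term: sorted(ids) for term, ids in merged.items()}


def parallel_index(docs, workers=3):
    """Split `docs` into `workers` shards, index them concurrently, merge."""
    shards = [docs[i::workers] for i in range(workers)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(build_partial_index, shards))
    return merge_indexes(partials)


# Hypothetical sample documents, invented for illustration only.
docs = [
    (1, "rice planting guide"),
    (2, "wheat planting season"),
    (3, "rice pest control"),
    (4, "wheat pest report"),
]
index = parallel_index(docs, workers=2)
print(index["rice"])      # [1, 3]
print(index["planting"])  # [1, 2]
```

Because each shard is indexed independently, the merged result does not depend on the number of workers, which is what lets indexing capacity grow by adding machines.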