期刊文献+

基于Web的Lucene全文搜索排序算法优化 被引量:2

Optimization of Lucene Ranking Pages Algorithm Based on Web
下载PDF
导出
摘要 基于Lucene向量空间模型搜索的排序算法缺乏对自然语言语义理解的能力,直接有效的方法是根据用户个体对搜索文档的喜好,对选中的文档得分加权,由此提出Download-through Rank算法,对原有的排序算法进行了改进,设计并实现了个性化搜索引擎。实验证明,改进后的搜索排序算法能够有效提高信息检索的准确度。 The ranking pages algorithm based on Lucene VSM lacks the ability to understand natural language. The direct and effective method is to weight the selected document accormng to me user s individual preferences. It proposed an approach to improve the original ranking algorithm by Download-through Rank algorithm, and designed personalized search engine. Experimental results showed that the proposed sorting algorithm can improve the accuracy of the information retrieval.
出处 《蚌埠学院学报》 2015年第5期34-38,共5页 Journal of Bengbu University
基金 安徽工程大学青年基金项目(2013YQ29)
关键词 lucene向量空间模型 相似度 排序算法 lucene VSM similarity sorting algorithm
  • 相关文献

参考文献9

  • 1王知津.信息存储与检索[M].北京:机械工业出版社,2011:6.
  • 2葛帅.开放源代码的全文检索引擎Lucene[EB/OL].http://www.lcene.com.cn/abou.thtm_Toe43005313.2010-08-15.
  • 3百度百科.Lucene[EB/OL].http://baike.baidu.com/view/371811.htm?fr=aladdin.2012-08-18.
  • 4吴昆.1ucene系统结构分析.volume.DOI[DB/OL].http://hi.baidu.com/hustwukun/item/1a189885c0734d5d26ebd9f0.
  • 5(美)Michael,McCandless.Lucene实战[M].牛长流,等,译.2版.北京:人民邮电出版社,2011:81-82.
  • 6陈建峡,黄日,马忠宝.基于PageRank的Lucene排序算法优化与实现[J].计算机工程与科学,2012,34(10):123-127. 被引量:12
  • 7Page L, Brin S, Motwani R, et al. The PageRank Citation Ranking : Bringing Order to the Web [ D ]. Palo Alto: Stan- ford University, 1995.
  • 8Huang Lan. A suervery on Web Information Retrieval Technologies [ R ]. State University of New York, Depart- ment of Computer Sciende ECSL, Technical Report TR - 120,2000.
  • 9李庆华,赵彦斌,赵峰,彭进劲.基于向量空间模型的并行信息检索算法[J].小型微型计算机系统,2005,26(9):1560-1562. 被引量:8

二级参考文献16

  • 1黄知义,周宁.Google搜索引擎的PageRank技术及其优化研究[J].图书馆学研究,2005(8):21-23. 被引量:1
  • 2Berry M W, Dumais S T, O'Brien G W. Using linear algebra for intelligent information retrieval[J]. SIAM Rev.. 1995,37 :573-595.
  • 3Berry M, Witter D. Intelligent information management using latent semantic indexing [C]. In: Proceedings of Interface'97,Interface of North America Foundation, Fairfax Station, VA.1997.
  • 4Dekel E. Dnassimi, Sahni S. Parallel matrix and graph algorithms [J]. SIAM Journal on Computing. Nov. 1981. 10 (4):657-675.
  • 5Berg D. A guide to the Oxford English Dictionary[M]. Oxford University Press. Oxford, UK, 1993.
  • 6Sparck Jones K. A statistical interpretation of term specity and its applications in retrieval[J]. Documentation, 1972, 28.11-21.
  • 7Bharat K, Broder A. Estimating the relative size and overlap of public web searchengines[C]. In: 7th International World Wide Web Conference, paper FP37, Elsevier Science, New York,1998.
  • 8Michael W Berry. Zlatko Drmac, Elizabeth R Jessup. Matrices.vector space information retrieval [J]. SIAM Review. 1999.41(2).
  • 9Nandigam J, Gudivada V N, Hamou-L A. Learning SoftwareEngineering Principles Using Open Source Software [ C]//Proc of the 38th Frontiers in Edueation Conference? 2008 :S3H-18-S3H-23.
  • 10Lucene[EB/OL]. [2012-08-01]. http://baike. baidu. com/view/371811. htm.

共引文献19

同被引文献13

引证文献2

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部