期刊文献+

基于Lucene的书目搜索相似度评分算法改进研究 被引量:3

The Research and Improvement of Catalog Search Scoring Algorithm of Similarity Based on Lucene
原文传递
导出
摘要 对Lucene内部的相似度评分算法进行研究分析,指出其在书目搜索中,未考虑图书的受欢迎度这一因素对搜索结果排序的影响。提出一种改进的算法并予以实现。实验结果证明,改进的算法能将较受读者欢迎的图书排列在前,提升读者的书目搜索体验。 It is a better solution to develop catalog search system based on Lucene. After analyzing and studying on the default scoring algorithm of Lucene similarity, the author points out that it does not consider the popularity for the sor- ting of search results when searching books catalog. The author puts forward and achieves an improving algorithm. The ex- perimental results show that the improving algorithm can put the popular books ahead of the sorting list as well as improve the readers' catalog searching experience.
作者 王泽贤
机构地区 广州大学图书馆
出处 《图书情报工作》 CSSCI 北大核心 2014年第4期94-98,共5页 Library and Information Service
基金 广州市教育科学"十二五"规划课题项目"关于用开源软件实现OPAC 2.0的研究"(项目编号:11A147)研究成果之一
关键词 LUCENE 书目搜索 相似度 Lucene catalog search similarity
  • 相关文献

参考文献11

  • 1Spink A, Jansen B J, Wolfram D, et al. From e - sex to e - com- merce: Web search changes [ J ]. IEEE Computer, 2002,35 ( 3 ) : 107 - 109.
  • 2范晨熙,黄理灿,李雪利.基于Lucene的BM25模型的评分机制的研究[J].工业控制计算机,2013,26(3):78-79. 被引量:15
  • 3陈建峡,黄日,马忠宝.基于PageRank的Lucene排序算法优化与实现[J].计算机工程与科学,2012,34(10):123-127. 被引量:12
  • 4白培发,王成良,徐玲.一种融合词语位置特征的Lucene相似度评分算法[J/OL].[2012-07-16].http://www.cnki.net/kcms!detail/11.2127.TP.20120716.1501.033.html.
  • 5黄承慧,印鉴,陆寄远.一种改进的Lucene语义相似度检索算法[J].中山大学学报(自然科学版),2011,50(2):11-15. 被引量:13
  • 6Salton G, Yang C S. On the specification of term values in automat- ic indexing [ J ]. Journal of Documentation, 1973,29 ( 4 ) : 351- 372.
  • 7Salton G, Buckley C. Term - weighting approaches in automatic text retrieval[ J]. Information Processing & Management, 1988, 24 (5) :513 -523.
  • 8Church K, Gale W. Inverse document frequency(IDF) :A measure of deviations from poisson [ C ]//Proceedings of the 3rd Workshop on Very Large Corpora. Boston, 1995:121 -130.
  • 9Classic Scoring Formula: Formula of Lueene' s classic Vector Space implementation[ EB/OL]. [2013 -08 - 15 ]. http://lucene, a- pache, org/eore/4 4 0/core/org,/apache/lucene/seareh/similari- ties/TFIDFSimilaritv, html.
  • 10李克潮,梁正友.基于多特征的个性化图书推荐算法[J].计算机工程,2012,38(11):34-37. 被引量:26

二级参考文献39

  • 1曾庆辉,邱玉辉.一种基于协作过滤的电子图书推荐系统[J].计算机科学,2005,32(6):147-150. 被引量:14
  • 2黄知义,周宁.Google搜索引擎的PageRank技术及其优化研究[J].图书馆学研究,2005(8):21-23. 被引量:1
  • 3管建和,甘剑峰.基于Lucene全文检索引擎的应用研究与实现[J].计算机工程与设计,2007,28(2):489-491. 被引量:71
  • 4Lucene. Lucene Java 3. O. 1 [ EB/OL]. ( 2010 - 02 - 26 ) [2010 - 03 - 30]. http ://lucene. apache, org/.
  • 5Nutch. Apache Nutch 1.0 release[ EB/OL]. (2009 -03 -23) [ 2010 - 03 - 30 ]. http://lucene. apache. org,/ nutch/.
  • 6YANG C, YANG K C, YUAN H C. Improving the search process through ontology-based adaptive semantic search [ J ]. Metadata and Semantics for Digital Libraries, 2007, 25 (2) : 234 - 248.
  • 7ZHU D Y, DREHER H. Determining and satisfying search users real needs via socially constructed search concept classification [ C ]//IEEE DEST 2007, 2007.
  • 8BUSCALDI D, ROSSO P. A bag-of-words based ranking method for the Wikipedia question answering task [ C ]// CLEF 2006, 2007 : 550 - 553.
  • 9DU L, JIN H D, DE VEL O, et al. A latent semantic indexing and WordNet based information retrieval model for digital forensics[ C ]// IEEE ISI 2008, 2008 : 70 - 75.
  • 10RAVISHANKAR D, THIRUNARAYAN K, IMMANENI T. A modular approach to document indexing and se- mantic search[C]// WTAS 2005, 2005: 165- 170.

共引文献62

同被引文献32

引证文献3

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部