期刊文献+

基于语句查询扩展和高性能计算平台的分布式信息检索系统DQSSQE

A Distributed Information Retrieval System DQSSQE Based on Sentences Query Expansion and High Performance Computing Platform
原文传递
导出
摘要 提出了一种基于语句的查询扩展方法以及语句向量的融合策略,使得扩展后的查询语句的查询性能优于原始查询语句;基于微软高性能计算平台HPC Server和查询扩展策略,设计实现了一个分布式文本检索系统DQSSQE.实验结果表明,在检索性能方面,所提出的查询扩展策略能够有效的提高查准率,召回率上也有一定的提高;在分布式检索计算性能方面,DQSSQE系统具有较好的计算加速比,随着文本集规模的增加,其计算性能的优越性体现明显. In this paper, a query expansion method based on sentences and a sentence vectors combination strategy are proposed to improve the query performance. A distributed text retrieval system DQSSQE is designed based on Mi- crosoft HPC Server platform and query expansion strategy. The experiment result shows that the proposed query ex- pansion strategy improves the precision ration greatly, and improves the recall ratio as well. At the same time, DQSSQE system gets a higher computation speedup ratio, and the more large the text set is, the higher performance the system will get, compared to the ordinary text retrieval systems.
出处 《武汉大学学报(理学版)》 CAS CSCD 北大核心 2012年第3期243-250,共8页 Journal of Wuhan University:Natural Science Edition
基金 国家自然科学基金(61070083 2011-2013) 国家软件工程重点实验室开放基金(2009.1-2010.12) 武汉市科技晨光计划(201150431105)资助项目
关键词 信息检索 查询扩展 高性能计算 分布式 information retrieval query expansion high performance computing distributed systems
  • 相关文献

参考文献23

  • 1Baeza-Yates R, Ribeiro-Neto B. Modern Information Retrieval[M]. Wokingham : Addison-Wesley, 1999.
  • 2田俊华,杨晓江.分布式并行信息检索系统的设计与实现——基础教育资源搜索引擎个案研究[J].现代图书情报技术,2007(8):76-79. 被引量:3
  • 3Voorhees E, Harman D. TREC: Experiment and Evaluation in Information Retrieval [ M ]. Cam- bridge : MIT Press, 2005.
  • 4Ganguly D, Leveling J, Jones G. Query Expansion for Language Modeling Using Sentence Similarities [ DB/ OL][2011-08-12]. http://dl, acm. org/citation. cfm? id= 2018151.
  • 5Ananth G, Vipin K, Anshul Gupta,et al. Introduce to Parallel Computing[M]. San Francisco: Benjamin- Cummings Publishing Company, Ipc, 1994.
  • 6Witsehel H F, Holz F, Heinrich G,et al. An Evalua- tion Measure for Distributed Information Retrieval Systems[DB/OL]. [2011-06-08]. hnp://zvortschatz. uni-leipzig, de/- fwitschel/ papers/ecirO8, pd f .
  • 7安俊秀.基于服务器集群的云检索系统的研究与示范[J].计算机科学,2010,37(7):179-182. 被引量:7
  • 8Michel S, Bender M, Ntarmos N,et al. Discovering and Exploiting Keyword and Attribute-value Co-occur- rences to Improve P2P Routing Indices [DB/OL]. [2011-05-23]. http://qid3, mmci. uni-saarland, de/ publications/cikm396michel, pd f .
  • 9Clay B. The Art of Concurrency :A Thread Monkey ' s Guide to Writing Parallel Applications [M]. Sebas topol,CA:.O'Reilly Media, Inc, 2009.
  • 10Allan J,Aslam J, Belkin N,et al. Challenges in infor- mation retrieval and language modeling[J]. SIGIR Forum, 2003,37( 1 ) : 31-47.

二级参考文献32

共引文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部