期刊文献+

分布式检索在异构科技信息资源中的应用及优化 被引量:5

APPLICATION AND OPTIMIZATION OF DISTRIBUTED HETEROGENEOUS RETRIEVAL IN SCIENTIFIC AND TECHNOLOGICAL INFORMATION RESOURCES
下载PDF
导出
摘要 为解决使用传统集中式检索处理海量异构科技信息资源时存在单点故障、性能低、不易扩展等问题,提出一种在异构科技资源下应用的分布式高性能检索系统(DHRS),并对其核心技术进行重点研究和分析。针对检索结果资源访问开销大的问题,给出基于访问代价的评估算法。并结合实际应用场景对算法进行优化,优化后请求数减少了80%,实验环境下的性能平均提高了68%。同时通过真实数据集的测试,验证了DHRS检索海量科技资源的可行性,能够适用于对检索和扩展性能要求较高的场景。 When using the traditional centralized retrieval method to deal with massive heterogeneous technology information resources,there are many problems such as single point of failure,poor performance and extensibility. To solve this problem,a distributed high-performance retrieval system( DHRS) applied to heterogeneous technology resources is proposed. First,key techniques of the DHRS were studied and analyzed. Aiming at the problem of large access cost of retrieval results,an evaluation algorithm based on access cost was proposed. Secondly,the algorithm was optimized according to the practical application scenario. The number of requests after optimization was reduced by80%,and the performance in the experimental environment was improved by 68%. Finally,the test of real data sets proves the feasibility of DHRS retrieval of large amount of scientific and technological resources. It can be applied to search and extend performance requirements of the scene.
出处 《计算机应用与软件》 2017年第10期78-84,156,共8页 Computer Applications and Software
基金 国家自然科学基金项目(61462053) 中国博士后科学基金项目(2016M602730)
关键词 科技资源 分布式检索 海量数据 ElasticSearch 异构资源 Scientific and technological resources Distributed retrieval Massive data ElasticSearch Heterogene-ous resources
  • 相关文献

参考文献6

二级参考文献46

  • 1吴广君,王树鹏,陈明,李超.海量结构化数据存储检索系统[J].计算机研究与发展,2012,49(S1):1-5. 被引量:30
  • 2张玲,石子夜,王胜海.WCQA:CQL的一个通用语法分析器[J].数字图书馆论坛,2006(7):19-22. 被引量:1
  • 3YONG H K, CHOI I S. An efficient Web information extracting sys- tem[ C] // Proceedings of the IEEE ISIE 2001. Pusan, Korea: IEEE ISIE, 2001:1771 - 1774.
  • 4Apache. SOLR[EB/OL]. [2012- 10- 19]. http://lucene, a- pache, org/solr/.
  • 5JEFFERY D, SANJAY G. Map/Reduce: simplified data processing on large clusters[ C]//Proceedings of the IEEE Sixth Symposium on Operating System Design and Implementation. San Francisco, CA, USA: IEEE OSDI: 2004:107 -113.
  • 6Apache. Lucene [ EB/OL]. [ 2012 - 10 - 19]. http://lucene, a- pache, org/.
  • 7OsChina. IKAnalyzer[ EB/OL]. [ 2012 - 10 - 19]. http://www. oschina, net/p/ikanalyzer.
  • 8L1U Z Y, OUYANG C P, LI Y, et al. CloudMat: A Framework for Materials Web Resources Inte:ation in the Resource Description Framework-Based Cloud [J]. Advanced Science Letters, 2012(7): 111-115.
  • 9吴广印.RMS程序员开发指南[M].北京:北京万方软件有限公司,2012.
  • 10MapReduce: Simplified Data l:oeessing on Large Clu[OL]. [20i3-04:02]. h'ttp://static.googleusercontent.com/external_eontent/untrusted_dlep/researeh. google.eom/zh-CN//arehive/mapreduee-osdi04.pdf.

共引文献20

同被引文献44

引证文献5

二级引证文献35

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部