期刊文献+

改进的大数据检索自适应性切换搜索算法 被引量:2

Improved Adaptive Switching Search Algorithm for Big Data Retrieval
下载PDF
导出
摘要 针对搜索引擎搜索索引大小不同时的效率不一致问题,文中在Hadoop分布式计算平台上利用Map-Reduce框架搭建分布式的搜索引擎及查询子系统,生成一种改进的自适应性切换搜索算法,以索引文件实现了高效的检索,索引文件较小时将其直接存入内存,索引文件较大时建立二级索引并读取内存中的索引列表,进行分布式的查询,通过设置足够多的集群的节点数进行测试。测试结果表明:当索引大小达到1000 MB,搜索时间由原始搜索算法的16.631 s缩短至7.259 s,文中方法对索引文件的搜索效率有显著提高。在索引文件更大的情况下,文中算法的优势也更明显,从而可以为网络论坛、网站以及其他用户提供高效的分布式搜索服务。 A search engine has different efficiency for different search index size.In order to solve this problem,the distributed search engine and query subsystem are built on the Hadoop distributed computing platform,generating an improved adaptive switch search algorithm.The algorithm implements efficient retrieval based on the index size.When the index file is small,it is directly stored in the memory.When the index file is large,the secondary index is established and the index list in the memory is read.Then the distributed query is performed.The number of nodes in multiple clusters is tested.The results show that when the index size reaches 1000 MB,the search time is reduced to 7.259 s from 16.631 s of the original search algorithm,indicating that the search efficiency of the proposed algorithm is significantly improved.For the much larger index files,the algorithm has obvious advantages,which can provide efficient distributed search services for web forums,websites and other users.
作者 吴雨晨 刘萍萍 徐江涛 WU Yuchen;LIU Pingping;XU Jiangtao(School of Computer Science and Engineering,Xi’an Technological University,Xi’an 710021,China)
出处 《西安工业大学学报》 CAS 2019年第6期688-695,共8页 Journal of Xi’an Technological University
基金 新型网络与检测控制国家地方联合工程实验室基金项目(GSYSJ20170009)
关键词 大数据平台 HADOOP 搜索引擎 自适应性切换搜索算法 big data platform Hadoop search engine adaptive switching search algorithm
  • 相关文献

参考文献6

二级参考文献42

  • 1赵德平,刘阳,李鹏.基于Lucene的房产信息垂直搜索引擎的研究[J].沈阳建筑大学学报(自然科学版),2011,27(1):178-183. 被引量:6
  • 2蔡建超,郭一平,王亮.基于Lucene.Net校园网搜索引擎的设计与实现[J].计算机技术与发展,2006,16(11):73-75. 被引量:14
  • 3Hadoop[EB/OL]. [2012-03-19]. http://hadoop.apache.org.
  • 4TomWhite.Hadoop权威指南[M].2版.北京:清华大学出版社,2011:15-73,167-188.
  • 5Tom White. Hadoop The Definitive Guide 2nd Edition [M]. Oreilly, 2010:41-73,167-188.
  • 6Jason Venner. Pro Hadoop[ M]. Apress, 2009 : 27-53.
  • 7李峰,王高丽,苏厚勤.离散数学[M].北京:清华大学出版社,2011:22-110.
  • 8T B. Ding, A Hirose. Fading channel prediction based on combi-nation of complex - valued neural networks and chirp Z - transform[J ]. IEEE Transactions on Networks and Learning Systems,2014,25(9) :1686-1695.
  • 9X H Xu, J Y Zhang, J Z Shi. Exponential stability of complex-valued neural networks with mixed delays [ J ]. Neurocomputing,2014,128(3);4B3-490.
  • 10ME Valle. Complex-valued recurrent correlation neural networks[J]. IEEE Transactions on Networks and Learning Systems,2014,25(9) :1600-1612.

共引文献81

同被引文献22

引证文献2

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部