
Research of inverted index method based on block organizing technology (cited by 4)
Abstract: To further improve the overall efficiency of retrieval systems, this paper proposes an inverted index method based on block organization. A retrieval performance model of the inverted index is derived from data statistics, the organization strategy for the block index entries of the inverted file is analyzed, and the performance model is validated through simulation experiments. The results show that organizing the inverted file in blocks achieves higher algorithmic efficiency with fewer loop iterations in the retrieval algorithm and significantly reduces the algorithm's execution time, which confirms the feasibility of the block-indexed inverted file method.
Author: Yang Xiaobo (杨晓波)
Source: Computer Engineering and Applications (《计算机工程与应用》), CSCD, 2012, No. 5, pp. 113-117 (5 pages)
Funding: Natural Science Foundation of Zhejiang Province (No. Y1110023)
Keywords: retrieval performance model; block organization; inverted index; algorithm simulation
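
The abstract's central claim, that a block-organized inverted file needs fewer loop iterations per lookup, can be illustrated with a minimal sketch. The abstract does not describe the paper's actual block layout, so everything here is an assumption made for illustration: fixed-size blocks, a per-block header holding the block's largest document ID, and the hypothetical function names `build_blocks`, `contains_linear`, and `contains_blocked`.

```python
def build_blocks(postings, block_size):
    """Split a sorted posting list into fixed-size blocks (assumed layout).

    Returns (headers, blocks), where headers[i] is the largest docID in blocks[i].
    """
    blocks = [postings[i:i + block_size]
              for i in range(0, len(postings), block_size)]
    headers = [block[-1] for block in blocks]
    return headers, blocks


def contains_linear(postings, doc_id):
    """Scan the unblocked sorted list; return (found, loop_iterations)."""
    steps = 0
    for d in postings:
        steps += 1
        if d == doc_id:
            return True, steps
        if d > doc_id:  # sorted list: doc_id cannot appear later
            break
    return False, steps


def contains_blocked(headers, blocks, doc_id):
    """Scan block headers first, then only the single candidate block."""
    steps = 0
    for header, block in zip(headers, blocks):
        steps += 1
        if header >= doc_id:  # doc_id can only be in this block
            for d in block:
                steps += 1
                if d == doc_id:
                    return True, steps
                if d > doc_id:
                    break
            return False, steps
    return False, steps


if __name__ == "__main__":
    postings = list(range(0, 2000, 2))  # 1000 sorted, even docIDs
    headers, blocks = build_blocks(postings, block_size=32)
    print("linear :", contains_linear(postings, 1500))
    print("blocked:", contains_blocked(headers, blocks, 1500))
```

In this sketch the blocked lookup visits at most one header per block plus the entries of one block, so its iteration count grows with the number of blocks plus the block size, not with the full list length. Whether the paper's performance model has this form is not stated in the abstract.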


