期刊文献+

搜索引擎倒排文件的一种分块组织技术 被引量:9

A Blocking Inverted File Structure for Search Engine
下载PDF
导出
摘要 检索效率对大规模信息检索系统至关重要 .本文基于Web搜索应用背景 ,针对用户查询的统计特性 ,提出了一种分块组织倒排文件的方法 .通过建立检索性能模型 ,进行分析和仿真实验 ,结果表明这一方法下的检索算法可以有效的减少检索执行时间 ,并得到这一组织方式中分块参数的优化选择方法 . The efficiency of retrieval system is crucial for large scale information retrieval systems.By analyzing the documents and the users' query logs of a real search engine,a blocking inverted file structure is proposed.Simulation results show that the retrieval algorithm under the new organization of the inverted file can decrease its execution time significantly,and the optimal parameter selection for this blocking organization is discussed.
作者 彭波 李晓明
出处 《电子学报》 EI CAS CSCD 北大核心 2005年第2期358-362,共5页 Acta Electronica Sinica
基金 国家 973计划项目 (No G1 9990 32 70 6) 教育部博士点基金 (No 2 0 0 30 0 0 1 0 76)
关键词 搜索引擎 信息检索 倒排文件 检索效率 search engine information retrieval inverted file retrieval efficiency
  • 相关文献

参考文献15

  • 1B-S Jeong,E Omiecinski.Inverted file partitioning schemes in multiple disk systems[J].IEEE Transactions on Parallel and Distributed Systems,1995,6(2):142-153.
  • 2A Tomasic,H Garcia-Molina.Performance of inverted indices in shared-nothing distributed text document information retrieval systems[A].Proc PDIS Conf[C].San Diego,CA,1993.
  • 3F Scholer,H E Williams,J Yiannis,J Zobel.Compression of inverted indexes for fast query evaluation[A].Proceedings of the 25th annual international ACM SIGIR conference on research and development in information retrieval[C].Tampere,Finland,2002.222-229.
  • 4G Navarro,E Moura,M Neubert,N Ziviani,R Baeza-Yates.Adding compression to block addressing inverted indexes[J].Kluwer Information Retrieval Journal,2000.3(1):49-77.
  • 5Anh NgocVo,Alistair Moffat.Compressed inverted files with reduced decoding overheads[A].Proceedings of the 21st International Conference on Research and Development in Information Retrieval[C].New York City:ACM Press,August 1998.290-297.
  • 6Witten I H,Moffat A,Bell T C.Managing Gigabytes:Compressing and Indexing Documents and Images[M].Van Nostrand Reinhold,New York,1994.
  • 7A Moffat,J Zobel.Self-indexing inverted files for fast text retrieval[J].ACM Transactions on Information Systems,1996,14(4):349-379.
  • 8M Persin,J Zobel,R Sacks-Davis.Filtered document retrieval with frequency-sorted indexes[J].Journal of the American Society for Information Science,1996,47(10):749-764.
  • 9S Brin,L Page.The anatomy of a large-scale hypertexual Web search engine[A].In Proceedings of the 7th WWW conference[C].Computer Networks,Amsterdam,1998.
  • 10Lua K T.Frequency-rank curves and entropy for Chinese characters and words[J].Computer Processing of Chinese & Oriental Languages,1994,8(1):37-52.

二级参考文献1

  • 1Li W,IEEE Trans Information Theory,1992年,38卷,6期,1842页

共引文献14

同被引文献85

引证文献9

二级引证文献87

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部