期刊文献+

一种高效的倒排索引存储结构 被引量:22

Effective storage structure of inverted index
下载PDF
导出
摘要 倒排索引是信息检索系统的核心部分,其存储结构对检索的效率和效果起着至关重要的作用,根据汉语词汇的频率分布情况和当前的软硬件环境,提出一种高效的倒排索引结构,在一定程度上能够节省磁盘空间,提高检索效率,并且支持增量更新和删除。 Inverted index is the core component of an information retrieval system,the storage structure of it plays a crucial role in effect and efficiency of retrieval.In this paper,according to the frequencies distribution of Chinese vocabulary and the current hardware and software environment,the authors introduce an effective storage structure of inverted index that can save the disk usage and improve the efficiency of retrieval,as well as supporting real time update and delete.
作者 邓攀 刘功申
出处 《计算机工程与应用》 CSCD 北大核心 2008年第31期149-152,共4页 Computer Engineering and Applications
基金 国家自然科学基金No.60502032 No.60672068~~
关键词 倒排索引 词典 容量 追加块 inverted index dictionary capacity add-on block
  • 相关文献

参考文献9

  • 1Scholer F,Williams H E,Yiannis J,et al.Compression of inverted indexes for fast query evaluation[C]//Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval,Tampere,Finland,2002:222- 229.
  • 2Persin M,Zobel J,Sacks D R.Fihered document retrieval with frequency sorted indexes[J].Journal of the American Society for Information Science, 1996,47(10) :749-764.
  • 3Putz S.Using a relational database for an inverted text index,SSL- 91-20[R].Xerox PARC, 1999.
  • 4彭波,李晓明.搜索引擎倒排文件的一种分块组织技术[J].电子学报,2005,33(2):358-362. 被引量:9
  • 5贾崇,陆玉昌,鲁明羽.一种支持高效检索的即时更新倒排索引方法[J].计算机工程与应用,2003,39(29):198-201. 被引量:10
  • 6吴恒山,刘兴宇,左琼.一种基于可扩展散列表的倒排索引更新策略[J].计算机工程,2004,30(8):83-84. 被引量:6
  • 7Brin S,Page L.The anatomy of a large-scale hypertextual Web search engine[D].CA: Stanford University,2000.
  • 8Zipf G K.Human behavior and the principle of least effort[M].[S.l.]: Addison-wesley Press, 1949.
  • 9Jon P,Hamilton K M.A file system based inverted index[D].UK: Loughborough University of Technology, 1995.

二级参考文献36

  • 1[1]Zipf G K.Human Behavior and the Principle of Least Effort. Addisonwesley Press, 1949
  • 2[2]Fagin R,Nievergelt J,Pippenger N,et al. Extendible Hashing:a Fast Aecess Method for Dynamic Files. ACM Trans.on Database Systems,1979,4(3):315-344
  • 3[3]Melnik S,Raghavan S,Yang B,et al. Building a Distributed Full-text Index for the We b. In: Proceed ings of WWW 1 0, 2001
  • 4[4]Cutting D,Pedersen J.Optimizafion for Dynamic Inverted Index Maintenance. SIGIR90,1990:405-41 l
  • 5[5]Garcia-Molina H,Tomasic A,Shoens K.Incremental Updates of Inverted Lists for Text Document Retrieval.SIGMOD94,1994,23(2):289-300
  • 6[6]Chiueh T, Huang L.Efficient Real-time Index Updates in Text Retrieval Systems. ECSL Technical Report 66,1999
  • 7Brown E W,Callan J P,Croft W B.Fast Incremental Indexing for Full-Text Information Retrieval[C].In:Proceedings of the 20^th International Conference.
  • 8Moffat A,Zobel J.Compression and Fast Indexing for Multi-Gigabit Text Databases[J].Australian Comput J, 1994;26( 1 ) : 19.
  • 9Clark C L A,Cormack G V,Burkowski F J.Fast Inverted Indexes with On-Line Update[R].Technical Report CS-94-40,University of Waterloo Coputer Science Department, 1994-11.
  • 10Tzi-cker Chiueh.Lan Huang.Efficient Real-Time Index Upeates in Text Retrieval Systems, 1999-04-01.

共引文献22

同被引文献179

引证文献22

二级引证文献93

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部