Abstract
The inverted index is the core component of an information retrieval system, and its storage structure plays a crucial role in both the efficiency and the effectiveness of retrieval. Based on the frequency distribution of Chinese vocabulary and the current hardware and software environment, the authors propose an efficient storage structure for the inverted index that, to a certain extent, saves disk space and improves retrieval efficiency, while also supporting incremental update and deletion.
Source
Computer Engineering and Applications (《计算机工程与应用》)
CSCD
Peking University Core Journals (北大核心)
2008, Issue 31, pp. 149-152 (4 pages)
Funding
National Natural Science Foundation of China, No. 60502032
No. 60672068
Keywords
inverted index
dictionary
capacity
add-on block