摘要
从信息检索角度出发,提出一种高效的索引,在结构索引中集成了倒排文档,可同时查询XML结构部分和关键词。双重索引策略很好地解决了基于路径表达式查询效率低的问题。
This paper examined an XML collection from the viewpoint of information retrieval (IR), and suggested an efficient index which combining the inverted file with a structure index, it could implement retrieval both on context and structure. The problem of low query efficiency based on path expression was well solved with dual index strategy.
出处
《计算机应用研究》
CSCD
北大核心
2007年第11期63-64,73,共3页
Application Research of Computers
基金
国家"863"计划资助项目(2001AA113182)
陕西省科技攻关计划基金资助项目(2002K06-G5)
关键词
可扩展标记语言
路径表达式
双重索引
倒排文档
XML( extensible markup language)
path expression
dual index
inverted file