Abstract
XML is increasingly used as a data exchange standard over the Internet and intranets, but the tags added to every semantic content unit inflate its size considerably, making XML data expensive to transmit and store. Compression reduces the size markedly, yet how to evaluate queries efficiently and directly on the compressed data has received little study. Taking reverse arithmetic compression as the underlying compression algorithm, the authors propose ArithRegion, an index structure for compressed XML files in an XML database that uses a B+ tree as its backbone. With ArithRegion, queries of the form //element1/element2/.../elementm can be evaluated efficiently.
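The abstract does not give the details of the encoding or of ArithRegion itself, so the following is only an illustrative sketch. It assumes that the compression resembles the reverse arithmetic encoding of label paths known from the XPRESS compressor, in which each tag owns a sub-interval of [0, 1) and a path's interval is nested inside the interval of every suffix of that path. Under that assumption, a query //element1/.../elementm reduces to an interval containment test. The names `tag_intervals`, `encode_path`, and `PathIndex` are hypothetical, and a sorted list searched with `bisect` stands in for the B+ tree.

```python
import bisect
from collections import Counter
from fractions import Fraction


def tag_intervals(paths):
    """Give each tag a sub-interval of [0, 1) proportional to its frequency."""
    counts = Counter(tag for path in paths for tag in path)
    total = sum(counts.values())
    intervals, lo = {}, Fraction(0)
    for tag in sorted(counts):
        hi = lo + Fraction(counts[tag], total)
        intervals[tag] = (lo, hi)
        lo = hi
    return intervals


def encode_path(tags, intervals):
    """Map a root-to-element tag path to an interval.

    The last tag selects the outermost interval, so the interval of a path
    lies inside the interval of each of its suffixes.
    """
    lo, hi = Fraction(0), Fraction(1)
    for tag in tags:
        t_lo, t_hi = intervals[tag]
        width = t_hi - t_lo
        lo, hi = t_lo + width * lo, t_lo + width * hi
    return lo, hi


class PathIndex:
    """Sorted-array stand-in for a B+ tree keyed on interval lower bounds."""

    def __init__(self, paths):
        self.intervals = tag_intervals(paths)
        self.entries = sorted(
            encode_path(p, self.intervals) + (i,) for i, p in enumerate(paths)
        )
        self.lows = [entry[0] for entry in self.entries]

    def query(self, tags):
        """Return ids of elements whose path ends with the given tag sequence."""
        q_lo, q_hi = encode_path(tags, self.intervals)
        start = bisect.bisect_left(self.lows, q_lo)
        end = bisect.bisect_left(self.lows, q_hi)
        # Range scan on the lower bound, then keep only contained intervals.
        return sorted(i for lo, hi, i in self.entries[start:end] if hi <= q_hi)


if __name__ == "__main__":
    paths = [
        ["site", "people", "person"],
        ["site", "people", "person", "name"],
        ["site", "regions", "item", "name"],
        ["site", "people"],
    ]
    index = PathIndex(paths)
    print(index.query(["person", "name"]))
    print(index.query(["name"]))
```

Exact fractions are used here to keep the containment test free of rounding effects; a real system would need a fixed-precision representation, which this sketch does not address.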
Source
Acta Scientiarum Naturalium Universitatis Pekinensis (《北京大学学报(自然科学版)》)
EI
CAS
CSCD
PKU Core (Peking University core journals list)
2006, No. 1, pp. 103-109 (7 pages)
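Funding
National Key Basic Research Program of China (973 Program) (G1999032705)
National High-Tech R&D Program of China (863 Program), Major Special Project on Databases (2002AA4Z3440)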
Keywords
XML
index
B+ tree
arithmetic compression