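Abstract
As storage systems continue to grow in scale, how to effectively organize, manage, and query the resources they hold has become a problem that researchers must address. Query demands in current storage systems come mainly from two sources: system administrators' queries on metadata and ordinary users' queries on keyword content. However, the de-duplication and block similarity detection capabilities that content-aware storage systems already possess have not been used to optimize these queries. To make full use of the upper-layer semantics and the lower-layer duplicate data block information perceived by the storage system, and to provide users with efficient and convenient query service, a two-phase retrieval strategy for content-aware network storage systems is proposed. The strategy combines upper-layer metadata and keyword queries with the underlying storage system's block similarity queries, and uses the weighted average of the relevance scores from the two queries as the similarity metric. Experimental results demonstrate the strategy's effectiveness in reducing retrieval failures and improving recall.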
As storage capacity approaches exabytes, efficiently organizing, finding, and managing data is becoming increasingly difficult. Query requests in storage systems come from two sources: the first is metadata retrieval issued by administrators, and the second is users' common
Source
《计算机科学》
CSCD
Peking University Core Journal
2011, No. 5, pp. 20-23, 48 (5 pages)
Computer Science
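Funding
National Natural Science Foundation of China (60673001)
Ministry-level fund project "Research on Intelligent Storage Systems Based on Service Customization"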
Keywords
Metadata
Data migration
Content addressable storage
Two-phase retrieval
Content aware
query. However, the de-duplication and block similarity detection functions of content-aware storage systems are not used to improve this query processing. To take advantage of the upper-layer semantic information and the lower-layer storage system's duplicate block information and deliver efficient query service to users, a two-phase retrieval strategy is introduced. It combines metadata/keyword queries with block similarity queries and uses a ranking coefficient to evaluate similarity among query results. The experiments indicate that the retrieval strategy effectively improves retrieval recall.
Key words
Metadata
Data migration
Content addressable storage
Two-phase retrieval
Content aware
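
The two-phase strategy summarized in the abstract might be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the StoredFile type, the keyword-overlap score, the use of Jaccard similarity over block fingerprints, and the weight alpha are all assumptions introduced here, since the abstract states only that the two relevance scores are combined by a weighted average.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StoredFile:
    """Hypothetical record; the field names are assumptions for illustration."""
    name: str
    keywords: frozenset      # upper-layer semantic information
    block_hashes: frozenset  # fingerprints of the file's data blocks


def keyword_relevance(query_terms, f):
    """Phase 1 score: fraction of query terms found in the file's keywords."""
    terms = set(query_terms)
    if not terms:
        return 0.0
    return len(terms & f.keywords) / len(terms)


def block_similarity(a, b):
    """Jaccard similarity of two block fingerprint sets (an assumed measure)."""
    union = a.block_hashes | b.block_hashes
    if not union:
        return 0.0
    return len(a.block_hashes & b.block_hashes) / len(union)


def two_phase_search(query_terms, files, alpha=0.5):
    """Rank files by a weighted average of the phase 1 and phase 2 scores.

    alpha is an assumed weight for the keyword score; 1 - alpha weights
    the block similarity score.
    """
    # Phase 1: upper-layer keyword query over every file.
    phase1 = {f: keyword_relevance(query_terms, f) for f in files}
    seeds = [f for f, score in phase1.items() if score > 0]

    # Phase 2: best block similarity between each file and any phase 1 hit.
    # A phase 1 hit is compared with itself too, so a hit that has blocks
    # receives the full block similarity component.
    results = []
    for f in files:
        sim = max((block_similarity(f, s) for s in seeds), default=0.0)
        score = alpha * phase1[f] + (1 - alpha) * sim
        if score > 0:
            results.append((f.name, score))
    return sorted(results, key=lambda r: r[1], reverse=True)
```

Under these assumptions, a file that matches no query keyword but shares most of its blocks with a keyword hit still appears in the ranked result, which is one way the block-level phase could raise recall.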