Two-Phase Retrieval Strategy in Content-Aware Network Storage Systems (cited by: 1)
Abstract: As storage systems keep growing and their capacity approaches the exabyte scale, organizing, managing, and querying the resources they hold has become a problem researchers must address. Query requests in current storage systems come mainly from two sources: metadata queries issued by system administrators and keyword content queries issued by ordinary users. The de-duplication and block similarity detection capabilities built into content-aware storage systems, however, have not been used to optimize these queries. To take full advantage of the upper-layer semantic information and the lower-layer duplicate-block information that the storage system perceives, and to give users an efficient and convenient query service, a two-phase retrieval strategy for content-aware network storage systems is proposed. The strategy combines the upper-layer metadata and keyword queries with the lower-layer block similarity query of the storage system, and uses the weighted average of the relevance scores from the two queries as the similarity metric (the ranking coefficient) for query results. Experimental results show that the strategy is effective at reducing retrieval failures and improving recall.
Source: Computer Science (《计算机科学》), CSCD, Peking University Core Journal, 2011, No. 5, pp. 20-23, 48 (5 pages)
Funding: National Natural Science Foundation of China (60673001); ministry-level fund "Research on Intelligent Storage Systems Based on Service Customization"
Keywords: Metadata; Data migration; Content-addressable storage; Two-phase retrieval; Content awareness
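The abstract describes combining a metadata/keyword relevance score with a block similarity score through a weighted average. The following is a minimal sketch of that idea, not the authors' implementation. The function names, the weight `alpha = 0.5`, the use of Jaccard similarity over sets of block fingerprints, and the choice to score every file against the phase-one hits are all assumptions made for illustration; the paper's actual formulas are not given in this record.

```python
def jaccard(a, b):
    """Jaccard similarity of two sets of block fingerprints (assumed metric)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def two_phase_rank(keyword_scores, file_blocks, alpha=0.5):
    """Rank files by a weighted average of two relevance scores.

    keyword_scores: {file_id: relevance in [0, 1]} returned by the phase-one
        metadata/keyword query (hypothetical input format).
    file_blocks: {file_id: set of block fingerprints} known to the storage
        layer after de-duplication (hypothetical input format).
    alpha: weight of the phase-one score (assumed value, not from the paper).
    """
    # Phase one: files that matched the metadata/keyword query.
    seeds = [f for f, s in keyword_scores.items() if s > 0 and f in file_blocks]

    ranked = {}
    for f, blocks in file_blocks.items():
        # Phase two: best block similarity between this file and any hit.
        block_score = max(
            (jaccard(blocks, file_blocks[s]) for s in seeds),
            default=0.0,
        )
        keyword_score = keyword_scores.get(f, 0.0)
        # Weighted average of the two relevance scores.
        ranked[f] = alpha * keyword_score + (1 - alpha) * block_score

    return sorted(ranked.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    hits = {"a.txt": 0.8}
    blocks = {
        "a.txt": {"h1", "h2", "h3"},
        "b.txt": {"h2", "h3", "h4"},
        "c.txt": {"h9"},
    }
    for file_id, score in two_phase_rank(hits, blocks):
        print(file_id, round(score, 3))
```

In this toy run, `b.txt` never matched the keyword query but shares two blocks with the hit `a.txt`, so it is ranked above the unrelated `c.txt`. This is the kind of recall gain the abstract attributes to the second phase.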