期刊文献+

存储系统中的频繁访问模式挖掘

Mining frequent access patterns in storage systems
下载PDF
导出
摘要 研究、分析了影响经典的模式挖掘方法挖掘频繁访问模式的效率,使其难以被存储系统接受的主要因素——噪音的产生原因和表现类型,提出一种具有噪音过滤能力,适应存储系统频繁访问序列模式挖掘的新方法——Z-Miner。Z-Miner使用全局分支裁剪和分支聚类方法来过滤噪音,对实际系统工作负载的模拟结果显示,Z-Miner指导的预取可以使缓存失效率降低40%~66%,平均响应时间降低26%~66%。相对经典挖掘方法,Z-Miner的挖掘开销有1~2个数量级的下降,而预取优化效果提高了1倍。 Based on the analysis of the effect mechanism of the noise, a major factor that lowers the efficiency of frequent access pattern mining and makes classic mining methods unacceptable for storage systems, this paper proposes a novel mining method Z-Miner. The Z-Miner employs a global-branch-cutting and branch-clustering approach for noise filtering. The simulation results under real workloads show that the prefetching directed by the Z-Miner could reduce the cache miss ratio by 40 % - 66 %, and the average response time by 26 % - 66 %. Compared with classic mining methods, the overhead of the Z-Miner is 1 to 2 orders of magnitude less, while the efficiency of the prefetching is two times more.
出处 《高技术通讯》 EI CAS CSCD 北大核心 2009年第7期699-705,共7页 Chinese High Technology Letters
基金 863计划(2007AA01Z402) 973计划(2004CB318205)资助项目。
关键词 频繁访问模式 数据块关系 序列模式挖掘 聚类 预取 frequent access pattern, block correlations, sequential pattern mining, clustering, prefetching
  • 相关文献

参考文献10

  • 1Li Z M,Chen Z F,Srinivasan S M, et al.C-Miner: mining block correlations in storage systems[].Proceedings of the rd USENIX Conference on File and Storage Technologies.2004
  • 2Kuenning G H.The design of the SEER predictive caching scheme system[].Proceedings of the Workshop on Mobile Computing Systems and Applications.1994
  • 3Kuenning G H,Popek G J.Automated hoarding for mobile computers[].Proceedings of the th Symposium on Operating Systems Principles.1997
  • 4Grimsrud K S,Archibald J K,Nelson B E.Multiple prefetch adaptive disk caching[].IEEE Transactions on Knowledge and Data Engineering.1993
  • 5Han J,Pei J,Mortazavi-Asl B,et al.FreeSpan: Frequent pattern-projected sequential pattern mining[].Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining.2000
  • 6Yan X F,Han J,Afshar R.CloSpan:Mining Closed Sequential Patterns in Large Datasets[].Proceedings of the Third SIAM International Conference on Data Mining.2003
  • 7Pei Jian,Han Jia-Wei,Mortazavi-Asl B,et al.Mining sequential patterns by pattern-growth: the PrefixSpan approach[].IEEE Transactions on Knowledge and Data Engineering.2004
  • 8MacQueen J.Some methods for classification and analysis of multivariate observations[].Proceedings of the Fifth Berkeley Symposium on Mathematics Statistics and Science.1967
  • 9E.G. Coffman,,P.J. Denning.Operating Systems Theory[]..1973
  • 10Zhenmin Li,Zhifeng Chen,Yuanyuan Zhou.Mining block correlations to improve storage performance[].ACM Transactions on Storage.2005

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部