期刊文献+

Improving Scalability of Cloud Monitoring Through PCA-Based Clustering of Virtual Machines 被引量:3

Improving Scalability of Cloud Monitoring Through PCA-Based Clustering of Virtual Machines
原文传递
导出
摘要 Cloud computing has recently emerged as a leading paradigm to allow customers to run their applications in virtualized large-scale data centers. Existing solutions for monitoring and management of these infrastructures consider virtual machines (VMs) as independent entities with their own characteristics. However, these approaches suffer from scalability issues due to the increasing number of VMs in modern cloud data centers. We claim that scalability issues can bc addressed by leveraging the similarity among VMs behavior in terms of resource usage patterns. In this paper we propose an automated methodology to cluster VMs starting from the usage of multiple resources, assuming no knowledge of the services executed on them. The innovative contribution of the proposed methodology is the use of the statistical technique known as principal component analysis (PCA) to automatically select the most relevant information to cluster similar VMs. We apply the methodology to two case studies, a virtualized testbed and a real enterprise data center. In both case studies, the automatic data selection based on PCA allows us to achieve high performance, with a percentage of correctly clustered VMs between 80% and 100% even for short time series (1 day) of monitored data. Furthermore, we estimate the potential reduction in the amount of collected data to demonstrate how our proposal may address the scalability issues related to monitoring and management in cloud computing data centers. Cloud computing has recently emerged as a leading paradigm to allow customers to run their applications in virtualized large-scale data centers. Existing solutions for monitoring and management of these infrastructures consider virtual machines (VMs) as independent entities with their own characteristics. However, these approaches suffer from scalability issues due to the increasing number of VMs in modern cloud data centers. We claim that scalability issues can bc addressed by leveraging the similarity among VMs behavior in terms of resource usage patterns. In this paper we propose an automated methodology to cluster VMs starting from the usage of multiple resources, assuming no knowledge of the services executed on them. The innovative contribution of the proposed methodology is the use of the statistical technique known as principal component analysis (PCA) to automatically select the most relevant information to cluster similar VMs. We apply the methodology to two case studies, a virtualized testbed and a real enterprise data center. In both case studies, the automatic data selection based on PCA allows us to achieve high performance, with a percentage of correctly clustered VMs between 80% and 100% even for short time series (1 day) of monitored data. Furthermore, we estimate the potential reduction in the amount of collected data to demonstrate how our proposal may address the scalability issues related to monitoring and management in cloud computing data centers.
出处 《Journal of Computer Science & Technology》 SCIE EI CSCD 2014年第1期38-52,共15页 计算机科学技术学报(英文版)
关键词 cloud computing resource monitoring principal component analysis k-means clustering cloud computing,resource monitoring,principal component analysis,k-means clustering
  • 相关文献

参考文献1

二级参考文献12

  • 1V. Bruno, T. Pieter, D.L. Marc, D.T. Filip, D. Bart, D. Piet, A scalable and preferment grid monitoring and information framework, The 2005 International Multiconference in Computer Science & Computer Engineering, 2005.
  • 2B. Tierney, R. Aydt, D. Gunter, W. Smith, M. Swany, V. Taylor, R. Wolski, A grid monitoring architecture, GGF Performance Working Group, 2002.
  • 3D. Lee, J.J. Dongarra., R.S. Ramakrishna., VISPERF: monitoring tool for grid computing, computational science, ICCS 2003, 2003, pp. 692-692.
  • 4A. Giovanni, C. Dario, E. Cosimo, A.G.P. Marra, A.G. Mastrantonio, A. Gianvito Quarta, Globus, Monitoring and discovery service and sensorML for grid sensor networks, In 15th IEEE International Workshops, 2006.
  • 5T.K.H. Chen, Sensor-grid computing and sensorGrid architecture for event detection classification and decision-making, Sensor Networks and Configuration, Springer-Verlag, 2006.
  • 6A. Giovanni, C. Dario, E. Cosimo, A.G.P. Marra, A.G. Mastrantonio, A. Gianvito Quarta, Globus, SensorML for Grid Sensor Networks, 2006.
  • 7V. Hingne, A. Joshi, E. Houstis, J.A.M.J. Michopoulos, On the grid and sensor networks, In Proceedings Fourth International Workshop on, 2003.
  • 8B.L. Hock, M.T. Yong, P. Mukherjee, A.T.L. Vinh, A. F.W. Weng, S.A. See, Sensor grid: integration of wireless sensor networks and the grid, In Local Computer Networks IEEE Conference on, 2005.
  • 9B. Zoltan, K. Peter, P. Norbert, V. Ferenc, Comparison of representative grid monitoring tools, Computer andAutomation Research Institute of the Hungarian Academy of Sciences (2000).
  • 10F. Steve, K. Markus, Ganglia native windows node agent, Technical report from APR Consulting, 2006.

共引文献1

同被引文献23

  • 1冯少冲,邸彦强,朱元昌,杨文兵.IaaS云计算中虚拟机部署算法研究[J].华中科技大学学报(自然科学版),2012,40(S1):359-364. 被引量:4
  • 2Aceto G,Botta A,de Donato W,et al.Cloud monitoring:a survey[J].Computer Networks,2013,57(9):2093-2115.
  • 3Boulon J,Konwinski A,Qi R,et al.Chukwa,a large-scale monitoring system[C]∥Proceedings of the 24th International Conference on Large Installation System Administration.Chicago:ACM,2008:1-5.
  • 4Litty L,Lagar-Cavilla H A,Lie D.Computer meteorology:monitoring compute clouds[C]∥Proceedings of the 12th Conference on Hot Topics in Operating Systems(HotOS'09).Berkeley:USENIX Association,2009:4-4.
  • 5Park J S,Yu H C,Chung K S,et al.Markov chain based monitoring service for fault tolerance in mobile cloud computing[C]∥Proceedings of 2011 IEEE Workshops of International Conference on Advanced Information Networking and Applications.Biopolis:ACM,2011:520-525.
  • 6Zou Deqing,Zhang Wenrong,Qiang Weizhong,et al.Design and implementation of a trusted monitoring framework for cloud platforms[J].Future Generation Computer Systems,2013,29(8):2092-2102.
  • 7Tian Wenhong,Sun Xiashuan,Cao Jun,et al.CloudMoni:a monitoring framework for on demand virtual machine allocation in cloud data centers[J].Journal of Information and Computational Science,2013,10(4):4639-4646.
  • 8Chen Huacai,Jin Hai,Hu Kan,et al.Scheduling overcommitted VM:behavior monitoring and dynamic switching-frequency scaling[J].Future Generation Computer Systems,2013,29(1):341-351.
  • 9Shao Zhiyuan,He Ligang,Lu Zhiqiang,et al.VSA:an offline scheduling analyzer for Xen virtual machine monitor[J].Future Generation Computer Systems,2013,29(8):2067-2076.
  • 10Brodsky B E,Darkhovsky B S.Nonparametric methods in change-point problems[M].Dordrecht:Kluwer Academic Publishers,1993.

引证文献3

二级引证文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部