Journal Home > Volume 26 , Issue 1

Virtualization is the most important technology in the unified resource layer of cloud computing systems. Static placement and dynamic management are two types of Virtual Machine (VM) management methods. VM dynamic management is based on the structure of the initial VM placement, and this initial structure will affect the efficiency of VM dynamic management. When a VM fails, cloud applications deployed on the faulty VM will crash if fault tolerance is not considered. In this study, a model of initial VM fault-tolerant placement for star topological data centers of cloud systems is built on the basis of multiple factors, including the service-level agreement violation rate, resource remaining rate, power consumption rate, failure rate, and fault tolerance cost. Then, a heuristic ant colony algorithm is proposed to solve the model. The service-providing VMs are placed by the ant colony algorithms, and the redundant VMs are placed by the conventional heuristic algorithms. The experimental results obtained from the simulation, real cluster, and fault injection experiments show that the proposed method can achieve better VM fault-tolerant placement solution than that of the traditional first fit or best fit descending method.


menu
Abstract
Full text
Outline
About this article

A Multi-Objective Optimization Method of Initial Virtual Machine Fault-Tolerant Placement for Star Topological Data Centers of Cloud Systems

Show Author's information Wei ZhangXiao ChenJianhui Jiang( )
School of Software Engineering, Tongji University, Shanghai 201804, China.

Abstract

Virtualization is the most important technology in the unified resource layer of cloud computing systems. Static placement and dynamic management are two types of Virtual Machine (VM) management methods. VM dynamic management is based on the structure of the initial VM placement, and this initial structure will affect the efficiency of VM dynamic management. When a VM fails, cloud applications deployed on the faulty VM will crash if fault tolerance is not considered. In this study, a model of initial VM fault-tolerant placement for star topological data centers of cloud systems is built on the basis of multiple factors, including the service-level agreement violation rate, resource remaining rate, power consumption rate, failure rate, and fault tolerance cost. Then, a heuristic ant colony algorithm is proposed to solve the model. The service-providing VMs are placed by the ant colony algorithms, and the redundant VMs are placed by the conventional heuristic algorithms. The experimental results obtained from the simulation, real cluster, and fault injection experiments show that the proposed method can achieve better VM fault-tolerant placement solution than that of the traditional first fit or best fit descending method.

Keywords: cloud computing, virtual machine placement, fault tolerance, multi-objective optimization, heuristic ant colony algorithm

References(39)

[1]
I. Foster, Y. Zhao, I. Raicu, and S. Y. Lu, Cloud computing and grid computing 360-degree compared, in Proc. 2008 Grid Computing Environments Workshop, Austin, TX, USA, 2008, pp. 1-10.
DOI
[2]
H. L. Chen, A qualitative and quantitative study on availability of cloud computing, http://www.valleytalk.org/wp-content/uploads/2013/10/, 2013.
[3]
M. Nelson, B. H. Lim, and G. Hutchins, Fast transparent migration for virtual machines, in Proc. 2005 USENIX Annu. Technical Conf., Anaheim, CA, USA, 2005, pp. 391-394.
[4]
M. Lee, A. Krishnakumar, P. Krishnan, N. Singh, and S. Yajnik, Hypervisor-assisted application checkpointing in virtualized environments, in Proc. IEEE/IFIP 41st Int. Conf. Dependable Systems & Networks, Hong Kong, China, 2011, pp. 371-382.
DOI
[5]
X. Zhang, Z. G. Huo, J. Ma, and D. Meng, Fast and live whole-system migration of virtual machines, (in Chinese), Journal of Computer Research and Development, vol. 49, no. 3, pp. 661-668, 2012.
[6]
F. Xu, F. M. Liu, L. H. Liu, H. Jin, B. Li, and B. C. Li, iAware: Making live migration of virtual machines interference-aware in the cloud, IEEE Transactions on Computers, vol. 63, no.12, pp. 3012-3025, 2014.
[7]
K. J. Ye, Z. H. Wu, C. Wang, B. B. Zhou, W. S. Si, X. H. Jiang, and A. Y. Zomaya, Profiling-based workload consolidation and migration in virtualized data centers, IEEE Transactions on Parallel and Distributed Systems, vol. 26, no. 3, pp. 878-890, 2015.
[8]
H. K. Liu and B. S. He, VMbuddies: Coordinating live migration of multi-tier applications in cloud environments, IEEE Transactions on Parallel and Distributed Systems, vol. 26, no. 4, pp. 1192-1205, 2015.
[9]
J. Zhu, W. Dong, Z. F. Jiang, X. G. Shi, Z. Xiao, and X. M. Li, Improving the performance of hypervisor-based fault tolerance, in Proc. IEEE Int. Symp. Parallel & Distributed Processing, Atlanta, GA, USA, 2010, pp. 1-10.
DOI
[10]
J. Zhu, Z. F. Jiang, Z. Xiao, and X. M. Li, Optimizing the performance of virtual machine synchronization for fault tolerance, IEEE Transactions on Computers, vol. 60, no. 12, pp. 1718-1729, 2011.
[11]
D. Shen, J. Z. Luo, F. Dong, and J. X. Zhang, VirtCo: Joint coflow scheduling and virtual machine placement in cloud data centers, Tsinghua Science and Technology, vol. 24, no. 5, pp. 630-644, 2019.
[12]
Z. Liu, S. J. Sun, J. Xing, Z. Fu, X. H. Hu, J. W. Pi, X. F. Yang, Y. S. Lu, and J. Li, MN-SLA: A modular networking SLA framework for cloud management system, Tsinghua Science and Technology, vol. 23, no. 6, pp. 635-644, 2018.
[13]
Q. Li, Q. F. Hao, L. M. Xiao, and Z. J. Li, Adaptive management and multi-objective optimization for virtual machine placement in cloud computing, (in Chinese), Chinese Journal of Computers, vol. 34, no. 12, pp. 2253-2264, 2011.
[14]
K. Tsakalozos, M. Roussopoulos, and A. Delis, Hint-based execution of workloads in clouds with Nefeli, IEEE Transactions on Parallel and Distributed Systems, vol. 24, no. 7, pp. 1331-1340, 2013.
[15]
F. Farahnakian, A. Ashraf, T. Pahikkala, P. Liljeberg, J. Plosila, I. Porres, and H. Tenhunen, Using ant colony system to consolidate VMs for green cloud computing, IEEE Transactions on Services Computing, vol. 8, no. 2, pp. 187-198, 2015.
[16]
E. G. Coffman, M. R. Garey, and D. S. Johnson, Approximation algorithms for bin packing: A survey, in Approximation Algorithms for NP-Hard Problems. Boston, MA, USA: PWS Publishing, 1997, pp. 46-93.
[17]
N. Bobroff, A. Kochut, and K. Beaty, Dynamic placement of virtual machines for managing SLA violations, in Proc. 10th IFIP/IEEE Int. Symp. Integrated Management, Munich, Germany, 2007, pp. 119-128.
DOI
[18]
S. Chaisiri, B. S. Lee, and D. Niyato, Optimization of resource provisioning cost in cloud computing, IEEE Transactions on Services Computing, vol. 5, no. 2, pp. 164-177, 2012.
[19]
C. H. Lien, Y. W. Bai, and M. B. Lin, Estimation by software for the power consumption of streaming-media servers, IEEE Transactions on Instrumentation and Measurement, vol. 56, no. 5, pp. 1859-1870, 2007.
[20]
A. Verma, G. Dasgupta, T. K. Nayak, P. De, and R. Kothari, Server workload analysis for power minimization using consolidation, in Proc. 2009 Conf. USENIX Annu. Technical Conf., San Diego, CA, USA, 2009, p. 28.
[21]
J. K. Dong, H. B. Wang, and S. D. Cheng, Energy-performance tradeoffs in IaaS cloud with virtual machine scheduling, China Communications, vol. 12, no. 2, pp. 155-166, 2015.
[22]
X. Jing and J. A. B. Fortes, Multi-objective virtual machine placement in virtualized data center environments, in Proc. 2010 IEEE/ACM Int’l Conf. Green Computing and Communications & Int’l Conf. Cyber, Physical and Social Computing, Hangzhou, China, 2010, pp. 179-188.
[23]
F. Ma, F. Liu, and Z. Liu, Multi-objective optimization for initial virtual machine placement in cloud data center, Journal of Information and Computational Science, vol. 9, no. 16, pp. 5029-5038, 2012.
[24]
S. N. Wang, H. X. Gu, and G. Wu, A new approach to multi-objective virtual machine placement in virtualized data center, in Proc. IEEE 8th Int. Conf. Networking, Architecture and Storage, Xi’an, China, 2013, pp. 331-335.
DOI
[25]
F. Machida, M. Kawato, and Y. Maeno, Redundant virtual machine placement for fault-tolerant consolidated server clusters, in Proc. 2010 IEEE Network Operations and Management Symposium, Osaka, Japan, 2010, pp. 32-39.
DOI
[26]
G. Y. Jung, K. R. Joshi, M. A. Hiltunen, R. D. Schlichting, and C. Pu, Performance and availability aware regeneration for cloud based multitier applications, in Proc. IEEE/IFIP Int. Conf. Dependable Systems and Networks, Chicago, IL, USA, 2010, pp. 497-506.
DOI
[27]
E. Bin, O. Biran, O. Boni, E. Hadad, E. K. Kolodner, Y. Moatti, and D. H. Lorenz, Guaranteeing high availability goals for virtual machine placement, in Proc. 31st Int. Conf. Distributed Computing Systems, Minneapolis, MN, USA, 2011, pp. 700-709.
DOI
[28]
D. Epping and F. Denneman, VMware vSphere 4.1 HA and DRS Technical Deepdive. North Charleston, SC, USA: CreateSpace, 2010, pp. 15-22.
[29]
Z. B. Zhen, T. C. Zhou, M. R. Lyu, and I. King, FTCloud: A component ranking framework for fault-tolerant cloud applications, in Proc. IEEE 21st Int. Symp. Software Reliability Engineering, San Jose, CA, USA, 2010, pp. 398-407.
DOI
[30]
Z. B. Zheng, T. C. Zhou, M. R. Lyu, and I. King, Component ranking for fault-tolerant cloud applications, IEEE Transactions on Services Computing, vol. 5, no. 4, pp. 540-550, 2012.
[31]
F. Hermenier, J. Lawall, and G. Muller, BtrPlace: A flexible consolidation manager for highly available applications, IEEE Transactions on Dependable and Secure Computing, vol. 10, no. 5, pp. 273-286, 2013.
[32]
X. X. Cui, Multi-Objective Evolutionary Algorithms and Their Applications, (in Chinese). Beijing, China: National Defense Industry Press, 2006, pp. 10-20.
[33]
T. Stützle and H. H. Hoos, MAX-MIN ant system, Future Generation Computer Systems, vol. 16, no. 8, pp. 889-914, 2000.
[34]
R. N. Calheiros, R. Ranjan, A. Beloglazov, C. A. F. De Rose, and R. Buyya, CloudSim: A toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms, Software: Practice and Experience, vol. 41, no.1, pp. 23-50, 2011.
[35]
E. Cecchet, A. Chanda, S. Elnikety, J. Marguerite, and W. Zwaenepoel, Performance comparison of middleware architectures for generating dynamic web content, in Proc. 2003 ACM/IFIP/USENIX Int. Conf. Middleware, Rio de Janeiro, Brazil, 2003, pp. 242-261.
DOI
[36]
Y. Tamura, Kemari: Fault tolerant VM synchronization based on KVM, https://www.linux-kvm.org/images/0/0d/0.5.kemari-kvm-forum-2010.pdf, 2010.
[37]
A. Jin and J. H. Jiang, Fault injection scheme for embedded systems at machine code level and verification, in Proc. 15th IEEE Pacific Rim Int. Symp. Dependable Computing, Shanghai, China, 2009, pp. 55-62.
DOI
[38]
J. W. Hu and J. H. Jiang, Design and implementation of a fault injection mechanism for software reliability evaluation, (in Chinese), Journal of Computer-Aided Design & Computer Graphics, vol. 24, no. 6, pp. 741-751, 2012.
[39]
D. Q. Zhang, J. H. Jiang, and L. B. Chen, A method for validating the effectiveness of fault clustering and failure clustering of programs, Scientia Sinica Informationis, vol. 44, no. 10, pp. 1323-1344, 2014.
Publication history
Copyright
Acknowledgements
Rights and permissions

Publication history

Received: 27 June 2019
Accepted: 28 August 2019
Published: 19 June 2020
Issue date: February 2021

Copyright

© The author(s) 2021.

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Nos. 61432017 and 61772199).

Rights and permissions

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).

Return