[go: up one dir, main page]

CN101814121A - Zero downtime HA of predictive virtual machine - Google Patents

Zero downtime HA of predictive virtual machine Download PDF

Info

Publication number
CN101814121A
CN101814121A CN201010149545A CN201010149545A CN101814121A CN 101814121 A CN101814121 A CN 101814121A CN 201010149545 A CN201010149545 A CN 201010149545A CN 201010149545 A CN201010149545 A CN 201010149545A CN 101814121 A CN101814121 A CN 101814121A
Authority
CN
China
Prior art keywords
computer
virtual machine
hardware
virtual
control
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201010149545A
Other languages
Chinese (zh)
Inventor
周龙飞
王洪亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
IEIT Systems Co Ltd
Original Assignee
Shandong High-End Server & Storage Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong High-End Server & Storage Research Institute filed Critical Shandong High-End Server & Storage Research Institute
Priority to CN201010149545A priority Critical patent/CN101814121A/en
Publication of CN101814121A publication Critical patent/CN101814121A/en
Pending legal-status Critical Current

Links

Landscapes

  • Hardware Redundancy (AREA)
  • Debugging And Monitoring (AREA)

Abstract

本发明提供一种预见性虚拟机零宕机HA,基于定期对计算机硬件进行检测,然后对各个硬件部分健康状态进行评估,当评估结果为计算机处于亚健康状态时,主动实施虚拟机平滑转移,从而达到虚拟机上运行的业务不中断的目的;硬件包括控制计算机、主计算机、备用计算机和存储器,控制计算机与主计算机、备用计算机和存储器通过网线连接组成局域网,在局域网中设置由控制计算机监控的虚拟计算机,在控制计算机系统中还设置有:硬件状态检测单元,评估控制单元,虚拟服务转移单元。The present invention provides a predictive virtual machine zero-downtime HA, which is based on regular detection of computer hardware, and then evaluates the health status of each hardware part. When the evaluation result shows that the computer is in a sub-healthy state, the virtual machine is actively implemented for smooth transfer. In order to achieve the purpose of uninterrupted business running on the virtual machine; the hardware includes the control computer, the main computer, the backup computer and the storage, and the control computer is connected with the main computer, the backup computer and the storage through a network cable to form a local area network. The virtual computer is also provided in the control computer system: a hardware state detection unit, an evaluation control unit, and a virtual service transfer unit.

Description

一种预见性虚拟机零宕机HA A predictive virtual machine zero downtime HA

技术领域technical field

本发明涉及计算机虚拟化技术,具体地说是一种虚拟机高可用性的保护方法或预见性虚拟机零宕机HA。The invention relates to computer virtualization technology, in particular to a virtual machine high-availability protection method or predictive virtual machine zero-downtime HA.

背景技术Background technique

l、虚拟化技术被越来越多的厂商采用。虚拟化可以大幅度提高组织中资源和应用程序的效率和可用性。在旧的“一台计算机一个应用程序”模式下,内部资源未得到充分利用,并且IT管理员将太多的时间花在计算机管理上,而不是用于创新。借助虚拟化技术客户可以最大限度的使用物理资源。l. Virtualization technology is adopted by more and more manufacturers. Virtualization can dramatically increase the efficiency and availability of resources and applications in an organization. Under the old "one application per computer" model, internal resources were underutilized and IT administrators spent too much time managing computers rather than innovating. With the help of virtualization technology, customers can maximize the use of physical resources.

2、而虚拟化技术是否具有一定的冗余性是客户选择一个虚拟化产品的重要考察点。目前大多数虚拟化开发商都有着自己的一套冗余机制,我们称这种机制为HA。2. Whether the virtualization technology has a certain degree of redundancy is an important inspection point for customers to choose a virtualization product. At present, most virtualization developers have their own set of redundancy mechanisms, which we call HA.

3、所谓的HA技术,是指由多个物理节点构成的一个集群环境下,当某一个节点发生宕机时,在这个节点上运行的虚拟业务将会转移到另一个节点上。在这种机制下,客户可以持续的使用虚拟业务。3. The so-called HA technology refers to a cluster environment composed of multiple physical nodes. When a node goes down, the virtual business running on this node will be transferred to another node. Under this mechanism, customers can continue to use virtual services.

4、但是,传统的HA技术存在一个较大的缺陷。当集群里的节点发生宕机时,虚拟机并不是马上转移,由于虚拟机数据存放在共享存储中,所以管理端可以将虚拟机在其他计算机上重新启动来达实现HA功能。而重启必将导致虚拟服务的完全中断。而这也导致了客户在使用虚拟业务时发生延时。4. However, there is a big defect in the traditional HA technology. When a node in the cluster goes down, the virtual machine is not transferred immediately. Since the data of the virtual machine is stored in the shared storage, the management side can restart the virtual machine on other computers to achieve the HA function. The restart will inevitably lead to a complete interruption of the virtual service. And this also leads to delays when customers use virtual services.

所以,我们在这里提出一种新的解决方案来防止这种情况的发生,我们称它为:一种可预见性虚拟机零宕机HA。So, here we propose a new solution to prevent this from happening, we call it: a predictable virtual machine zero downtime HA.

发明内容Contents of the invention

本发明的目的是提供一种预见性虚拟机零宕机HA。The purpose of the present invention is to provide a predictive virtual machine zero downtime HA.

本发明的目的是按以下方式实现的,采用硬件探测技术对服务器底层硬件进行探测,探测包括CPU温度,风扇转速,硬盘容量,硬盘读写速率,对每一项进行检查,并给出一个综合数值,当计算机处于亚健康状态时,主动迁移虚拟业务,使虚拟机在计算机发生宕机之前主动实施虚拟机平滑转移,从而达到虚拟机上运行的业务不中断的目的;The purpose of the present invention is achieved in the following manner, using hardware detection technology to detect the underlying hardware of the server, the detection includes CPU temperature, fan speed, hard disk capacity, hard disk read and write speed, each item is checked, and a comprehensive Value, when the computer is in a sub-health state, actively migrate the virtual business, so that the virtual machine can actively implement the smooth migration of the virtual machine before the computer goes down, so as to achieve the purpose of uninterrupted business running on the virtual machine;

硬件包括控制计算机、主计算机、备用计算机和存储器,控制计算机与主计算机、备用计算机和存储器通过网线连接组成局域网,在局域网中设置由控制计算机监控的虚拟计算机,在控制计算机系统中还设置有:硬件状态检测单元,评估控制单元,虚拟服务转移单元,其中:The hardware includes a control computer, a main computer, a backup computer and a memory. The control computer is connected with the main computer, the backup computer and the memory through a network cable to form a local area network. A virtual computer monitored by the control computer is set in the local area network. In the control computer system, there are also: A hardware status detection unit, an evaluation control unit, and a virtual service transfer unit, wherein:

硬件状态检测单元是负责定期探测主计算机、备用计算机硬件健康状态;The hardware status detection unit is responsible for regularly detecting the hardware health status of the main computer and the standby computer;

评估控制单元对主计算机和备用计算机硬件综合健康状态进行判断,当某一台算计机处于亚健康状态运行时,则触发虚拟机平滑转移到另一台计算机上继续运行,以保证客户业务的连续;The evaluation control unit judges the comprehensive health status of the main computer and the backup computer hardware. When a computer is running in a sub-healthy state, it triggers the smooth transfer of the virtual machine to another computer to continue running, so as to ensure the continuity of customer business ;

虚拟服务转移实施单元负责处理虚拟机转移实施。The virtual service transfer implementation unit is responsible for handling virtual machine transfer implementation.

本发明的有益效果是,保证了客户对虚拟业务的连续使用。对于客户工作效率以及经济效益都有良好的作用。The beneficial effect of the present invention is that the continuous use of virtual services by customers is guaranteed. It has a good effect on customer work efficiency and economic benefits.

具体实施方式Detailed ways

本发明的一种预见性虚拟机零宕机HA,是采用硬件探测技术对服务器底层硬件进行探测,探测包括CPU温度,风扇转速,硬盘容量,硬盘读写速率等。对每一项进行检查,并给出一个综合数值。如果计算机处于亚健康状态,此时,主动迁移虚拟业务,使虚拟机在计算机发生宕机之前进行转移。A predictive virtual machine zero downtime HA of the present invention uses hardware detection technology to detect the underlying hardware of the server, including CPU temperature, fan speed, hard disk capacity, and hard disk read/write speed. Each item is checked and a composite value is given. If the computer is in a sub-healthy state, at this time, actively migrate the virtual service, so that the virtual machine can be transferred before the computer goes down.

硬件包括控制计算机、主计算机、备用计算机和存储器,控制计算机与主计算机、备用计算机和存储器通过网线连接组成局域网,在局域网中设置由控制计算机监控的虚拟计算机,在控制计算机系统中还设置有:硬件状态检测单元,评估控制单元,虚拟服务转移单元,其中:The hardware includes a control computer, a main computer, a backup computer and a memory. The control computer is connected with the main computer, the backup computer and the memory through a network cable to form a local area network. A virtual computer monitored by the control computer is set in the local area network. In the control computer system, there are also: A hardware status detection unit, an evaluation control unit, and a virtual service transfer unit, wherein:

硬件状态检测单元是负责定期探测主计算机、备用计算机硬件健康状态;The hardware status detection unit is responsible for regularly detecting the hardware health status of the main computer and the standby computer;

评估控制单元对主计算机和备用计算机硬件综合健康状态进行判断,当某一台算计机处于亚健康状态运行时,则触发虚拟机平滑转移到另一台计算机上继续运行,以保证客户业务的连续;The evaluation control unit judges the comprehensive health status of the main computer and the backup computer hardware. When a computer is running in a sub-healthy state, it triggers the smooth transfer of the virtual machine to another computer to continue running, so as to ensure the continuity of customer business ;

虚拟服务转移实施单元负责处理虚拟机转移实施。The virtual service transfer implementation unit is responsible for handling virtual machine transfer implementation.

Claims (1)

1.一种预见性虚拟机零宕机HA,其特征在于,采用硬件探测技术对服务器底层硬件进行探测,探测包括CPU温度,风扇转速,硬盘容量,硬盘读写速率,对每一项进行检查,并给出一个综合数值,当计算机处于亚健康状态时,主动迁移虚拟业务,使虚拟机在计算机发生宕机之前主动实施虚拟机平滑转移,从而达到虚拟机上运行的业务不中断的目的;1. A predictive virtual machine zero downtime HA, characterized in that the hardware detection technology is used to detect the underlying hardware of the server, including CPU temperature, fan speed, hard disk capacity, hard disk read and write speed, and check each item , and give a comprehensive value, when the computer is in a sub-health state, actively migrate the virtual business, so that the virtual machine can actively implement the smooth migration of the virtual machine before the computer goes down, so as to achieve the purpose of uninterrupted business running on the virtual machine; 硬件包括控制计算机、主计算机、备用计算机和存储器,控制计算机与主计算机、备用计算机和存储器通过网线连接组成局域网,在局域网中设置由控制计算机监控的虚拟计算机,在控制计算机系统中还设置有:硬件状态检测单元,评估控制单元,虚拟服务转移单元,其中:The hardware includes a control computer, a main computer, a backup computer and a memory. The control computer is connected with the main computer, the backup computer and the memory through a network cable to form a local area network. A virtual computer monitored by the control computer is set in the local area network. In the control computer system, there are also: A hardware status detection unit, an evaluation control unit, and a virtual service transfer unit, wherein: 硬件状态检测单元是负责定期探测主计算机、备用计算机硬件健康状态;The hardware status detection unit is responsible for regularly detecting the hardware health status of the main computer and the standby computer; 评估控制单元对主计算机和备用计算机硬件综合健康状态进行判断,当某一台算计机处于亚健康状态运行时,则触发虚拟机平滑转移到另一台计算机上继续运行,以保证客户业务的连续;The evaluation control unit judges the comprehensive health status of the main computer and the backup computer hardware. When a computer is running in a sub-healthy state, it triggers the smooth transfer of the virtual machine to another computer to continue running, so as to ensure the continuity of customer business ; 虚拟服务转移实施单元负责处理虚拟机转移实施。The virtual service transfer implementation unit is responsible for handling virtual machine transfer implementation.
CN201010149545A 2010-04-19 2010-04-19 Zero downtime HA of predictive virtual machine Pending CN101814121A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010149545A CN101814121A (en) 2010-04-19 2010-04-19 Zero downtime HA of predictive virtual machine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010149545A CN101814121A (en) 2010-04-19 2010-04-19 Zero downtime HA of predictive virtual machine

Publications (1)

Publication Number Publication Date
CN101814121A true CN101814121A (en) 2010-08-25

Family

ID=42621373

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010149545A Pending CN101814121A (en) 2010-04-19 2010-04-19 Zero downtime HA of predictive virtual machine

Country Status (1)

Country Link
CN (1) CN101814121A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103186428A (en) * 2011-12-28 2013-07-03 财团法人工业技术研究院 Virtual Resource Management Method
CN105607973A (en) * 2014-11-19 2016-05-25 中国移动通信集团公司 Method, device and system for processing equipment failures in virtual machine system
CN106301876A (en) * 2015-06-26 2017-01-04 中兴通讯股份有限公司 Physical machine upgrade method, business migration method and device
CN103220165B (en) * 2013-03-20 2017-04-19 杭州华三通信技术有限公司 Processing method and device for server active downtime
WO2017075989A1 (en) * 2015-11-06 2017-05-11 华为技术有限公司 Method, device and system for virtual machines migration
CN107704314A (en) * 2017-11-09 2018-02-16 北京百度网讯科技有限公司 Method and apparatus for migrating virtual machine

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1387547A1 (en) * 2002-08-01 2004-02-04 He Joe Gesellschaft für Netzwerk- und Systemadministration mbH System of interconnected computers and method of control
CN101436966A (en) * 2008-12-23 2009-05-20 北京航空航天大学 Network monitoring and analysis system under virtual machine circumstance

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1387547A1 (en) * 2002-08-01 2004-02-04 He Joe Gesellschaft für Netzwerk- und Systemadministration mbH System of interconnected computers and method of control
CN101436966A (en) * 2008-12-23 2009-05-20 北京航空航天大学 Network monitoring and analysis system under virtual machine circumstance

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
吴靖等: "《基于VMware虚拟化技术的远程教育平台设计》", 《电信快报》, no. 10, 31 October 2009 (2009-10-31), pages 11 - 13 *
孟葳: "《分布式虚拟机机群管理的研究与实现》", 《中国优秀硕士学位论文全文数据库信息科技辑》, no. 11, 15 November 2008 (2008-11-15), pages 48 - 54 *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103186428A (en) * 2011-12-28 2013-07-03 财团法人工业技术研究院 Virtual Resource Management Method
CN103220165B (en) * 2013-03-20 2017-04-19 杭州华三通信技术有限公司 Processing method and device for server active downtime
CN105607973A (en) * 2014-11-19 2016-05-25 中国移动通信集团公司 Method, device and system for processing equipment failures in virtual machine system
CN105607973B (en) * 2014-11-19 2020-03-17 中国移动通信集团公司 Method, device and system for processing equipment fault in virtual machine system
CN106301876A (en) * 2015-06-26 2017-01-04 中兴通讯股份有限公司 Physical machine upgrade method, business migration method and device
CN106301876B (en) * 2015-06-26 2019-11-29 南京中兴新软件有限责任公司 Physical machine upgrade method, business migration method and device
WO2017075989A1 (en) * 2015-11-06 2017-05-11 华为技术有限公司 Method, device and system for virtual machines migration
CN106681802A (en) * 2015-11-06 2017-05-17 华为技术有限公司 Virtual machine migration method, device and system
CN107704314A (en) * 2017-11-09 2018-02-16 北京百度网讯科技有限公司 Method and apparatus for migrating virtual machine
CN107704314B (en) * 2017-11-09 2023-09-12 北京百度网讯科技有限公司 Method and apparatus for migrating virtual machines

Similar Documents

Publication Publication Date Title
CN103440160B (en) Virtual machine restoration methods and virtual machine migration method and device and system
US8935563B1 (en) Systems and methods for facilitating substantially continuous availability of multi-tier applications within computer clusters
US8627143B2 (en) Dynamically modeling and selecting a checkpoint scheme based upon an application workload
CN104040459B (en) For the system and method for the energy stores demand for reducing data center
CN105960639B (en) Prioritized Data Reconstruction in Distributed Storage Systems
TWI493331B (en) Power optimization via virtualization opportunity
US9037826B1 (en) System for optimization of input/output from a storage array
JP5967215B2 (en) Information processing apparatus, program, and virtual machine migration method
CN107967180B (en) Based on resource overall situation affinity network optimized approach and system under NUMA virtualized environment
CN103403689B (en) A resource fault management method, device and system
US9448824B1 (en) Capacity availability aware auto scaling
US8694827B2 (en) Job migration in response to loss or degradation of a semi-redundant component
US9229843B2 (en) Predictively managing failover in high availability systems
CN101814121A (en) Zero downtime HA of predictive virtual machine
TWI735542B (en) Data storage method and device of virtual machine
JP2011128967A (en) Method for moving virtual machine, virtual machine system and program
US10782898B2 (en) Data storage system, load rebalancing method thereof and access control method thereof
US20230023229A1 (en) Volatile memory data recovery based on independent processing unit data access
CN103718535A (en) Hardware failure mitigation
US11900159B2 (en) Method for repointing resources between hosts
JP7704382B2 (en) MEMORY ALLOCATION AND MEMORY WRITE REDIRECTION IN A CLOUD COMPUTING SYSTEM BASED ON MEMORY MODULE TEMPERATURE - Patent application
CN103279386A (en) Method for achieving high availability of computer operation scheduling system
Almufti et al. Leveraging distributed systems for fault-tolerant cloud computing: A review of strategies and frameworks
US8931051B2 (en) Scalable and highly available clustering for large scale real-time applications
CN107124317A (en) A kind of disaster tolerance system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
ASS Succession or assignment of patent right

Owner name: SHANDONG LANGCHAO HUICAI INVESTMENT HOLDING CO., L

Free format text: FORMER OWNER: SHANDONG HIGH-END SERVER + STORAGE RESEARCH INSTITUTE

Effective date: 20121017

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 250014 JINAN, SHANDONG PROVINCE TO: 250101 JINAN, SHANDONG PROVINCE

TA01 Transfer of patent application right

Effective date of registration: 20121017

Address after: Xinluo Avenue high tech Zone of Ji'nan City, Shandong province 250101 No. 1768 Qilu Software building B block 3 layer

Applicant after: Shandong wave color Klc Holdings Ltd

Address before: 250014 No. 224 mountain road, Lixia District, Shandong, Ji'nan

Applicant before: Shandong High-End Server & Storage Research Institute

C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: LANGCHAO ELECTRONIC INFORMATION INDUSTRY CO., LTD.

Free format text: FORMER OWNER: SHANDONG LANGCHAO HUICAI INVESTMENT HOLDING CO., LTD.

Effective date: 20130909

C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20130909

Address after: 250101 Shandong Province, Ji'nan City hi tech Development Zone, Nga Road No. 1036

Applicant after: Langchao Electronic Information Industry Co., Ltd.

Address before: Xinluo Avenue high tech Zone of Ji'nan City, Shandong province 250101 No. 1768 Qilu Software building B block 3 layer

Applicant before: Shandong wave color Klc Holdings Ltd

C05 Deemed withdrawal (patent law before 1993)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20100825