CN104199747B

CN104199747B - High-availability system obtaining method and system based on health management

Info

Publication number: CN104199747B
Application number: CN201410403247.8A
Authority: CN
Inventors: 伍华凤; 何龙; 龚念; 戴新发; 代征
Original assignee: 709th Research Institute of CSIC
Current assignee: 709th Research Institute of CSIC
Priority date: 2014-08-15
Filing date: 2014-08-15
Publication date: 2017-05-03
Anticipated expiration: 2034-08-15
Also published as: CN104199747A

Abstract

The invention discloses a method for implementing a high-availability system based on health management: S1. Set at least two control nodes and at least two computing nodes; set one of the control nodes as the main control node, and set the other control nodes as backup Control the nodes and configure their priority order; set the priority of the main control node higher than any backup control node; S2, each control node repeatedly collects and stores the health status data of all computing nodes; S3, judges whether the current control node is The main control node, if yes, go to step S5; otherwise, go to step S4; S4, the standby control node with the highest priority detects whether the main control node is faulty through heartbeat in real time; the standby control node with low priority detects in real time Whether the standby control node with a higher priority level is faulty; if so, modify the priority order; S5. The main control node analyzes the stored health status data, and manages the computing nodes according to the analysis results.

Description

Method and system for implementing high-availability system based on health management

技术领域technical field

本发明涉及一种计算机高可用技术领域，尤其涉及一种基于健康管理的高可用系统实现方法及系统。The invention relates to the technical field of high availability of computers, in particular to a method and system for realizing a high availability system based on health management.

背景技术Background technique

计算机系统的可靠性用平均无故障时间来度量，即计算机系统平均能够正常运行多长时间，才发生一次故障。计算机系统的可用性定义为系统保持正常运行时间的百分比。高可用性(High Availability；HA)用于描述一个系统经过专门的设计，从而减少停工时间，而保持其服务的高度可用性。为了提高服务器设备或软件的高可用性，目前常用的方法主要有：The reliability of a computer system is measured by the mean time between failures, that is, how long the computer system can run normally before a failure occurs. Availability of a computer system is defined as the percentage of time the system remains up. High Availability (High Availability; HA) is used to describe a system that has been specially designed to reduce downtime while maintaining high availability of its services. In order to improve the high availability of server equipment or software, currently commonly used methods mainly include:

1、双机冷备份技术：主机处于运行状态，备份机处于等待状态。一旦主机出现故障，备份机立即开启主机的服务。1. Two-machine cold backup technology: the main machine is in the running state, and the backup machine is in the waiting state. Once the main machine fails, the backup machine immediately starts the service of the main machine.

2、双机热备份技术：该技术即为目前通常所说的激活(active)/等待(standby)工作方式，Active和standby设备具有相同的硬件配置和软件系统。一旦active设备出现故障的时候，standby设备立即激活应用服务，保证应用服务在短时间内完全恢复正常使用。高可用性软件(HA)包括RoseHa等软件。2. Dual-machine hot backup technology: This technology is the so-called active/standby working mode. The active and standby devices have the same hardware configuration and software system. Once the active device fails, the standby device immediately activates the application service to ensure that the application service can be fully restored to normal use in a short time. High availability software (HA) includes software such as RoseHa.

3、虚拟机高可用容错技术：通过对服务器进行虚拟化后，在虚拟服务器上进行容错备份配置，再通过心跳检测和虚拟机实时迁移技术，来实现计算机系统所有应用的透明容错计算功能。3. Virtual machine high-availability fault-tolerant technology: After virtualizing the server, perform fault-tolerant backup configuration on the virtual server, and then use heartbeat detection and virtual machine real-time migration technology to realize transparent fault-tolerant computing functions for all applications of the computer system.

以上提高服务器设备或软件的高可用性的方法存在以下缺陷：The above methods for improving the high availability of server equipment or software have the following defects:

1、双机冷备份技术和双机热备份技术大多是针对某一确定服务进程进行设计，即为提高某一服务的高可用性而进行的专门设计，不支持其它应用程序的容错计算，因此用户自己开发的应用软件的用户业务必须通过修改应用程序，而调用高可用软件的专用API，来实现用户业务的容错处理，使得用户自己开发的应用软件要通过商用高可用软件实现高可用则比较困难。1. Most of the dual-machine cold backup technology and dual-machine hot backup technology are designed for a certain service process, that is, specially designed to improve the high availability of a certain service, and do not support fault-tolerant calculations of other applications. Therefore, users The user business of self-developed application software must modify the application program and call the dedicated API of high-availability software to achieve fault-tolerant processing of user business, making it difficult for user-developed application software to achieve high availability through commercial high-availability software .

2、虚拟机高可用容错技术的系统资源开销较大。2. The system resource overhead of high-availability and fault-tolerant technology for virtual machines is relatively large.

发明内容Contents of the invention

有鉴于此，有必要提供一种系统资源开销小，支持各种应用程序的容错计算的基于健康管理的高可用系统实现方法及系统。In view of this, it is necessary to provide a high-availability system implementation method and system based on health management that has low system resource overhead and supports fault-tolerant computing of various applications.

本发明解决其技术问题所采用的技术方案是：构造一种基于健康管理的高可用系统实现方法，所述基于健康管理的高可用系统实现方法包括如下步骤：The technical solution adopted by the present invention to solve the technical problem is: to construct a method for implementing a high-availability system based on health management, and the method for implementing a high-availability system based on health management includes the following steps:

S1、设置至少两个控制节点和至少两个计算节点；将其中一个控制节点设置为主控制节点，将其他的控制节点设置为备控制节点并配置备控制节点的优先级顺序；设置主控制节点的优先级高于任何一备控制节点；S1. Set at least two control nodes and at least two computing nodes; set one of the control nodes as the master control node, set the other control nodes as standby control nodes and configure the priority order of the standby control nodes; set the master control node priority is higher than that of any standby control node;

S2、各控制节点分别重复采集并存储所有的计算节点的健康状态数据；S2. Each control node repeatedly collects and stores the health status data of all computing nodes;

S3、判断当前控制节点是否为主控制节点，如果当前控制节点是主控制节点，则跳转到步骤S5；如果当前控制节点是备控制节点，则跳转到步骤S4；S3. Judging whether the current control node is the main control node, if the current control node is the main control node, then jump to step S5; if the current control node is the standby control node, then jump to step S4;

S4、优先级最高的备控制节点实时通过心跳探测主控制节点是否出现故障；优先级低的备控制节点实时通过心跳探测比其优先级高一级的备控制节点是否出现故障；如果控制节点出现故障，则修改优先级顺序；S4. The standby control node with the highest priority detects whether the main control node fails through heartbeat in real time; the standby control node with low priority detects through heartbeat in real time whether the standby control node with a higher If there is a fault, modify the priority order;

S5、主控制节点分析存储在主控制节点的存储器上存储的健康状态数据，根据分析结果对计算节点进行管理。S5. The master control node analyzes the health state data stored in the memory of the master control node, and manages the computing nodes according to the analysis result.

本发明还提供一种基于健康管理的高可用系统实现系统，其特征在于，所述基于健康管理的高可用系统实现系统包括如下模块：The present invention also provides a system for implementing a high-availability system based on health management, wherein the system for implementing a high-availability system based on health management includes the following modules:

节点配置模块，用于设置至少两个控制节点和至少两个计算节点；将其中一个控制节点设置为主控制节点，将其他的控制节点设置为备控制节点并配置备控制节点的优先级顺序；设置主控制节点的优先级高于任何一备控制节点；The node configuration module is used to set at least two control nodes and at least two computing nodes; set one of the control nodes as the main control node, set the other control nodes as standby control nodes and configure the priority order of the standby control nodes; Set the priority of the master control node to be higher than that of any standby control node;

数据采集模块，用于通过各控制节点分别重复采集并存储所有的计算节点的健康状态数据；The data collection module is used to repeatedly collect and store the health status data of all computing nodes through each control node;

节点判断模块，用于判断当前控制节点是否为主控制节点，如果当前控制节点是主控制节点，则跳转到数据分析模块；如果当前控制节点是备控制节点，则跳转到故障判断模块；The node judgment module is used to judge whether the current control node is the main control node, if the current control node is the main control node, then jump to the data analysis module; if the current control node is the backup control node, then jump to the fault judgment module;

故障判断模块，用于通过优先级最高的备控制节点实时通过心跳探测主控制节点是否出现故障；通过优先级低的备控制节点实时通过心跳探测比其优先级高一级的备控制节点是否出现故障；当控制节点出现故障时，修改优先级顺序；The fault judgment module is used to detect whether the main control node fails in real time through the heartbeat of the standby control node with the highest priority; through the heartbeat of the standby control node with low priority to detect whether the standby control node with a higher priority exists in real time Fault; when the control node fails, modify the priority order;

数据分析模块，用于通过主控制节点分析存储在主控制节点的存储器上存储的健康状态数据，根据分析结果对计算节点进行管理。The data analysis module is configured to analyze the health state data stored in the memory of the main control node through the main control node, and manage the computing nodes according to the analysis results.

本发明提供的基于健康管理的高可用系统实现方法及系统，通过灵活配置控制节点的数量以及优先级，配置非常灵活。并通过主控制节点分析存储在主控制节点的存储器上存储的健康状态数据，根据分析结果对计算节点进行管理，实现了提前预知计算节点的趋势性损耗故障。健康状态数据通过计算节点的旁路获取，不占用计算节点的CPU、内存、网络等资源，相对于双机冷备份技术、双机热备份技术、虚拟机高可用容错技术，节省了系统资源开销。The method and system for realizing a high-availability system based on health management provided by the present invention can be configured very flexibly by flexibly configuring the number and priority of control nodes. In addition, the main control node analyzes the health status data stored in the memory of the main control node, manages the computing nodes according to the analysis results, and realizes the prediction of the trending loss failure of the computing nodes in advance. The health status data is obtained through the bypass of the computing node, which does not occupy the CPU, memory, network and other resources of the computing node. Compared with the dual-machine cold backup technology, dual-machine hot backup technology, and virtual machine high-availability fault-tolerant technology, it saves system resource overhead .

附图说明Description of drawings

图1是本发明一较佳实施例的基于健康管理的高可用系统实现方法流程图；Fig. 1 is a flow chart of a method for implementing a highly available system based on health management in a preferred embodiment of the present invention;

图2是图1中步骤S2的子流程图；Fig. 2 is the subflow chart of step S2 among Fig. 1;

图3是图1中步骤S4的子流程图；Fig. 3 is the subflow chart of step S4 among Fig. 1;

图4是图1中步骤S5的子流程图；Fig. 4 is the subflow chart of step S5 among Fig. 1;

图5是图4中步骤S58的子流程图；Fig. 5 is the subflow chart of step S58 among Fig. 4;

图6是本发明实施例中高可用系统的逻辑关系图；Fig. 6 is a logic diagram of the high availability system in the embodiment of the present invention;

图7是本发明实施例中主控制节点和备控制节点的工作流程图；Fig. 7 is a working flow chart of the main control node and the standby control node in the embodiment of the present invention;

图8是本发明实施例中控制节点接管顺序示意图；FIG. 8 is a schematic diagram of a control node takeover sequence in an embodiment of the present invention;

图9是本发明实施例中健康状态数据采集分析处理工作流程图；Fig. 9 is a work flow chart of health state data collection, analysis and processing in an embodiment of the present invention;

图10是本发明实施例中健康状态数据历史值矩阵示意图。Fig. 10 is a schematic diagram of a historical value matrix of health state data in an embodiment of the present invention.

图11是本发明一较佳实施例的基于健康管理的高可用系统实现系统的结构框图；Fig. 11 is a structural block diagram of a highly available system implementation system based on health management in a preferred embodiment of the present invention;

图12是图11中数据采集模块的子结构框图；Fig. 12 is a block diagram of the substructure of the data acquisition module in Fig. 11;

图13是图11中故障判断模块的子结构框图；Fig. 13 is a block diagram of the substructure of the failure judgment module in Fig. 11;

图14是图11中数据分析模块的子结构框图；Fig. 14 is a substructure block diagram of the data analysis module in Fig. 11;

图15是图14中计算节点管理单元的子结构框图。FIG. 15 is a substructure block diagram of the computing node management unit in FIG. 14 .

具体实施方式detailed description

如图1所示，本发明实施例提供一种基于健康管理的高可用系统实现方法，所述基于健康管理的高可用系统实现方法包括如下步骤：As shown in Figure 1, an embodiment of the present invention provides a method for implementing a high-availability system based on health management, and the method for implementing a high-availability system based on health management includes the following steps:

S1、设置至少两个控制节点和至少两个计算节点。将其中一个控制节点设置为主控制节点，将其他的控制节点设置为备控制节点并配置备控制节点的优先级顺序。设置主控制节点的优先级高于任何一备控制节点。S1. Set at least two control nodes and at least two computing nodes. Set one of the control nodes as the master control node, set the other control nodes as standby control nodes and configure the priority order of the standby control nodes. Set the priority of the master control node to be higher than that of any standby control node.

本发明实施可根据高可用系统的任务重要性及高可用需求，添加或删除控制节点的个数，优选控制节点的个数不超过计算节点的个数，两个或三个控制节点即可。可根据高可用系统的任务处理规模，添加或删除计算节点的个数，建议保留至少两个计算节点。In the implementation of the present invention, the number of control nodes can be added or deleted according to the task importance and high availability requirements of the high-availability system. Preferably, the number of control nodes does not exceed the number of computing nodes, and two or three control nodes are sufficient. The number of computing nodes can be added or deleted according to the task processing scale of the high-availability system. It is recommended to reserve at least two computing nodes.

任一计算节点之外其它的所有计算节点都可配置成该计算节点的备计算节点，因此大幅度提高了计算节点的冗余度，提高了计算节点的高可用性。All computing nodes other than any computing node can be configured as standby computing nodes of the computing node, thus greatly improving the redundancy of computing nodes and improving the high availability of computing nodes.

S2、各控制节点分别重复采集并存储所有的计算节点的健康状态数据。S2. Each control node repeatedly collects and stores health status data of all computing nodes.

健康状态数据可以包括计算节点的温度、风扇、电压、电流等健康状态数据类型。Health status data may include health status data types such as temperature, fan, voltage, and current of computing nodes.

本发明实施例中的控制节点和计算节点可以是网络系统中的服务器或者计算设备。The control node and computing node in the embodiment of the present invention may be a server or a computing device in a network system.

本发明实施例中的健康状态数据可以通过计算节点的前端模块(BMC模块)的旁路获取，不占用计算节点的CPU、内存、网络等资源，因此相比双机热备份、双机冷备份技术、虚拟机容错技术等实现高可用方法。本发明实施例对计算节点本身的性能影响更小。The health status data in the embodiment of the present invention can be obtained through the bypass of the front-end module (BMC module) of the computing node, and does not occupy resources such as the CPU, memory, and network of the computing node. Therefore, compared with dual-machine hot backup and dual-machine cold backup Technology, virtual machine fault tolerance technology, etc. to achieve high availability. The embodiment of the present invention has less impact on the performance of the computing node itself.

可选地，如图2所示，所述步骤S2包括以下子步骤：Optionally, as shown in Figure 2, the step S2 includes the following sub-steps:

S21、各控制节点分别采集一次所有的计算节点的健康状态数据，并将采集的所述健康状态数据保存到各控制节点各自的存储器上；S21. Each control node collects the health state data of all computing nodes once, and saves the collected health state data in the memory of each control node;

S22、判断采集的次数是否小于预定次数p，如果采集的次数小于预定次数p，则等待第一预设时间T1后跳转到步骤S21；如果采集的次数大于预定次数p，则删除各控制节点存储器上多余的历史健康状态数据，保留最近的p次采集到的健康状态数据，则跳转到步骤S23；如果采集的次数等于预定次数p，则跳转到步骤S23；S22. Determine whether the number of collections is less than the predetermined number p, if the number of collections is less than the predetermined number p, then wait for the first preset time T1 and then jump to step S21; if the number of collections is greater than the predetermined number p, delete each control node For redundant historical health status data on the memory, keep the latest p health status data collected, then jump to step S23; if the number of times collected is equal to the predetermined number p, then jump to step S23;

S23、将各控制节点p次采集到的n个计算节点的健康状态数据设置成一个p行n列的健康状态数据历史值矩阵。S23. Set the health status data of n computing nodes collected by each control node p times into a matrix of historical health status data with p rows and n columns.

所述步骤S23中的健康状态数据历史值矩阵可以参见图10。Refer to FIG. 10 for the historical value matrix of the health state data in the step S23.

S3、判断当前控制节点是否为主控制节点，如果当前控制节点是主控制节点，则跳转到步骤S5。如果当前控制节点是备控制节点，则跳转到步骤S4。S3. Judging whether the current control node is the master control node, if the current control node is the master control node, jump to step S5. If the current control node is the standby control node, go to step S4.

S4、优先级最高的备控制节点实时通过心跳探测主控制节点是否出现故障。优先级低的备控制节点实时通过心跳探测比其优先级高一级的备控制节点是否出现故障。如果控制节点出现故障，则修改优先级顺序。S4. The standby control node with the highest priority detects whether the main control node fails in real time through the heartbeat. The standby control node with a lower priority detects in real time whether the standby control node with a higher priority is faulty through heartbeat. If the control node fails, the priority order is modified.

可选地，如图3所示，所述步骤S4包括以下子步骤：Optionally, as shown in Figure 3, the step S4 includes the following sub-steps:

S41、优先级最高的备控制节点实时通过心跳探测主控制节点是否出现故障；优先级低的备控制节点实时通过心跳探测比其优先级高一级的备控制节点是否出现故障。S41. The standby control node with the highest priority detects in real time whether the main control node fails through heartbeat; the standby control node with low priority detects in real time whether the standby control node with a higher priority than the standby control node fails through heartbeat.

S42、如果探测到主控制节点出现故障，则优先级最高的备控制节点配置为主控制节点，将除优先级最高的备控制节点以外的其他备控制节点的优先级各自提高一级。S42. If it is detected that the main control node fails, the standby control node with the highest priority is configured as the main control node, and the priorities of the other standby control nodes except the standby control node with the highest priority are respectively increased by one level.

S43、如果探测到备控制节点出现故障，将优先级低于该出现故障的备控制节点的所有备控制节点的优先级各自提高一级。S43. If it is detected that the standby control node fails, the priorities of all the standby control nodes whose priorities are lower than the failed standby control node are respectively increased by one level.

本实施例可以灵活更改控制节点的优先级，并且在主控制节点出现故障时，能够及时将优先级最高的备控制节点设置为主控制节点。实现了控制节点出现故障时平滑过渡。This embodiment can flexibly change the priority of the control node, and when the main control node fails, the standby control node with the highest priority can be set as the main control node in time. A smooth transition is realized when the control node fails.

可选地，如图4所示，所述步骤S5包括以下子步骤：Optionally, as shown in Figure 4, the step S5 includes the following sub-steps:

S51、主控制节点逐列分析健康状态数据历史值矩阵，每列健康状态数据对应一个计算节点。S51. The main control node analyzes the historical value matrix of the health status data column by column, and each column of health status data corresponds to a computing node.

S52、判断所有的计算节点是否都分析了一遍，如果所有的计算节点都分析了一遍，则跳转到步骤S57。如果存在计算节点没有分析，则跳转到步骤S53。S52. Determine whether all computing nodes have been analyzed once. If all computing nodes have been analyzed once, skip to step S57. If there are computing nodes that have not been analyzed, go to step S53.

S53、分别统计当前计算节点超过健康状态参考范围以及低于健康状态参考范围的采集的次数；S53, respectively counting the number of acquisitions of the current computing node exceeding the health state reference range and below the health state reference range;

S54、如果超过健康状态参考范围的采集的次数小于k1，并且低于健康状态参考范围的采集的次数小于k2，则判定该计算节点为健康状态，并跳转到步骤S55。如果超过健康状态参考范围的采集的次数大于或等于k1，或者低于健康状态参考范围的采集的次数大于或等于k2，则判定该计算节点为非健康状态，并跳转到步骤S56。S54. If the number of acquisitions exceeding the health state reference range is less than k1, and the number of acquisitions below the health state reference range is less than k2, determine that the computing node is in a healthy state, and jump to step S55. If the number of acquisitions exceeding the reference range of the health state is greater than or equal to k1, or the number of acquisitions below the reference range of the health state is greater than or equal to k2, it is determined that the computing node is in an unhealthy state and jumps to step S56.

k1/p表示对超过健康状态参考范围的容忍度(越大越保守，越小越激进)，K2/p表示对低于健康状态参考范围的容忍度(越大越保守，越小越激进)。容忍度也可根据高可用系统的要求进行配置，灵活配置。k1/p represents the tolerance for exceeding the reference range of the health state (the larger is more conservative, the smaller is more aggressive), and K2/p represents the tolerance for being below the reference range of the health state (the larger is more conservative, the smaller is more aggressive). The tolerance can also be configured according to the requirements of the high-availability system, and it can be configured flexibly.

S55、将该计算节点的非健康标志值设置为0，并跳转到步骤S51。S55. Set the unhealthy flag value of the computing node to 0, and jump to step S51.

S56、获取超过健康状态参考范围的采集的次数和低于健康状态参考范围的采集的次数的最大值，将该最大值设置为该计算节点的非健康状态值，并跳转到步骤S51。S56. Obtain the maximum value of the number of acquisitions exceeding the health state reference range and the number of acquisitions below the health state reference range, set the maximum value as the unhealthy state value of the computing node, and jump to step S51.

S57、分析所有计算节点的非健康标志值，判断是否存在非0值的计算节点；如果不存在非0值的计算节点，则确定所有的计算节点都是健康的，如果存在非0值的计算节点，则确定存在非健康的计算节点。S57. Analyze the non-health flag values of all computing nodes, and determine whether there are computing nodes with non-zero values; if there are no non-zero computing nodes, determine that all computing nodes are healthy, and if there are non-zero computing nodes node, it is determined that there is an unhealthy computing node.

S58、按照设置策略对计算节点进行管理，等待第一预设时间T1后跳转到步骤S21。S58. Manage the computing nodes according to the set policy, wait for the first preset time T1, and jump to step S21.

可选地，如图5所示，所述步骤S58包括以下子步骤：Optionally, as shown in Figure 5, the step S58 includes the following sub-steps:

S581、设置第一策略、第二策略、第三策略，第一策略包括提示告警、自动迁移任务出去、告警同时自动迁移任务出去三个选项；第二策略包括提示告警、自动关机、告警同时自动关机等三个选项；第三策略包括提示告警、自动迁移任务回来、告警同时自动迁移任务回来三个选项。S581. Set the first strategy, the second strategy, and the third strategy. The first strategy includes three options: prompt alarm, automatic migration task out, and alarm simultaneous automatic migration task out; the second strategy includes prompt alarm, automatic shutdown, and alarm simultaneous automatic Shutdown and other three options; the third strategy includes three options: prompt alarm, automatic migration task return, alarm and automatic migration task return.

S582、判断非健康的计算节点是否存在处理任务，如果非健康的计算节点存在处理任务，则查询第一策略并按照第一策略进行处理。如果非健康的计算节点不存在处理任务，则查询第二策略并按照第二策略进行处理。S582. Determine whether an unhealthy computing node has a processing task, and if the unhealthy computing node has a processing task, query the first strategy and perform processing according to the first strategy. If there is no processing task in the unhealthy computing node, query the second policy and process according to the second policy.

S583、判断健康的计算节点是否存在处理任务，如果健康的计算节点存在处理任务，则对该健康的计算节点不做操作；如果健康的计算节点不存在处理任务，则确定该健康的计算节点是从非健康状态恢复成健康状态的，并记录下当前时间戳，如果当前时间戳减去上次时间戳的差值小于第二预设时间T2，或者没有上次时间戳，则不作任何操作，如果差值大于或等于第二预设时间T2则按照第三策略进行处理。S583. Determine whether a healthy computing node has a processing task. If the healthy computing node has a processing task, no operation is performed on the healthy computing node; if the healthy computing node does not have a processing task, determine that the healthy computing node is Restore from an unhealthy state to a healthy state, and record the current time stamp. If the difference between the current time stamp and the last time stamp is less than the second preset time T2, or if there is no last time stamp, no action will be taken. If the difference is greater than or equal to the second preset time T2, the processing is performed according to the third strategy.

S584、等待第一预设时间T1后跳转到步骤S21。S584. Jump to step S21 after waiting for the first preset time T1.

本发明实施例通过分析健康状态数据触发计算节点的任务迁移，相比宕机时自动触发任务迁移，或手动进行任务迁移，健康状态数据检测更加科学和灵活。并且在计算节点出现趋势性损耗故障前，在健康状态数据上都有提前反应(亚健康)，因此提高了系统的故障提前感知能力。相比其他技术，本发明实施例更加节省高可用系统本身的资源开销。The embodiment of the present invention triggers task migration of computing nodes by analyzing health status data. Compared with automatically triggering task migration when downtime or manually performing task migration, health status data detection is more scientific and flexible. And before the computing node has a trending loss failure, there is an early response (sub-health) on the health status data, thus improving the system's ability to detect faults in advance. Compared with other technologies, the embodiment of the present invention saves resource overhead of the high availability system itself.

以下结合图6至图10对本发明实施例的原理做更进一步的说明。The principle of the embodiment of the present invention will be further described below with reference to FIG. 6 to FIG. 10 .

图6是本发明实施例中高可用系统的逻辑关系图。可以看出每个控制节点都对应所有的计算节点。低优先级的控制节点通过心跳检测高一级优先权的控制节点是否出现故障。Fig. 6 is a logical relationship diagram of the high availability system in the embodiment of the present invention. It can be seen that each control node corresponds to all computing nodes. The control node with low priority detects whether the control node with a higher priority is faulty through heartbeat.

结合附图7和图9对本发明的本实施方式作详细说明。This embodiment of the present invention will be described in detail with reference to accompanying drawings 7 and 9 .

结合附图7对本发明的主控制节点和备控制节点工作流程进行说明，详细流程如下所示。The working flow of the master control node and the standby control node of the present invention will be described with reference to FIG. 7 , and the detailed flow is as follows.

步骤1.1：控制节点收集所有计算节点的健康状态数据，然后进入步骤1.2。Step 1.1: The control node collects the health status data of all computing nodes, and then enters step 1.2.

步骤1.2：读取配置文件，判断当前控制节点是否是主控制节点。如果不是主控制节点(即备控制节点)，进入步骤1.3；如果是主控制节点，进入步骤1.5。Step 1.2: Read the configuration file to determine whether the current control node is the master control node. If it is not the main control node (that is, the standby control node), go to step 1.3; if it is the main control node, go to step 1.5.

步骤1.3：通过心跳检测网络探测主控制节点和优先级更高的备控制节点是否故障，如果有控制节点故障进入步骤1.4，如果没有控制节点故障则等待时间T1，进入步骤1.1。Step 1.3: Detect whether the main control node and the standby control node with higher priority are faulty through the heartbeat detection network. If there is a control node fault, go to step 1.4. If there is no control node fault, wait for time T1 and go to step 1.1.

步骤1.4：修改控制节点的优先顺序，可参考附图8，图8是本发明实施例中控制节点接管顺序示意图。如果主控制节点故障，优先级最高的第一备控制节点成为主控制节点；如果备控制节点出现故障，则比该备控制节点更低优先级的备控制节点自动提高一级优先级。修改完控制节点的优先顺序后，进入步骤1.2。Step 1.4: modify the priority order of the control nodes, please refer to FIG. 8 , which is a schematic diagram of the control node takeover sequence in the embodiment of the present invention. If the master control node fails, the first standby control node with the highest priority becomes the master control node; if the standby control node fails, the standby control node with a lower priority than the standby control node automatically raises its priority by one level. After modifying the priority order of the control nodes, go to step 1.2.

步骤1.5：对健康状态数据进行分析处理，可参考附图9。Step 1.5: Analyze and process the health status data, please refer to Figure 9.

结合附图9对本发明的健康状态数据分析处理的工作流程进行说明，详细流程如下所示。该健康状态数据分析处理工作流程适用于温度、风扇、电压、电流等健康状态数据类型。The working flow of the health status data analysis and processing of the present invention will be described in conjunction with FIG. 9 , and the detailed flow is as follows. This health status data analysis and processing workflow is applicable to health status data types such as temperature, fan, voltage, and current.

步骤2.1：采集当前所有计算节点的健康状态数据，然后进入步骤2.2。Step 2.1: Collect the health status data of all current computing nodes, and then go to step 2.2.

步骤2.2：如果采集的次数小于p，则表示采集的历史数据不够，等待时间T后进入步骤2.1。如果采集的时间点个数大于或等于p次，则进入步骤2.3。Step 2.2: If the number of collection times is less than p, it means that the collected historical data is not enough, and enter step 2.1 after waiting time T. If the number of time points collected is greater than or equal to p times, proceed to step 2.3.

步骤2.3：删除多余的历史数据，只保留最近p次采集的健康状态数据，此时共p个时间点下n个计算节点的数据组成一个p行n列的健康状态数据历史值矩阵，可参考附图10。矩阵的行对应每个时间点，该矩阵的列对应每个计算节点，进入步骤2.4。Step 2.3: Delete redundant historical data, and only keep the health status data collected for the last p times. At this time, the data of n computing nodes at p time points form a historical value matrix of health status data with p rows and n columns, which can be referred to Figure 10. The row of the matrix corresponds to each time point, and the column of the matrix corresponds to each computing node, go to step 2.4.

步骤2.4：对健康状态数据历史值矩阵进行逐列分析，每列数据对应一个计算节点，进入步骤2.5。Step 2.4: Analyze the historical value matrix of the health status data column by column, each column of data corresponds to a computing node, and proceed to step 2.5.

步骤2.5：还有计算节点没有分析完毕，进入步骤2.6；如果所有计算节点都分析了一遍，则当前一次采集的健康状态数据分析已经完成，进入步骤2.10。Step 2.5: If there are still computing nodes that have not been analyzed, go to step 2.6; if all computing nodes have been analyzed, the health status data analysis of the current collection has been completed, go to step 2.10.

步骤2.6：统计当前列超过健康状态参考范围的采集次数，统计当前列低于健康状态参考范围的采集次数，然后进入步骤2.7。Step 2.6: Count the collection times of the current column exceeding the reference range of the health state, count the collection times of the current column below the reference range of the health state, and then proceed to step 2.7.

步骤2.7：如果超过健康状态参考范围的时间点次数小于k1，且低于健康状态参考范围的时间点次数小于k2，则仍然判定该计算节点为健康状态，进入步骤2.8；如果超过健康状态参考范围的时间点次数大于等于k1，或者低于健康状态参考范围的时间点次数大于等于k2，则表示该计算节点多次超过或低于健康状态参考范围，且趋于稳定，判定该计算节点为非健康状态，进入步骤2.9。Step 2.7: If the number of time points exceeding the health state reference range is less than k1, and the number of time points below the health state reference range is less than k2, then it is still determined that the computing node is in a healthy state, and enter step 2.8; if it exceeds the health state reference range The number of time points is greater than or equal to k1, or the number of time points below the health status reference range is greater than or equal to k2, it means that the computing node has exceeded or lower than the health status reference range for many times, and tends to be stable, and it is determined that the computing node is not Health status, go to step 2.9.

步骤2.8：置当前计算节点的非健康标志值为0，然后进入步骤2.4。Step 2.8: Set the unhealthy flag value of the current computing node to 0, and then go to step 2.4.

步骤2.9：取超过健康状态参考范围的采集次数和低于健康状态参考范围的时间点次数的最大值，作为该计算节点的非健康标志值，然后进入步骤2.4。Step 2.9: Take the maximum value of the number of collections exceeding the reference range of the health state and the number of time points below the reference range of the health state as the unhealthy flag value of the computing node, and then proceed to step 2.4.

步骤2.10：分析所有计算节点的非健康标志值，如果不存在非0值的计算节点，则表示所有计算节点都是健康的，如果存在非0值的计算节点，则表示存在非健康计算节点，进入步骤2.11。Step 2.10: Analyze the unhealthy flag values of all computing nodes. If there are no computing nodes with non-zero values, it means that all computing nodes are healthy. If there are computing nodes with non-zero values, it means that there are unhealthy computing nodes. Go to step 2.11.

步骤2.11：第一策略设置包括提示告警、自动迁移任务出去、告警同时自动迁移任务出去等三个选项；第二策略设置包括提示告警、自动关机、告警同时自动关机等三个选项；第三策略包括提示告警、自动迁移任务回来、告警同时自动迁移任务回来等三个选项。根据设置策略的要求对计算节点进行相应的操作：Step 2.11: The first policy setting includes three options: prompt alarm, automatic migration task out, alarm and automatic migration task out; the second policy setting includes three options: prompt alarm, automatic shutdown, and alarm simultaneous automatic shutdown; the third policy There are three options including alerting, automatic migration task return, and automatic migration task return at the same time as the alarm. Perform corresponding operations on computing nodes according to the requirements of setting policies:

查看非健康计算节点是否存在处理任务，如果存在处理任务则查询第一策略，并按照第一策略的操作进行处理；如果不存在处理任务则查询第二策略，并按照第二策略的操作进行处理；Check whether there are processing tasks in unhealthy computing nodes. If there are processing tasks, query the first policy and process them according to the operation of the first policy; if there are no processing tasks, query the second policy and process them according to the operations of the second policy ;

查看健康计算节点是否存在处理任务，如果存在处理任务则不做操作；如果不存在处理任务，则表示该计算节点是从非健康状态恢复成健康状态的，并记录下当前时间戳，如果当前时间戳减去上次时间戳的差值小于第二预设时间T2，或者没有上次时间戳，则不作任何操作，如果差值大于或等于第二预设时间T2则按照第三策略进行处理Check whether there are processing tasks in the healthy computing node. If there are processing tasks, no operation is performed; if there are no processing tasks, it means that the computing node has recovered from an unhealthy state to a healthy state, and record the current timestamp. If the current time If the difference between the stamp and the last time stamp is less than the second preset time T2, or if there is no last time stamp, no operation will be done, and if the difference is greater than or equal to the second preset time T2, then the third strategy will be processed

等待时间T1返回步骤2.1。Waiting time T1 returns to step 2.1.

健康状态数据分析处理中牵涉的参数定义如下：The parameters involved in the analysis and processing of health status data are defined as follows:

n：表示参与的计算节点个数；n: indicates the number of participating computing nodes;

p：表示采集健康状态数据的次数；p: indicates the number of times to collect health status data;

k1：表示能够容忍超过健康状态参考范围的时间点次数，该值不大于p，k1/p表示对超过健康状态参考范围的容忍度(越大越保守，越小越激进)；k1: Indicates the number of time points that can tolerate exceeding the reference range of the health state, the value is not greater than p, k1/p represents the tolerance for exceeding the reference range of the health state (the larger the more conservative, the smaller the more aggressive);

k2：表示能够容忍低于健康状态参考范围的时间点次数，该值不大于p，k2/p表示对低于健康状态参考范围的容忍度(越大越保守，越小越激进)；k2: Indicates the number of time points that can be tolerated below the reference range of the health state, the value is not greater than p, and k2/p represents the tolerance for below the reference range of the health state (the larger the more conservative, the smaller the more aggressive);

T1：表示每次健康状态数据分析处理的时间周期，该值不小于健康状态数据采集的时间周期，否则会出现收集重复数据的情况。T1: Indicates the time period for each health status data analysis and processing. This value should not be less than the time period for health status data collection. Otherwise, duplicate data will be collected.

如图11所示，本发明实施例还提供一种基于健康管理的高可用系统实现系统，所述基于健康管理的高可用系统实现系统包括如下模块：As shown in Figure 11, the embodiment of the present invention also provides a system for implementing a high-availability system based on health management, and the system for implementing a high-availability system based on health management includes the following modules:

节点配置模块10，用于设置至少两个控制节点和至少两个计算节点。并用于将其中一个控制节点设置为主控制节点，将其他的控制节点设置为备控制节点并配置备控制节点的优先级顺序。并用于设置主控制节点的优先级高于任何一备控制节点。The node configuration module 10 is configured to configure at least two control nodes and at least two computing nodes. It is also used to set one of the control nodes as the master control node, set the other control nodes as standby control nodes and configure the priority order of the standby control nodes. And it is used to set the priority of the master control node higher than any standby control node.

数据采集模块20，用于通过各控制节点分别重复采集并存储所有的计算节点的健康状态数据。The data collection module 20 is configured to repeatedly collect and store health status data of all computing nodes through each control node.

优选地，如图12所示，所述数据采集模块20包括以下单元：Preferably, as shown in Figure 12, the data acquisition module 20 includes the following units:

数据采集单元21，用于通过各控制节点分别每隔一固定时间T1采集一次所有的计算节点的健康状态数据，并将采集的所述健康状态数据保存到各控制节点各自的存储器上。The data collection unit 21 is configured to collect the health status data of all the computing nodes at intervals of a fixed time T1 by each control node, and save the collected health status data to the memory of each control node.

采集次数判断单元22，用于判断采集的次数是否小于预定次数p，当果采集的次数小于预定次数p时，启动数据采集单元21的功能。并用于当采集的次数大于预定次数p时，删除各控制节点存储器上多余的历史健康状态数据，保留最近的p次采集到的健康状态数据。并用于当采集的次数等于预定次数p时，则直接保留最近的p次采集到的健康状态数据；。The number of collection times judging unit 22 is used to judge whether the number of times of collection is less than the predetermined number p, and when the number of collections is less than the predetermined number p, the function of the data collection unit 21 is activated. It is also used to delete redundant historical health state data on the memory of each control node when the number of times of collection is greater than the predetermined number p, and keep the health state data collected for the latest p times. And it is used to directly retain the health status data collected for the latest p times when the number of times of collection is equal to the predetermined number of times p;

矩阵配置单元23，用于将各控制节点p次采集到的n个计算节点的健康状态数据设置成一个p行n列的健康状态数据历史值矩阵。The matrix configuration unit 23 is configured to set the health status data of n computing nodes collected by each control node p times into a matrix of historical health status data with p rows and n columns.

节点判断模块30，用于判断当前控制节点是否为主控制节点，如果当前控制节点是主控制节点，则跳转到数据分析模块50。如果当前控制节点是备控制节点，则跳转到故障判断模块40。The node judging module 30 is used to judge whether the current control node is the main control node, and if the current control node is the main control node, jump to the data analysis module 50 . If the current control node is the standby control node, then jump to the fault judging module 40 .

故障判断模块40，用于通过优先级最高的备控制节点实时通过心跳探测主控制节点是否出现故障。并用于通过优先级低的备控制节点实时通过心跳探测比其优先级高一级的备控制节点是否出现故障。并用于当控制节点出现故障时，修改优先级顺序。The fault judging module 40 is configured to use the standby control node with the highest priority to detect whether the master control node fails in real time through heartbeat. It is also used to use the standby control node with a lower priority to detect whether the standby control node with a higher priority than the standby control node fails in real time through heartbeat. It is also used to modify the priority order when the control node fails.

优选地，如图13所示，所述故障判断模块40包括以下单元：Preferably, as shown in Figure 13, the fault judgment module 40 includes the following units:

故障判断单元41，用于通过优先级最高的备控制节点实时通过心跳探测主控制节点是否出现故障；并用于通过优先级低的备控制节点实时通过心跳探测比其优先级高一级的备控制节点是否出现故障。The fault judging unit 41 is used to detect whether the main control node fails in real time through the heartbeat of the standby control node with the highest priority; Whether the node has failed.

第一调整单元42，用于当探测到主控制节点出现故障时，通过优先级最高的备控制节点配置为主控制节点，将除优先级最高的备控制节点以外的其他备控制节点的优先级各自提高一级。The first adjustment unit 42 is configured to configure the standby control node with the highest priority as the primary control node when a failure of the primary control node is detected, and set the priorities of other standby control nodes other than the standby control node with the highest priority Each raises a level.

第二调整单元43，用于当探测到备控制节点出现故障时，将优先级低于该出现故障的备控制节点的所有备控制节点的优先级各自提高一级。The second adjustment unit 43 is configured to increase the priorities of all the standby control nodes whose priorities are lower than the failed standby control node by one level when it is detected that the standby control node fails.

数据分析模块50，用于通过主控制节点分析存储在主控制节点的存储器上存储的健康状态数据，根据分析结果对计算节点进行管理。The data analysis module 50 is configured to analyze the health status data stored in the memory of the main control node through the main control node, and manage the computing nodes according to the analysis results.

优选地，如图14所示，所述数据分析模块50包括以下单元：Preferably, as shown in Figure 14, the data analysis module 50 includes the following units:

分析单元51，用于通过主控制节点逐列分析健康状态数据历史值矩阵，每列健康状态数据对应一个计算节点。The analysis unit 51 is configured to analyze the historical value matrix of the health status data column by column through the main control node, and each column of health status data corresponds to a computing node.

分析判断单元52，用于判断所有的计算节点是否都分析了一遍，当所有的计算节点都分析了一遍时，启动计算节点判断单元的功能。并用于当存在计算节点没有分析时，启动统计单元53的功能。The analyzing and judging unit 52 is configured to judge whether all computing nodes have been analyzed once, and when all computing nodes have been analyzed once, the function of the computing node judging unit is activated. It is also used to start the function of the statistical unit 53 when there are computing nodes that are not analyzed.

统计单元53，用于分别统计当前计算节点超过健康状态参考范围以及低于健康状态参考范围的采集的次数。The statistical unit 53 is configured to respectively count the number of acquisitions of the current computing node exceeding the health state reference range and below the health state reference range.

状态判断单元54，用于在超过健康状态参考范围的采集的次数小于k1，并且低于健康状态参考范围的采集的次数小于k2时，判定该计算节点为健康状态，并启动第一标志值设置单元55的功能。并用于当超过健康状态参考范围的采集的次数大于或等于k1，或者低于健康状态参考范围的采集的次数大于或等于k2时，则判定该计算节点为非健康状态，并启动第二标志值设置单元56的功能。A state judging unit 54, configured to determine that the computing node is in a healthy state when the number of collections exceeding the health state reference range is less than k1, and the number of collections below the health state reference range is less than k2, and start setting the first flag value Function of unit 55. And it is used to determine that the computing node is in an unhealthy state when the number of collections exceeding the reference range of the health state is greater than or equal to k1, or the number of collections below the reference range of the health state is greater than or equal to k2, and the second flag value is activated Sets the function of unit 56.

第一标志值设置单元55，用于将该计算节点的非健康标志值设置为0，并启动分析单元51的功能。The first flag value setting unit 55 is configured to set the unhealthy flag value of the computing node to 0, and start the function of the analysis unit 51 .

第二标志值设置单元56，用于获取超过健康状态参考范围的采集的次数和低于健康状态参考范围的采集的次数的最大值，将该最大值设置为该计算节点的非健康状态值，并启动分析单元51的功能。The second flag value setting unit 56 is configured to obtain the maximum value of the number of acquisitions exceeding the health state reference range and the number of acquisitions below the health state reference range, and set the maximum value as the unhealthy state value of the computing node, And start the function of the analyzing unit 51 .

计算节点判断单元57，用于分析所有计算节点的非健康标志值，判断是否存在非0值的计算节点。并用于当不存在非0值的计算节点时，确定所有的计算节点都是健康的，当存在非0值的计算节点，则确定存在非健康的计算节点。The computing node judging unit 57 is configured to analyze the unhealthy flag values of all computing nodes, and judge whether there are computing nodes with non-zero values. It is also used to determine that all computing nodes are healthy when there are no computing nodes with non-zero values, and to determine that there are unhealthy computing nodes when there are computing nodes with non-zero values.

计算节点管理单元58，用于按照设置策略对计算节点进行管理，等待第一预设时间T1后启动数据采集单元21的功能。The computing node management unit 58 is configured to manage the computing nodes according to the set policy, and start the function of the data collection unit 21 after waiting for a first preset time T1.

优选地，如图15所示，所述计算节点管理单元58包括以下子单元：Preferably, as shown in FIG. 15, the computing node management unit 58 includes the following subunits:

策略设置子单元581，用于设置第一策略、第二策略、第三策略，第一策略包括提示告警、自动迁移任务出去、告警同时自动迁移任务出去三个选项；第二策略包括提示告警、自动关机、告警同时自动关机等三个选项；第三策略包括提示告警、自动迁移任务回来、告警同时自动迁移任务回来三个选项。The strategy setting subunit 581 is used to set the first strategy, the second strategy, and the third strategy. The first strategy includes three options: prompt alarm, automatic migration task out, and alarm and automatic migration task out; the second strategy includes prompt alarm, There are three options: automatic shutdown, alarm and automatic shutdown; the third strategy includes three options: prompt alarm, automatic migration task return, alarm and automatic migration task return.

第一任务判断子单元582，用于判断非健康的计算节点是否存在处理任务，当非健康的计算节点存在处理任务时，查询第一策略并按照第一策略进行处理。并用于当非健康的计算节点不存在处理任务时，查询第二策略并按照第二策略进行处理。The first task judging subunit 582 is configured to judge whether there are processing tasks in the unhealthy computing nodes, and when there are processing tasks in the unhealthy computing nodes, query the first strategy and perform processing according to the first strategy. And it is used for querying the second strategy and performing processing according to the second strategy when there is no processing task in the unhealthy computing node.

第二任务判断子单元583，用于判断健康的计算节点是否存在处理任务，当健康的计算节点存在处理任务时，对该健康的计算节点不做操作。当健康的计算节点不存在处理任务时，确定该健康的计算节点是从非健康状态恢复成健康状态的，并对该健康的计算节点计时，计时不到第二预设时间T2时不作任何操作，计时到T2则按照第三策略进行处理。The second task judging subunit 583 is configured to judge whether a healthy computing node has a processing task, and when the healthy computing node has a processing task, no operation is performed on the healthy computing node. When there is no processing task in a healthy computing node, it is determined that the healthy computing node has recovered from an unhealthy state to a healthy state, and the healthy computing node is timed, and no operation is performed when the timing is less than the second preset time T2 , when the timing reaches T2, it will be processed according to the third strategy.

跳转子单元584，用于在等待第一预设时间T1后启动数据采集单元21的功能。The skip subunit 584 is configured to start the function of the data collection unit 21 after waiting for the first preset time T1.

本发明提供的基于健康管理的高可用系统实现系统，通过灵活配置控制节点的数量以及优先级，配置非常灵活，支持其它应用程序的容错计算。并通过主控制节点分析存储在主控制节点的存储器上存储的健康状态数据，根据分析结果对计算节点进行管理，实现了提前预知计算节点的趋势性损耗故障。健康状态数据通过计算节点的旁路获取，不占用计算节点的CPU、内存、网络等资源，相对于双机冷备份技术、双机热备份技术、虚拟机高可用容错技术，节省了系统资源开销。The high-availability system based on health management provided by the present invention implements the system, through flexible configuration of the number and priority of control nodes, the configuration is very flexible, and the fault-tolerant calculation of other application programs is supported. In addition, the main control node analyzes the health status data stored in the memory of the main control node, manages the computing nodes according to the analysis results, and realizes the prediction of the trending loss failure of the computing nodes in advance. The health status data is obtained through the bypass of the computing node, which does not occupy the CPU, memory, network and other resources of the computing node. Compared with the dual-machine cold backup technology, dual-machine hot backup technology, and virtual machine high-availability fault-tolerant technology, it saves system resource overhead .

以上装置实施例与方法实施例是一一对应的，装置实施例简略之处，参见方法实施例即可。The above device embodiments correspond to the method embodiments one by one, and for the briefness of the device embodiments, refer to the method embodiments.

本说明书中各个实施例采用递进的方式描述，每个实施例重点说明的都是与其他实施例的不同之处，各个实施例之间相同相似部分互相参见即可。Each embodiment in this specification is described in a progressive manner, each embodiment focuses on the difference from other embodiments, and the same and similar parts of each embodiment can be referred to each other.

专业人员还可以进一步意识到，结合本文中所公开的实施例描述的各示例的单元及算法步骤，能够以电子硬件、计算机软件或者二者的结合来实现，为了清楚地说明硬件和软件的可互换性，在上述说明中已经按照功能性一般性地描述了各示例的组成及步骤。这些功能究竟以硬件还是软件方式来执行，取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能，但是这种实现不应超过本发明的范围。Professionals can further realize that the units and algorithm steps of the examples described in conjunction with the embodiments disclosed herein can be implemented by electronic hardware, computer software or a combination of the two. In order to clearly illustrate the possible For interchangeability, in the above description, the composition and steps of each example have been generally described in terms of functionality. Whether these functions are executed by hardware or software depends on the specific application and design constraints of the technical solution. Those skilled in the art may use different methods to implement the described functions for each specific application, but such implementation should not exceed the scope of the present invention.

结合本文中所公开的实施例描述的方法或算法的步骤可以直接用硬件、处理器执行的软件模块，或者二者的结合来实施。软件模块可以置于随机储存器、内存、只读存储器、电可编程ROM、电可檫除可编程ROM、寄存器、硬盘、可移动磁盘、CD-ROM、或技术领域内所公知的任意其他形式的存储介质中。The steps of the methods or algorithms described in connection with the embodiments disclosed herein may be directly implemented by hardware, software modules executed by a processor, or a combination of both. Software modules can be placed in random access memory, internal memory, read-only memory, electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, removable disk, CD-ROM, or any other form known in the technical field in the storage medium.

上面结合附图对本发明的实施例进行了描述，但是本发明并不局限于上述的具体实施方式，上述的具体实施方式仅仅是示意性的，而不是限制性的，本领域的普通技术人员在本发明的启示下，在不脱离本发明宗旨和权利要求所保护的范围情况下，还可做出很多形式，这些均属于本发明的保护之内。Embodiments of the present invention have been described above in conjunction with the accompanying drawings, but the present invention is not limited to the above-mentioned specific implementations, and the above-mentioned specific implementations are only illustrative, rather than restrictive. Those of ordinary skill in the art will Under the enlightenment of the present invention, many forms can also be made without departing from the gist of the present invention and the protection scope of the claims, and these all belong to the protection of the present invention.

Claims

1. A method for realizing a high-availability system based on health management, characterized in that, the method for realizing a high-availability system based on health management comprises the steps:

S1. Set at least two control nodes and at least two computing nodes; set one of the control nodes as the master control node, set the other control nodes as standby control nodes and configure the priority order of the standby control nodes; set the master control node priority is higher than that of any standby control node;

S2. Each control node repeatedly collects and stores the health status data of all computing nodes;

S3. Judging whether the current control node is the main control node, if the current control node is the main control node, then jump to step S5; if the current control node is the standby control node, then jump to step S4;

S4. The standby control node with the highest priority detects whether the main control node fails through heartbeat in real time; the standby control node with low priority detects through heartbeat in real time whether the standby control node with a higher If there is a fault, modify the priority order;

S5. The master control node analyzes the health status data stored in the memory of the master control node, and manages the computing nodes according to the analysis results;

Wherein, step S2 specifically includes the following sub-steps:

S21. Each control node collects the health state data of all computing nodes once, and saves the collected health state data in the memory of each control node;

S22. Determine whether the number of collections is less than the predetermined number p, if the number of collections is less than the predetermined number p, then wait for the first preset time T1 and then jump to step S21; if the number of collections is greater than the predetermined number p, delete each control node For redundant historical health status data on the memory, keep the latest p health status data collected, then jump to step S23; if the number of times collected is equal to the predetermined number p, then jump to step S23;

S23. Set the health status data of n computing nodes collected by each control node p times into a matrix of historical health status data with p rows and n columns.

2. The method for realizing a high-availability system based on health management according to claim 1, wherein said step S4 comprises the following sub-steps:

S41. The standby control node with the highest priority detects whether the main control node fails in real time through heartbeat; the standby control node with low priority detects in real time whether the standby control node with a higher priority than the standby control node fails through heartbeat;

S42. If it is detected that the master control node fails, the standby control node with the highest priority is configured as the master control node, and the priorities of the standby control nodes other than the standby control node with the highest priority are respectively increased by one level;

S43. If it is detected that the standby control node fails, the priorities of all the standby control nodes whose priorities are lower than the failed standby control node are respectively increased by one level.

3. The method for realizing a high-availability system based on health management according to claim 2, wherein said step S5 comprises the following sub-steps:

S51. The main control node analyzes the historical value matrix of the health status data column by column, and each column of health status data corresponds to a computing node;

S52. Determine whether all computing nodes have been analyzed once, if all computing nodes have been analyzed once, then jump to step S57; if there are computing nodes that have not been analyzed, then jump to step S53;

S53, respectively counting the number of acquisitions of the current computing node exceeding the health state reference range and below the health state reference range;

S54. If the number of acquisitions exceeding the health status reference range is less than k1, and the number of acquisitions lower than the health status reference range is less than k2, determine that the computing node is in a healthy state, and jump to step S55; if it exceeds the health status reference If the number of acquisitions in the range is greater than or equal to k1, or the number of acquisitions lower than the reference range of the healthy state is greater than or equal to k2, then it is determined that the computing node is in an unhealthy state and jumps to step S56;

S55. Set the unhealthy flag value of the computing node to 0, and jump to step S51;

S56. Obtain the maximum value of the number of acquisitions exceeding the health state reference range and the number of acquisitions below the health state reference range, set the maximum value as the unhealthy state value of the computing node, and jump to step S51;

S57. Analyze the non-health flag values of all computing nodes, and determine whether there are computing nodes with non-zero values; if there are no non-zero computing nodes, determine that all computing nodes are healthy, and if there are non-zero computing nodes node, it is determined that there is an unhealthy computing node;

S58. Manage the computing nodes according to the set policy, wait for the first preset time T1, and then jump to step S21.

4. The method for realizing a high-availability system based on health management according to claim 3, wherein the step S58 includes the following sub-steps:

S581. Set the first strategy, the second strategy, and the third strategy. The first strategy includes three options: prompt alarm, automatic migration task out, and alarm simultaneous automatic migration task out; the second strategy includes prompt alarm, automatic shutdown, and alarm simultaneous automatic Shutdown and other three options; the third strategy includes three options: prompt alarm, automatic migration task return, alarm and automatic migration task return;

S582. Determine whether there is a processing task in the unhealthy computing node, if there is a processing task in the unhealthy computing node, query the first strategy and perform processing according to the first strategy; if there is no processing task in the unhealthy computing node, query the first strategy The second strategy and handle it according to the second strategy;

S583. Determine whether a healthy computing node has a processing task. If the healthy computing node has a processing task, no operation is performed on the healthy computing node; if the healthy computing node does not have a processing task, determine that the healthy computing node is Restore from an unhealthy state to a healthy state, and record the current time stamp. If the difference between the current time stamp and the last time stamp is less than the second preset time T2, or if there is no last time stamp, no action will be taken. If the difference is greater than or equal to the second preset time T2, then proceed according to the third strategy;

S584. Jump to step S21 after waiting for the first preset time T1.

5. A high-availability system implementation system based on health management, characterized in that, the high-availability system implementation system based on health management includes the following modules:

The node configuration module is used to set at least two control nodes and at least two computing nodes; set one of the control nodes as the main control node, set the other control nodes as standby control nodes and configure the priority order of the standby control nodes; Set the priority of the master control node to be higher than that of any standby control node;

The data collection module is used to repeatedly collect and store the health status data of all computing nodes through each control node;

The node judgment module is used to judge whether the current control node is the main control node, if the current control node is the main control node, then jump to the data analysis module; if the current control node is the backup control node, then jump to the fault judgment module;

The fault judgment module is used to detect whether the main control node fails in real time through the heartbeat of the standby control node with the highest priority; through the heartbeat of the standby control node with low priority to detect whether the standby control node with a higher priority exists in real time Fault; when the control node fails, modify the priority order;

The data analysis module is used to analyze the health status data stored in the memory of the main control node through the main control node, and manage the computing nodes according to the analysis results;

Wherein said data acquisition module includes the following units:

The data collection unit is used to collect the health status data of all computing nodes once every fixed time T1 through each control node, and save the collected health status data to the respective memory of each control node;

The number of collection judgment unit is used to judge whether the number of times collected is less than the predetermined number p, and when the number of times collected is less than the predetermined number p, the function of the data acquisition unit is started; when the number of collections is greater than the predetermined number p, each control node memory is deleted For redundant historical health status data, keep the health status data collected for the latest p times; when the number of times collected is equal to the predetermined number p, directly keep the health status data collected for the latest p times;

The matrix configuration unit is configured to set the health state data of n computing nodes collected by each control node p times into a matrix of historical value of health state data with p rows and n columns.

6. The system for implementing a high-availability system based on health management according to claim 5, wherein the fault judgment module comprises the following units:

The fault judgment unit is used to detect whether the main control node is faulty through the heartbeat of the highest priority standby control node in real time; through the heartbeat of the low priority standby control node, it is used to detect whether the standby control node with a higher priority than the standby control node appears in real time Fault;

The first adjustment unit is configured to configure the standby control node with the highest priority as the primary control node when a failure of the primary control node is detected, and configure the priorities of the standby control nodes other than the standby control node with the highest priority move up a level;

The second adjustment unit is configured to increase the priorities of all standby control nodes whose priorities are lower than the failed standby control node by one level when it is detected that the standby control node fails.

7. The system for realizing a high-availability system based on health management according to claim 6, wherein the data analysis module comprises the following units:

The analysis unit is used to analyze the historical value matrix of the health status data column by column through the main control node, and each column of health status data corresponds to a computing node;

The analysis and judgment unit is used to judge whether all the computing nodes have been analyzed once. When all the computing nodes have been analyzed once, the function of the computing node judging unit is started; when there are computing nodes that have not been analyzed, the function of the statistical unit is started;

The statistics unit is used to count the number of acquisitions of the current computing node exceeding the reference range of the health state and below the reference range of the health state;

A state judging unit, configured to determine that the computing node is in a healthy state when the number of collections exceeding the health state reference range is less than k1, and the number of collections below the health state reference range is less than k2, and start the first flag value setting unit function; when the number of acquisitions exceeding the reference range of the health state is greater than or equal to k1, or the number of acquisitions below the reference range of the health state is greater than or equal to k2, it is determined that the computing node is in an unhealthy state, and the second flag is activated the function of the value setting unit;

The first flag value setting unit is used to set the unhealthy flag value of the computing node to 0, and start the function of the analysis unit;

The second flag value setting unit is used to obtain the maximum value of the number of acquisitions exceeding the health state reference range and the number of acquisitions below the health state reference range, and set the maximum value as the unhealthy state value of the computing node, and function to activate the analysis unit;

The computing node judging unit is used to analyze the unhealthy flag values of all computing nodes, and judge whether there are computing nodes with non-zero values; when there are no non-zero computing nodes, it is determined that all computing nodes are healthy. Computing nodes with non-zero value, it is determined that there are unhealthy computing nodes;

The computing node management unit is configured to manage the computing nodes according to the set policy, and start the function of the data collection unit after waiting for a first preset time T1.

8. The system for implementing a high-availability system based on health management according to claim 7, wherein the computing node management unit comprises the following subunits:

The strategy setting subunit is used to set the first strategy, the second strategy, and the third strategy. The first strategy includes three options: prompt alarm, automatic migration task out, and alarm and automatic migration task out; the second strategy includes prompt alarm, automatic There are three options: shutdown, alarm and automatic shutdown; the third strategy includes three options: prompt alarm, automatic migration task return, alarm and automatic migration task return;

The first task judging subunit is used to judge whether there is a processing task in the unhealthy computing node. When there is a processing task in the unhealthy computing node, query the first strategy and perform processing according to the first strategy; when the unhealthy computing node does not When there is a processing task, query the second strategy and process it according to the second strategy;

The second task judging subunit is used to judge whether there is a processing task in a healthy computing node, and when there is a processing task in a healthy computing node, no operation is performed on the healthy computing node; when there is no processing task in a healthy computing node, Determine that the healthy computing node has recovered from an unhealthy state to a healthy state, and record the current timestamp, if the difference between the current timestamp and the last timestamp is less than the second preset time T2, or there is no last time stamp, then do not do any operation, if the difference is greater than or equal to the second preset time T2, it will be processed according to the third strategy;

The jumping subunit is used to start the function of the data acquisition unit after waiting for the first preset time T1.