[go: up one dir, main page]

CN1257464C - Blade Server Management System with Hardware Standby Structure - Google Patents

Blade Server Management System with Hardware Standby Structure Download PDF

Info

Publication number
CN1257464C
CN1257464C CNB021563829A CN02156382A CN1257464C CN 1257464 C CN1257464 C CN 1257464C CN B021563829 A CNB021563829 A CN B021563829A CN 02156382 A CN02156382 A CN 02156382A CN 1257464 C CN1257464 C CN 1257464C
Authority
CN
China
Prior art keywords
management
blade type
management piece
piece
main
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
CNB021563829A
Other languages
Chinese (zh)
Other versions
CN1508713A (en
Inventor
张英哲
黄仁烜
张仲一
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Quanta Computer Inc
Original Assignee
Quanta Computer Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Quanta Computer Inc filed Critical Quanta Computer Inc
Priority to CNB021563829A priority Critical patent/CN1257464C/en
Publication of CN1508713A publication Critical patent/CN1508713A/en
Application granted granted Critical
Publication of CN1257464C publication Critical patent/CN1257464C/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Landscapes

  • Hardware Redundancy (AREA)

Abstract

一种具有硬件备用结构的刀片(blade)型服务器管理系统,是用来管理刀片型服务器与刀片型网桥,包含至少二块管理块与一块基板。管理块根据插在基板上的位置,依预定的时间顺序检测心跳(Heart Beat)信号,以形成直接管理的主管理块与备用管理的辅助管理块。在主管理块与辅助管理块之间,利用通信接口进行数据同步的工作,还利用网络接口进行备用。该系统还可使用连接到另一个通信接口的本地管理电脑备用管理该系统。该系统的辅助管理块,可以在主管理块失效时,提升为主管理块,所以可增加系统的稳定性与效率。

Figure 02156382

A blade-type server management system with a hardware backup structure is used to manage blade-type servers and blade-type network bridges, and includes at least two management blocks and a baseboard. The management blocks detect heartbeat signals in a predetermined time sequence according to the position of the management blocks inserted on the baseboard to form a main management block for direct management and an auxiliary management block for backup management. Between the main management block and the auxiliary management block, a communication interface is used for data synchronization, and a network interface is used for backup. The system can also use a local management computer connected to another communication interface for backup management of the system. The auxiliary management block of the system can be promoted to the main management block when the main management block fails, so the stability and efficiency of the system can be increased.

Figure 02156382

Description

具有硬件备用结构的刀片型服务器管理系统Blade Server Management System with Hardware Standby Structure

技术领域technical field

本发明涉及关于刀片(blade)型服务器管理系统,特别涉及硬件备用结构的刀片型服务器管理系统。The present invention relates to a blade server management system, in particular to a blade server management system with a hardware standby structure.

背景技术Background technique

由于科技的进步及对电脑系统的依赖程度越来越高。因此,市场对服务器系统的运算能力要求也越来越高,但随着可用空间的日渐狭窄,对服务器系统所占据的办公室或厂房空间,却要求能越来越小。传统服务器系统,为了具有高的稳定性,所以相对于一般桌上型电脑的大小,有过之而无不及,且在管理上及空间配置上,还衍生出来许多的服务器连线与管理的问题。对一般公司而言,少则仅有两三台的服务器,但多则上千台的服务器。因此服务器的管理与空间使用,还将随着企业对电脑的需求而日益凸显。Due to the advancement of technology and the increasing dependence on computer systems. Therefore, the market has higher and higher requirements for the computing power of the server system, but as the available space becomes narrower, the office or factory space occupied by the server system is required to be smaller and smaller. In order to have high stability, the traditional server system is larger than the size of ordinary desktop computers, and in terms of management and space configuration, many problems of server connection and management are also derived. . For general companies, there are as few as two or three servers, but as many as thousands of servers. Therefore, the management and space utilization of servers will become increasingly prominent along with the demand for computers by enterprises.

刀片型服务器是目前的趋势,其可以把所有服务器系统的硬件——包括处理器、存储器、硬盘驱动器、以及网络连线等功能,整合到单一的扩充卡,或者所谓的“刀片”(Blade)上,使得这种服务器拥有较高的运算能力与高稳定性,所占空间亦远比传统服务器少了许多。此外,还可降低成本与降低温度,在各项功能与性能上,都较传统服务器为之改善。刀片型服务器是以刀片插入的方式嵌进服务器机箱内,所以称之为刀片型服务器。使用者可依个别需求购置不同数量的服务器,拆装容易,只需花费几秒钟时间就可插入一台全新服务器。而每一片服务器都是互相独立的,当需求增加时,只要增购服务器嵌入即可,任何插入或移除的动作都不会影响到同一机板上其他服务器的动作。所以多个服务器可同时在同一个服务器的机箱内工作。其大小仅占约一台传统服务器的空间。Blade server is the current trend, which can integrate the hardware of all server systems-including functions such as processors, memory, hard drives, and network connections, into a single expansion card, or so-called "blade" (Blade). In terms of performance, this kind of server has high computing power and high stability, and occupies much less space than traditional servers. In addition, the cost and temperature can be reduced, and various functions and performances are improved compared with traditional servers. The blade server is embedded in the server chassis by inserting the blade, so it is called the blade server. Users can purchase different numbers of servers according to individual needs. It is easy to disassemble and assemble, and it only takes a few seconds to insert a new server. And each server is independent of each other. When the demand increases, you only need to purchase additional servers and embed them. Any insertion or removal will not affect the actions of other servers on the same board. So multiple servers can work in the same server chassis at the same time. It's about the size of a traditional server.

因此刀片服务器对于电信业、门户网站、因特网服务提供商(InternetServices Provider,ISP)等企业服务器使用者,以及需大量且极高速运算能力的工作,如气象数据的研究、天文观察、生物科技(如DNA的计算)、电影工业中的电脑动画特效等工作者,提供高速与稳定的服务器装置。Therefore, blade servers are suitable for enterprise server users such as telecommunications, portal websites, and Internet Service Providers (Internet Services Providers, ISPs), as well as tasks that require a large amount of extremely high-speed computing power, such as meteorological data research, astronomical observation, biotechnology (such as DNA computing), computer animation special effects in the film industry, etc., providing high-speed and stable server devices.

刀片服务器的管理,一般而言,可分为带内(In-Band)管理与带外(Out-Band)管理。所谓带内管理,是结构在BIOS或操作系统(OS)的软件上,使用软件进行服务器的管理,其缺点在于无任何管理块存在,只要服务器出现任何状况,将成瘫痪无法运行的状态。而带外管理,则是利用管理块进行服务器的管理。其虽然有一个管理块,但无任何备用机制存在。由于每台刀片服务器,同时管理多组的服务器,当管理块失效时,将造成刀片服务器由带外的管理模式退化成为带内管理模式。此时,其中任何一片的服务器发生异常时,服务器的管理人员,将无法再经由管理块中获得异常的现象,以采取对策及处理。造成服务器管理人员在管理上的困难,并影响服务器的稳定性。因此,如何有效改善刀片服务器管理系统的结构,使得服务器的运行状态均能为管理人员所控制,并能快速地解决服务器上所发生的问题,是每一信息管理人员与企业用户所企盼的。Generally speaking, blade server management can be divided into in-band (In-Band) management and out-of-band (Out-Band) management. The so-called in-band management is structured on the BIOS or operating system (OS) software, and uses software to manage the server. Its disadvantage is that there is no management block. As long as any situation occurs in the server, it will become paralyzed and unable to run. The out-of-band management is to use the management block to manage the server. Although it has a management block, no backup mechanism exists. Since each blade server manages multiple groups of servers at the same time, when the management block fails, the blade server will degenerate from the out-of-band management mode to the in-band management mode. At this time, when any one of the servers is abnormal, the management personnel of the server will no longer be able to obtain the abnormal phenomenon through the management block to take countermeasures and handle it. It causes difficulties for the server administrators in management and affects the stability of the server. Therefore, how to effectively improve the structure of the blade server management system so that the operating status of the server can be controlled by the management personnel and quickly solve the problems on the server is what every information management personnel and enterprise users look forward to.

发明内容Contents of the invention

鉴于在上述的技术背景中,刀片服务器利用管理块进行服务器的管理。但当每台刀片服务器同时管理数个服务器到二十个以上服务器,且当管理块失效时,服务器的管理人员,无法再经由管理块中获得异常的现象,以采取对策及处理。将造成服务器管理上的困难,因此,影响服务器工作的稳定性。In view of the above technical background, the blade server uses the management block to manage the server. But when each blade server manages several servers to more than 20 servers at the same time, and when the management block fails, the server management personnel can no longer obtain abnormal phenomena through the management block to take countermeasures and processing. It will cause difficulties in server management, thus affecting the stability of server work.

本发明的一个目的是提供一种具有硬件备用机制的刀片型服务器管理系统,能有效增加刀片型服务器的管理的稳定性。An object of the present invention is to provide a blade server management system with a hardware backup mechanism, which can effectively increase the management stability of the blade server.

本发明的又一目的是提供一种刀片型服务器管理系统,不仅具有硬件备用机制,还利用本地管理电脑备用以进行系统管理。Another object of the present invention is to provide a blade server management system, which not only has a hardware backup mechanism, but also utilizes a local management computer for system management.

根据以上所述的目的,本发明是一种具有硬件备用机制的刀片型服务器管理系统,用来管理刀片型服务器与刀片型网桥。该管理系统至少包含二块管理块、机板、与存储器。每一管理块均具有产生心跳(Heart Beat)信号的能力,并可用来管理上述的这些刀片型服务器与这些刀片型网桥的功能。每一管理块包含有两个通信接口,与一个网络接口。第一通信接口,用来进行管理块之间的数据同步,而第二通信接口,则连接本地管理电脑,以直接管理刀片型服务器与刀片型网桥。网络接口,是用来连接到刀片型网桥,以进行网络分组与信息交换,还可在第一通信接口失效时,进行管理块之间的数据同步工作。According to the purpose mentioned above, the present invention is a blade server management system with a hardware backup mechanism, which is used to manage blade servers and blade bridges. The management system includes at least two management blocks, a board, and a memory. Each management block has the ability to generate a Heart Beat signal, and can be used to manage the functions of the above blade servers and blade bridges. Each management block contains two communication interfaces and one network interface. The first communication interface is used for data synchronization among the management blocks, and the second communication interface is connected with a local management computer to directly manage the blade server and the blade network bridge. The network interface is used to connect to the blade bridge for network grouping and information exchange, and also for data synchronization between management blocks when the first communication interface fails.

基板则具有第一管理块插槽与第二管理块插槽,以及多个刀片插槽。其中,插在第一管理块插槽的管理块,等待第一预定时间之后,若没有另一管理块所传来的心跳信号,则自动形成主管理块。该主管理块将直接管理刀片型服务器与刀片型网桥,并产生心跳信号,传送至另一个管理块,且当另一个管理块收到心跳信号后,自动形成辅助管理块。管理块之间经由个别的第一通信接口,进行数据同步,使辅助管理块具有接手主管理块的备用管理机制。但若插在第二管理块插槽的管理块等待第二预定时间之后,并没有收到任何心跳信号时,则该管理块将形成主管理块。该主管理块产生心跳信号,传送至另一管理块,而另一管理块收到此心跳信号时,自动形成辅助管理块。一般而言,第二预定时间设定大于第一预定时间,例如第一预定时间约为3秒,而第二预定时间约为5秒。The base plate has a first management block slot, a second management block slot, and a plurality of blade slots. Wherein, the management block inserted in the first management block slot will automatically form the main management block if there is no heartbeat signal from another management block after waiting for the first predetermined time. The main management block will directly manage the blade server and the blade bridge, and generate a heartbeat signal, which is sent to another management block, and when another management block receives the heartbeat signal, it automatically forms an auxiliary management block. Data synchronization is performed between the management blocks via individual first communication interfaces, so that the auxiliary management block has a backup management mechanism to take over from the main management block. However, if the management block inserted in the second management block slot does not receive any heartbeat signal after waiting for the second predetermined time, the management block will become the master management block. The main management block generates a heartbeat signal and transmits it to another management block, and when the other management block receives the heartbeat signal, it automatically forms an auxiliary management block. Generally speaking, the second predetermined time is set to be longer than the first predetermined time, for example, the first predetermined time is about 3 seconds, and the second predetermined time is about 5 seconds.

存储器则连接到基板,是用来记录主管理块的工作记录,并在辅助管理块接替主管理块时,将这些工作记录提供给新主管理块,以继续管理刀片型服务器与刀片型网桥。The memory is connected to the base board and is used to record the work records of the main management block, and when the auxiliary management block takes over from the main management block, these work records are provided to the new main management block to continue to manage the blade server and the blade bridge .

上述的网络接口也可直接连接网络,直接进行网络分组与信息交换。而主管理块与辅助管理块,还可由应用程序改变主管理块与辅助管理块。而该管理系统还经由硬件电路的隔离,使辅助管理块不会进行刀片型服务器与刀片型网桥的直接管理工作。The above-mentioned network interface can also be directly connected to the network to directly perform network grouping and information exchange. As for the main management block and the auxiliary management block, the main management block and the auxiliary management block can also be changed by the application program. Moreover, the management system also prevents the auxiliary management block from directly managing the blade server and the blade bridge through the isolation of the hardware circuit.

因此,本发明可增加服务器工作的稳定性,使服务器管理人员能有效控制服务器的运行及随时获得服务器的使用状况。Therefore, the present invention can increase the working stability of the server, so that the server manager can effectively control the operation of the server and obtain the usage status of the server at any time.

附图说明Description of drawings

本发明的优选实施例将在稍后的说明中参照下列附图做更详细的阐述,其中:Preferred embodiments of the present invention will be described in more detail in the following description with reference to the following drawings, in which:

图1为本发明的具有硬件备用结构的刀片型服务器管理系统的优选实施例的示意图。FIG. 1 is a schematic diagram of a preferred embodiment of the blade server management system with hardware backup structure of the present invention.

图号说明:Description of figure number:

102服务器                   104网桥102 server 104 bridge

106第一管理块               108第二管理块106 The first management block 108 The second management block

110网络                     112第一本地管理电脑110 Network 112 The first local management computer

114第二本地管理电脑         116存储器114 second local management computer 116 memory

118基板            122第二通信接口118 Substrate 122 Second communication interface

124网络接口        126第一通信接口124 network interface 126 first communication interface

132第二通信接口    134网络接口132 second communication interface 134 network interface

136第一通信接口136 first communication interface

具体实施方式Detailed ways

以下将参考附图及本发明的优选实施例,详细说明本发明的精神,熟悉该技术的人员在了解了本发明的优选实施例后,可由本发明所教的技术,加以改变及修饰,其并不脱离本发明的精神与范围。参阅图1为本发明的具有硬件备用结构的刀片型服务器管理系统的优选实施例示意图。如图中所示,本发明包含第一管理块106、第二管理块108、网桥104、多个服务器102、与用来记录系统状态的存储器116、以及系统的基板118。而第一管理块106具有第二通信接口122、网络接口124与第一通信接口126,并通过基板118连接存储器116。同样地,第二管理块108具有第二通信接口132、网络接口134与第一通信接口136,并通过过基板118连接存储器116。The spirit of the present invention will be described in detail below with reference to the accompanying drawings and preferred embodiments of the present invention. After understanding the preferred embodiments of the present invention, those who are familiar with the art can change and modify the technology taught by the present invention. without departing from the spirit and scope of the present invention. Referring to FIG. 1 , it is a schematic diagram of a preferred embodiment of a blade server management system with a hardware backup structure according to the present invention. As shown in the figure, the present invention includes a first management block 106 , a second management block 108 , a network bridge 104 , a plurality of servers 102 , a memory 116 for recording system status, and a system substrate 118 . The first management block 106 has a second communication interface 122 , a network interface 124 and a first communication interface 126 , and is connected to the memory 116 through the substrate 118 . Likewise, the second management block 108 has a second communication interface 132 , a network interface 134 and a first communication interface 136 , and is connected to the memory 116 through the substrate 118 .

本发明的具有硬件备用结构的刀片型服务器管理系统,是利用第一管理块106与第二管理块108,进行刀片型服务器中的全部的服务器102与网桥104的管理工作。这些服务器102与网桥104的实际使用数量,是由使用者根据工作的需求来进行插入的。例如,当使用者欲使用二片服务器及一片网桥时,仅需将二片服务器与一片网桥插入本发明的具有硬件备用结构的刀片型服务器管理系统的基板118上,即可利用本发明的管理块进行这些服务器与网桥的管理与控制。服务器的主要功能是提供网络中电脑服务器的功能,而网桥则是提供网络信号交换的功能。The blade server management system with hardware backup structure of the present invention uses the first management block 106 and the second management block 108 to manage all the servers 102 and network bridges 104 in the blade server. The actual usage numbers of these servers 102 and network bridges 104 are inserted by users according to work requirements. For example, when a user wants to use two servers and a network bridge, he only needs to insert the two servers and a network bridge into the substrate 118 of the blade server management system with hardware backup structure of the present invention, and the present invention can be used. The management block manages and controls these servers and network bridges. The main function of the server is to provide the function of the computer server in the network, and the network bridge is to provide the function of network signal exchange.

当使用本发明的具有硬件备用结构的刀片型服务器管理系统时,第一管理块106与第二管理块108,分别插入基板118的管理块插槽中(未示出)。而所需使用的服务器102与网桥104则分别插入基板118上的刀片插槽中。根据管理块所插入的管理块插槽位置与管理块所发出的心跳信号,本发明的第一管理块106与第二管理块108,将自动形成主管理块与辅助管理块。例如,当第一管理块106插入第一管理块插槽,而第二管理块108则插入第二管理块插槽后。插在第一管理块插槽的第一管理块106,将在等待第一预定时间后,若没有收到由其他管理块所传来的心跳信号,则第一管理块106将自动形成本管理系统的主管理块。但若在第一预定时间内,第一管理块106收到由其他管理块所传来的心跳信号时,则第一管理块106将自动形成本管理系统的辅助管理块,并经由硬件电路的隔离(Hardware Isolation)的设计而成为监视模式。When using the blade server management system with hardware backup structure of the present invention, the first management block 106 and the second management block 108 are respectively inserted into the management block slots of the base plate 118 (not shown). The server 102 and the network bridge 104 to be used are respectively inserted into the blade slots on the substrate 118 . According to the position of the management block slot where the management block is inserted and the heartbeat signal sent by the management block, the first management block 106 and the second management block 108 of the present invention will automatically form a main management block and an auxiliary management block. For example, when the first management block 106 is inserted into the first management block slot, the second management block 108 is inserted into the second management block slot. The first management block 106 inserted in the first management block slot will wait for the first predetermined time, if it does not receive the heartbeat signal sent by other management blocks, then the first management block 106 will automatically form this management block. The main management block of the system. But if within the first predetermined time, the first management block 106 receives heartbeat signals from other management blocks, then the first management block 106 will automatically form the auxiliary management block of the management system, and through the hardware circuit Isolation (Hardware Isolation) design and become monitor mode.

而此时插在第二管理块插槽的第二管理块108,将先等待第二预定时间,当超过此第二预定时间后,若第二管理块108仍未收到任何由其他管理块所传来的心跳信号,则第二管理块108将自动形成主管理块。但若在此第二预定时间内,第二管理块108收到由其他管理块所传来的心跳信号时,则第二管理块108将自动形成本管理系统的辅助管理块,并经由硬件电路的隔离的设计而成为监视模式。一般而言,在本发明的优选实施例中,此第一预定时间约为3秒,而第二预定时间较第一预定时间略长,约为5秒。因此,插在第一管理块插槽的第一管理块106将等待约3秒,以确定系统中是否有其他的主管理块已经在进行主管理块的工作并发出心跳信号,若是没有则将由第一管理块106形成主管理块。而插在第二管理块插槽的第二管理块108,将等待约5秒,以确定系统中并没其他的主管理块已经在进行管理的工作之后并发出心跳信号,自动由第二管理块108形成主管理块。所以本发明的管理块可清楚地根据预定的时间差,自动形成主管理块与辅助管理块。And this moment, the second management block 108 inserted in the second management block slot will wait for the second predetermined time earlier. After exceeding this second predetermined time, if the second management block 108 has not received any If the transmitted heartbeat signal is received, the second management block 108 will automatically form the main management block. However, if the second management block 108 receives heartbeat signals from other management blocks within the second predetermined time, the second management block 108 will automatically form an auxiliary management block of the management system, and through the hardware circuit The design of isolation becomes monitor mode. Generally speaking, in a preferred embodiment of the present invention, the first predetermined time is about 3 seconds, and the second predetermined time is slightly longer than the first predetermined time, about 5 seconds. Therefore, the first management block 106 inserted in the first management block slot will wait for about 3 seconds to determine whether there are other main management blocks in the system that are already carrying out the work of the main management block and sending a heartbeat signal, if not, it will be performed by The first management block 106 forms the main management block. And the second management block 108 that is inserted in the second management block slot will wait for about 5 seconds to determine that there are no other main management blocks in the system after performing management work and send a heartbeat signal, automatically by the second management block. Block 108 forms the main management block. Therefore, the management block of the present invention can automatically form the main management block and the auxiliary management block according to the predetermined time difference.

而本发明的具有硬件备用结构的刀片型服务器管理系统,若在实际使用时,第一管理块插槽中并未插入任何管理块时,则根据前述的判断逻辑,插在第二管理块插槽中的管理块将在等待约5秒后,自动形成主管理块。However, in the blade server management system with a hardware backup structure of the present invention, if no management block is inserted into the first management block slot during actual use, then according to the aforementioned judgment logic, insert the second management block into the second management block slot. The management block in the slot will automatically form the main management block after waiting for about 5 seconds.

当本发明的具有硬件备用结构的刀片型服务器管理系统,清楚地完成主管理块与辅助管理块的设定后,主管理块将负责系统的管理,并将系统所有发生的状态记录到系统基板的存储器116中,例如电可擦除可编程只读存储器(Electrically Erasable Prgorammable Read-Only Memory,EEPROM)。而辅助管理块则进入监视模式,当主管理块失效时,辅助管理块将接替管理的工作,并自动成为主管理块。此时新主管理块将从存储器116中读取原来主管理块失效时的系统状态记录,并继续整个系统的管理工作。由于辅助管理块在主管理块停止工作时,自动地形成新主管理块,因此本发明的刀片型服务器管理系统,因具有硬件备用结构,所以不会因原主管理块停止工作时而失去对系统的管理工作。When the blade server management system with hardware backup structure of the present invention clearly completes the setting of the main management block and the auxiliary management block, the main management block will be responsible for the management of the system and record all the states of the system to the system substrate In the memory 116, such as Electrically Erasable Programmable Read-Only Memory (Electrically Erasable Prgorammable Read-Only Memory, EEPROM). The auxiliary management block enters the monitoring mode. When the main management block fails, the auxiliary management block will take over the management work and become the main management block automatically. At this time, the new main management block will read the system status record when the original main management block fails from the memory 116, and continue the management work of the whole system. Because the auxiliary management block automatically forms a new main management block when the main management block stops working, the blade server management system of the present invention has a hardware backup structure, so it will not lose control of the system when the original main management block stops working. management work.

在辅助管理块形成新主管理块之后,失效的管理块将可直接由刀片服务器的管理块插槽中取出,维修或更换成新管理块。当此新管理块插入原先的第一管理块插槽中时,由于位于第二管理块插槽中的第二管理块108,此时已形成主管理块,并发出心跳信号。因此,位于第一管理块插槽中的新第一管理块106,将自动形成辅助管理块,随时待命,在主管理块失效时,接替管理的工作。因此本发明的具有硬件备用结构的刀片型服务器管理系统,可实现无管理死角的刀片型服务器管理工作。After the auxiliary management block forms a new main management block, the failed management block can be directly taken out from the management block slot of the blade server, and can be repaired or replaced with a new management block. When the new management block is inserted into the original first management block slot, the second management block 108 located in the second management block slot has formed the main management block at this time and sends out a heartbeat signal. Therefore, the new first management block 106 located in the slot of the first management block will automatically form an auxiliary management block, which is on standby at any time, and takes over the management work when the main management block fails. Therefore, the blade server management system with hardware backup structure of the present invention can realize blade server management work without management dead angle.

主管理块与辅助管理块,经由其本身各自的第一通信接口126与136,进行重要数据的同步的工作。且通过硬件电路的自动隔离设计,使辅助管理块并不会进行服务器与网桥的管理工作。辅助管理块仅通过第一通信接口126与136,以取得主管理块的数据,例如,区域网络的媒体存取控制(Media-AccessControl,MAC)地址与机板识别符(Chassis ID)。而主管理块则同样地通过第一通信接口126与136取得辅助管理块的数据,例如,场地替代单元(FieldReplacement Unit,FRU)与区域网络的媒体存取控制地址。The main management block and the auxiliary management block perform important data synchronization via their respective first communication interfaces 126 and 136 . And through the automatic isolation design of the hardware circuit, the auxiliary management block will not manage the server and the network bridge. The auxiliary management block only uses the first communication interfaces 126 and 136 to obtain the data of the main management block, such as Media-Access Control (MAC) address and Chassis ID of the LAN. The main management block also obtains the data of the auxiliary management block through the first communication interfaces 126 and 136, for example, the MAC address of the Field Replacement Unit (Field Replacement Unit, FRU) and the LAN.

由于辅助管理块并不会进行服务器与网桥的管理工作,所以不会造成管理上的混乱及效率上的降低。而从第一通信接口126与136取得主管理块的数据,所以当主管理块失去作用时,或者是服务器管理员通过应用程序要求改变主管理块时,辅助管理块可在主管理块停止心跳信号传送后,自动成为新主管理块,并发出信息,以更新系统的相关数据,而不会使数据遗失,且服务器管理员还利用此新主管理块立刻进行系统管理,所以不会形成管理上的空档。Since the auxiliary management block does not manage the server and the network bridge, it will not cause confusion in management and decrease in efficiency. The data of the main management block is obtained from the first communication interface 126 and 136, so when the main management block fails, or the server administrator requests to change the main management block through the application program, the auxiliary management block can stop the heartbeat signal at the main management block After transmission, it will automatically become the new master management block and send out information to update the relevant data of the system without data loss, and the server administrator also uses this new master management block to manage the system immediately, so there will be no management problems. vacancy.

为更进一步的确保辅助管理块与主管理块之间的数据与信息能维持一致,以使辅助管理块能顺利的替代主管理块的工作。在本发明的刀片型服务器管理的硬件备用系统的管理块之间,除了以第一通信接口126与136进行数据的更新外,还使用网络接口124与134,以形成另一个备用的数据交换电路。也就是说,当管理块之间的第一通信接口126与136因为硬件或其它原因失去作用时,网络接口124与134将立刻成为管理块之间备用的数据交换电路。而网络接口124与134也同样提供本发明的具有硬件备用结构的刀片型服务器管理系统的网络连接的工作,以进行网络分组的交换。本发明的管理系统与网络连接,一方面可通过网桥104,以减少系统间的电缆线数量,而另一方面网络110也可以直接连接到管理块上的网络接口124与134,直接通过每个管理块上个别的网络接口,进行系统的管理与控制。In order to further ensure that the data and information between the auxiliary management block and the main management block can maintain consistency, so that the auxiliary management block can successfully replace the work of the main management block. Between the management blocks of the hardware backup system managed by the blade server of the present invention, in addition to updating data with the first communication interfaces 126 and 136, network interfaces 124 and 134 are also used to form another standby data exchange circuit . That is to say, when the first communication interfaces 126 and 136 between the management blocks fail due to hardware or other reasons, the network interfaces 124 and 134 will immediately become backup data exchange circuits between the management blocks. The network interfaces 124 and 134 also provide the network connection of the blade server management system with hardware backup structure of the present invention, so as to exchange network packets. The management system of the present invention is connected to the network. On the one hand, the network bridge 104 can be used to reduce the number of cables between the systems. On the other hand, the network 110 can also be directly connected to the network interfaces 124 and 134 on the management block, directly through each Individual network interfaces on each management block for system management and control.

同时本发明的具有硬件备用结构的刀片型服务器管理系统的每一个管理块,还包含第二通信接口122与132,例如,为管理通信接口,直接与第一本地管理电脑112与第二本地管理电脑114连接。所以,管理员亦可使用本地电脑直接对管理块进行系统的管理与控制,使本发明的系统更为安全与稳定。Simultaneously each management block of the blade type server management system with hardware standby structure of the present invention also comprises the second communication interface 122 and 132, for example, for the management communication interface, directly with the first local management computer 112 and the second local management A computer 114 is connected. Therefore, the administrator can also use the local computer to directly manage and control the system of the management block, making the system of the present invention more secure and stable.

因此本发明的具有硬件备用结构的刀片型服务器管理系统,利用硬件的隔离设计,自动将辅助管理块隔离在管理机制之外,使辅助管理块不会影响服务器与网桥的管理,再利用通信接口及网络接口将两块管理块相连接,以维持重要数据的同步。所以辅助管理块可在任何时间均做好备用的准备工作,随时可接手成为主管理块,使服务器的管理更为安全无误。而服务器管理人员可将损坏的主服务器加以更换,然后插入,自动形成新辅助管理块。所以本发明的刀片型服务器管理的硬件备用结构,可在完全不影响正常运行的情况下,更换损坏的硬件设备。还由于利用网络接口的备用数据交换能力,使两块管理块的数据同步工作更为安全与稳定。因此,服务器管理人员不必因为担心单个硬件设备故障所形成的管理危机,仅需在系统通知发生硬件损坏之时,将已损坏的硬件加以更换。因为备用的系统早已立刻取代原来的管理与通信系统,使服务器管理工作,真正达到低风险管理境界。而且本发明的管理块还可以直接连接网络,通过通信接口连接本地管理电脑,使本发明的具有硬件备用结构的刀片型服务器管理系统的稳定性与安全性更为提高。本发明并不限定仅使用两块管理块,可根据使用者需求,以决定管理块的数量。Therefore, the blade server management system with hardware backup structure of the present invention utilizes the isolation design of the hardware to automatically isolate the auxiliary management block from the management mechanism, so that the auxiliary management block will not affect the management of the server and the network bridge, and then utilizes the communication The interface and the network interface connect the two management blocks to maintain the synchronization of important data. Therefore, the auxiliary management block can be prepared for backup at any time, and can take over as the main management block at any time, so that the management of the server is more secure and correct. The server management personnel can replace the damaged main server, and then insert it to automatically form a new auxiliary management block. Therefore, the hardware backup structure managed by the blade server of the present invention can replace damaged hardware devices without affecting normal operation at all. Furthermore, due to the use of the spare data exchange capability of the network interface, the data synchronization of the two management blocks is safer and more stable. Therefore, server administrators do not need to worry about the management crisis caused by the failure of a single hardware device, but only need to replace the damaged hardware when the system notifies that the hardware is damaged. Because the backup system has already replaced the original management and communication system immediately, the server management work has truly reached the realm of low-risk management. Moreover, the management block of the present invention can also be directly connected to the network, and connected to the local management computer through the communication interface, so that the stability and safety of the blade server management system with hardware backup structure of the present invention are further improved. The present invention does not limit the use of only two management blocks, and the number of management blocks can be determined according to user requirements.

如熟悉本技术的人员所了解的,以上所述的仅为本发明的优选实施例而已,并非用以限定本发明的权利要求;凡是其它未脱离本发明所揭示的精神下所完成的等效改变或修饰,均应包含在所附的权利要求之内。As understood by those familiar with the art, what is described above is only a preferred embodiment of the present invention, and is not intended to limit the claims of the present invention; all other equivalents that do not depart from the spirit disclosed by the present invention are completed Any change or modification should be included in the appended claims.

Claims (20)

1 one kinds of blade type server management systems with hardware alternate configuration are used for managing blade type server and blade type bridge, and wherein, this blade type server management system with hardware alternate configuration comprises at least:
A substrate has a plurality of insert pocket, the first management piece slot and the second management piece slot, and wherein, these a plurality of insert pocket are used for connecting this blade type server and this blade type bridge;
The first management piece is connected to this first management piece slot, after waiting for for first schedule time, when not receiving first heartbeat signal, forms the main management piece, with control and this blade type server of management and this blade type bridge, and produces second heartbeat signal;
The second management piece is connected to this second management piece slot, after receiving this second heartbeat signal, forms auxiliary management piece, obtains data from this main management piece, to carry out the standby management work of this blade type server and this blade type bridge; And
Storer is connected to this substrate, is used for writing down the job record of this main management piece.
2. the blade type server management system with hardware alternate configuration as claimed in claim 1, wherein, the second above-mentioned management piece is after waiting for for second schedule time, when not receiving this second heartbeat signal, form new main management piece with replacing old main management piece automatically, and from this storer, obtain the job record of old main management piece, to continue control and this blade type server of management and this blade type bridge, and produce this first heartbeat signal, and this storer continues the job record of this new main management piece of record.
3. the blade type server management system with hardware alternate configuration as claimed in claim 2, wherein, the first above-mentioned management piece, when within this first schedule time, receiving this first heartbeat signal, to form new auxiliary management piece automatically, and obtain the data of this new main management piece, to carry out the standby management work of this blade type server and this blade type bridge.
4. the blade type server management system with hardware alternate configuration as claimed in claim 2, wherein, the second above-mentioned schedule time is greater than this first schedule time.
5. the blade type server management system with hardware alternate configuration as claimed in claim 4, wherein, the second above-mentioned schedule time is about 5 seconds, and this first schedule time is about 3 seconds.
6. the blade type server management system with hardware alternate configuration as claimed in claim 1 wherein, between above-mentioned main management piece and this auxiliary management piece, also utilizes communication interface to carry out the work of data sync.
7. the blade type server management system with hardware alternate configuration as claimed in claim 6, wherein, between above-mentioned main management piece and this auxiliary management piece, when this communication interface can't be carried out the work of this data sync, also utilize network interface to carry out the work of this data sync.
8. the blade type server management system with hardware alternate configuration as claimed in claim 7, wherein, the work of above-mentioned data sync comprises: this auxiliary management piece is obtained the medium access control address of Local Area Network of this main management piece and the data of machine plate identifier.
9. the blade type server management system with hardware alternate configuration as claimed in claim 8, wherein, the work of above-mentioned data sync also comprises: this main management piece is obtained the data of the medium access control address of this place substituting unit of assisting the management piece and Local Area Network.
10. the blade type server management system with hardware alternate configuration as claimed in claim 1, wherein, above-mentioned main management piece and this auxiliary management piece, also can require to change this main management piece by application program, make this main management piece become this auxiliary management piece, and should become this main management piece by auxiliary management piece.
11. the blade type server management system with hardware alternate configuration as claimed in claim 1 also via the isolation of hardware circuit, makes this auxiliary management piece can not carry out the direct management work of this blade type server and this blade type bridge.
12. the blade type server management system with hardware alternate configuration as claimed in claim 1, wherein, above-mentioned auxiliary management piece, when this main management piece lost efficacy, the function of this main management piece will be replaced at once, and send the information update system related data, make server administrators proceed the direct management work of this blade type server and this blade type bridge.
13. the blade type server management system with hardware alternate configuration as claimed in claim 1, wherein, above-mentioned main management piece and this auxiliary management piece, has the supervisory communications interface respectively, directly connect the local management computer, carry out the standby management work of this blade type server and this blade type bridge.
14. the blade type server management system with hardware alternate configuration comprises at least:
At least one blade type server, the function of the device of providing services on the Internet;
At least one blade type bridge, the function that provides network signal to exchange;
Two management pieces have the function that produces heartbeat signal and manage these blade type servers and these blade type bridges, and wherein each this management piece also comprises:
First communication interface, carrying out the data sync of these management between pieces,
The second communication interface connects the local management computer, has the function of these blade type servers of direct management and these blade type bridges, with
Network interface is connected to these blade type bridges, carries out network packet and message exchange;
A substrate has the first management piece slot and the second management piece slot, is used for connecting these two management pieces, and a plurality of insert pocket, is used for connecting these blade servers and these blade type bridges, wherein,
When this management piece that is inserted in this first management piece slot, wait for after one first schedule time, do not receive first heartbeat signal that is transmitted by another this management piece that is inserted in this second management piece slot, then be inserted in this management piece of this first management piece slot, form a main management piece, this main management piece is directly managed these blade type servers and these blade type bridges, and produce second heartbeat signal, be sent to another this management piece that is inserted in this second management piece slot, and after another this management piece receives second heartbeat signal, form an auxiliary management piece, should auxiliaryly manage piece and this main management piece,, carry out this data sync via this first communication interface separately, should auxiliaryly manage piece and carry out the standby management work of these blade type servers and this blade type bridge, with
When this management piece of another piece that is inserted in this second management piece slot, wait for after second schedule time, do not receive this second heartbeat signal that is transmitted by this management piece that is inserted in this first management piece slot, then be inserted in another this management piece of this second management piece slot, replace this main management piece, form new main management piece, this new main management piece is managed these blade type servers and these blade type bridges, and produce first heartbeat signal, be sent to this first management piece slot, and this management piece that is inserted in this first management piece slot is when receiving this heartbeat signal, form new auxiliary management piece, should newly assist management piece and this new main management piece, via this first communication interface separately, carry out this data sync, this is assisted management piece and carries out the standby management work of these blade type servers and these blade type bridges, wherein, this second schedule time is greater than this first schedule time; And
Storer, be connected to this substrate, be used for writing down the job record of this main management piece, wherein, when this new main management piece replaces this main management piece, this new main management piece is by obtaining this job record, to continue these blade type servers of management and these blade type bridges, this job record of this storer and this new main management piece of continuation record in this storer.
15. the blade type server management system with hardware alternate configuration as claimed in claim 14, wherein, between above-mentioned main management piece and this auxiliary management piece, when this first communication interface can't be carried out the work of this data sync, also utilize this network interface to carry out the work of this data sync.
16. the blade type server management system with hardware alternate configuration as claimed in claim 15, wherein, the work of above-mentioned data sync comprises this auxiliary management piece and obtains the medium access control address of Local Area Network of this main management piece and the data of machine plate identifier.
17. the blade type server management system with hardware alternate configuration as claimed in claim 16, wherein, the work of above-mentioned data sync comprises the data that this main management piece is obtained the medium access control address of this place substituting unit of assisting the management piece and Local Area Network.
18. the blade type server management system with hardware alternate configuration as claimed in claim 14, wherein, above-mentioned network interface is direct interconnection network also, to carry out this network packet and message exchange.
19. the blade type server management system with hardware alternate configuration as claimed in claim 14, wherein, above-mentioned main management piece and this auxiliary management piece, also can require to change this main management piece by application program, make this main management piece become this auxiliary management piece, and should become this main management piece by auxiliary management piece.
20. the blade type server management system with hardware alternate configuration as claimed in claim 14 also via the isolation of hardware circuit, makes this auxiliary management piece can not carry out the direct management work of these blade type servers and these blade type bridges.
CNB021563829A 2002-12-18 2002-12-18 Blade Server Management System with Hardware Standby Structure Expired - Lifetime CN1257464C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB021563829A CN1257464C (en) 2002-12-18 2002-12-18 Blade Server Management System with Hardware Standby Structure

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB021563829A CN1257464C (en) 2002-12-18 2002-12-18 Blade Server Management System with Hardware Standby Structure

Publications (2)

Publication Number Publication Date
CN1508713A CN1508713A (en) 2004-06-30
CN1257464C true CN1257464C (en) 2006-05-24

Family

ID=34236200

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB021563829A Expired - Lifetime CN1257464C (en) 2002-12-18 2002-12-18 Blade Server Management System with Hardware Standby Structure

Country Status (1)

Country Link
CN (1) CN1257464C (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101547550B (en) * 2008-03-24 2011-07-20 英业达股份有限公司 Server system and circuit board thereof

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWM242781U (en) * 2002-11-25 2004-09-01 Quanta Comp Inc Blade server management system with auxiliary management structure
CN100508481C (en) * 2004-07-27 2009-07-01 广达电脑股份有限公司 Method for automatically distributing communication port address and blade type server system thereof
JP2006048122A (en) * 2004-07-30 2006-02-16 Ntt Docomo Inc Communications system
CN100375961C (en) * 2005-07-12 2008-03-19 广达电脑股份有限公司 Error detection method and device applied to blade servo system
CN101499961B (en) * 2008-01-28 2011-12-07 联想(北京)有限公司 Blade server and method for managing blade address
US8639661B2 (en) * 2008-12-01 2014-01-28 Microsoft Corporation Supporting media content revert functionality across multiple devices
CN102902612A (en) * 2012-09-18 2013-01-30 曙光信息产业股份有限公司 Management system applicable to Loongson blade server
CN103473152B (en) * 2013-09-25 2017-03-01 郑州云海信息技术有限公司 A kind of active and standby management module backup of blade server and update method
CN103634141A (en) * 2013-11-01 2014-03-12 浪潮电子信息产业股份有限公司 Symmetric recovery method for blade server management network
CN108628412A (en) * 2017-11-30 2018-10-09 英业达科技有限公司 Cutter point server

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101547550B (en) * 2008-03-24 2011-07-20 英业达股份有限公司 Server system and circuit board thereof

Also Published As

Publication number Publication date
CN1508713A (en) 2004-06-30

Similar Documents

Publication Publication Date Title
US7430616B2 (en) System and method for reducing user-application interactions to archivable form
US7085961B2 (en) Redundant management board blade server management system
US7330897B2 (en) Methods and apparatus for storage area network component registration
US7681088B2 (en) Apparatus expressing high availability cluster demand based on probability of breach
CN1175353C (en) A Realization Method of Dual Computer Backup
CN1257464C (en) Blade Server Management System with Hardware Standby Structure
US9448615B2 (en) Managing power savings in a high availability system at a redundant component level of granularity
CN113765697B (en) Method and system for managing logs of a data processing system and computer readable medium
US11494130B2 (en) Operation data accessing device and accessing method thereof
CN111045602A (en) Cluster system control method and cluster system
CN1308278A (en) IP fault-tolerant method for colony server
TWI437445B (en) Computer managing method of blade server
US8565067B2 (en) Apparatus, system, and method for link maintenance
JP2003203060A (en) Managing system network connections
CN1221904C (en) Management System of Blade Server
JP7440747B2 (en) Information processing equipment, information processing system, and network communication confirmation method
CN1482773A (en) Implementation of Fault Tolerant Transmission Control Protocol
CN118502788A (en) A firmware upgrade method, device, electronic device and storage medium
TW202026882A (en) Method for remotely clearing abnormal status of racks applied in data center
Barber Increased Server Availibility Through Failover Capability
CN117827379A (en) A high-speed resource scheduling system based on cloud space
WO2025085654A1 (en) Infrastructure independent self-configuring management network
CN115665178A (en) Management method, system, device and storage medium of a distributed storage system
CN118764498A (en) A hardware support platform system for information processing system
Marks Server Issues and Trends, 2000

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CX01 Expiry of patent term
CX01 Expiry of patent term

Granted publication date: 20060524