[go: up one dir, main page]

CN114816911A - Management system of multi-node server and multi-node server - Google Patents

Management system of multi-node server and multi-node server Download PDF

Info

Publication number
CN114816911A
CN114816911A CN202210394747.4A CN202210394747A CN114816911A CN 114816911 A CN114816911 A CN 114816911A CN 202210394747 A CN202210394747 A CN 202210394747A CN 114816911 A CN114816911 A CN 114816911A
Authority
CN
China
Prior art keywords
bmc
fan
power supply
node server
management system
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210394747.4A
Other languages
Chinese (zh)
Inventor
张秀波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202210394747.4A priority Critical patent/CN114816911A/en
Publication of CN114816911A publication Critical patent/CN114816911A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3089Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents
    • G06F11/3093Configuration details thereof, e.g. installation, enabling, spatial arrangement of the probes
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/32Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/321Display for diagnostics, e.g. diagnostic result display, self-test user interface

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Power Sources (AREA)

Abstract

The invention discloses a management system of a multi-node server and the multi-node server, which are applied to the technical field of server management and comprise a selection module respectively connected with a power supply, a fan and a plurality of BMCs, a processor connected with the selection module and BMCs respectively arranged in each node of the multi-node server, wherein the processor can control each BMC to be connected with the power supply and the fan according to the preset priority cycle through the selection module. Therefore, in the scheme, the management right of the CMC is transferred to each BMC, and each BMC monitors and manages the power supply and the fan, so that the management structure is simple, the stability of a management system is enhanced, the use scene is enlarged, in addition, the CMC is omitted, and the management cost is reduced.

Description

Management system of multi-node server and multi-node server
Technical Field
The present invention relates to the technical field of server management, and in particular, to a management system for a multi-node server and a multi-node server.
Background
With the increase of the computing demand of users on computers, the multi-node server is also in operation. In the prior art, in order to perform centralized Management on a power supply and a fan in a multi-node server, a multi-level Management architecture such as a CMC (Chassis Management Controller) two-level Management architecture is introduced in particular, where the CMC two-level Management architecture includes a CMC and a plurality of BMCs in each node in the multi-node server. However, the two-level management architecture of the CMC has the disadvantages of complex architecture and complex management network of the client room, so that the CMC has few scenes used by the client and has high cost.
Disclosure of Invention
The invention aims to provide a management system of a multi-node server and the multi-node server, which have simple management architecture, enhance the stability of the management system, enlarge the use scenes, omit CMC and reduce the management cost.
In order to solve the above technical problem, the present invention provides a management system for a multi-node server, wherein the multi-node server includes a power supply and a fan, the management system for the multi-node server includes:
the BMC is respectively arranged in each node of the multi-node server and is used for monitoring a power supply and a fan connected with the BMC;
a selection module respectively connected with the power supply, the fan and the plurality of BMCs;
and the processor is connected with the selection module and used for controlling each BMC to be connected with the power supply and the fan according to the round robin of the preset priority through the selection module.
Preferably, each BMC is specifically configured to acquire power data of a power supply connected to the BMC and fan data of a fan, and control the rotation speed of the fan according to the temperature data of the multi-node server.
Preferably, each BMC is further configured to display the power data and/or the fan data through a display module when receiving a power data query instruction and/or a fan data query instruction.
Preferably, the processor is further configured to control the BMC corresponding to the BMC control instruction to be connected to the power supply and the fan when the BMC control instruction is received.
Preferably, the processor is further connected with a plurality of BMCs; the processor is further configured to send a notification instruction to the BMC connected to the power supply and the fan after controlling the BMC to be connected to the power supply and the fan, so that the BMC monitors the power supply and the fan connected to the BMC after receiving the notification instruction.
Preferably, the selection module is an I2C MUX.
Preferably, the processor is further connected with a plurality of BMCs; controlling each BMC to be connected with the power supply and the fan according to the round robin of the preset priority through the selection module, wherein the round robin comprises the following steps:
and when a monitoring completion message sent by the BMC which monitors the power supply and the fan at present is received, the selection module controls the next BMC to be connected with the power supply and the fan in a round-robin manner according to the preset priority.
Preferably, each BMC is connected to the processor through a GPIO interface.
Preferably, the step of controlling each BMC to be connected with the power supply and the fan by the selection module according to a preset priority cycle includes:
and starting timing when the current BMC is controlled to be connected with the power supply and the fan, and controlling the next BMC to be connected with the power supply and the fan according to the preset priority round-robin sequence through the selection module when the timing reaches a preset time length.
In order to solve the above technical problem, the present invention further provides a multi-node server, including a power supply and a fan, and further including the management system of the multi-node server, where the management system of the multi-node server is connected to the power supply and the fan respectively.
The invention provides a management system of a multi-node server and the multi-node server, which are applied to the technical field of server management and comprise a selection module, a processor and a BMC, wherein the selection module is respectively connected with a power supply, a fan and a plurality of BMCs, the processor is connected with the selection module, the BMC is respectively arranged in each node of the multi-node server, and the processor can control each BMC to be connected with the power supply and the fan according to the preset priority cycle through the selection module. Therefore, in the scheme, the management right of the CMC is transferred to each BMC, and each BMC monitors and manages the power supply and the fan, so that the management structure is simple, the stability of a management system is enhanced, the use scene is enlarged, in addition, the CMC is omitted, and the management cost is reduced.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed in the prior art and the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings without creative efforts.
Fig. 1 is a schematic structural diagram of a management system of a multi-node server according to the present invention.
Detailed Description
The core of the invention is to provide a management system of a multi-node server and the multi-node server, the management architecture is simple, the stability of the management system is enhanced, the use scene is enlarged, in addition, CMC is omitted, and the management cost is reduced.
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, fig. 1 is a schematic structural diagram of a management system of a multi-node server according to the present invention.
The multi-node server comprises a power supply and a fan, and the management system of the multi-node server comprises:
a BMC 1(Baseboard Management Controller) respectively disposed in each node of the multi-node server, wherein each BMC 1 is configured to monitor a power supply and a fan connected thereto;
a selection module 2 respectively connected with the power supply, the fan and the plurality of BMCs 1;
and the processor 3 is connected with the selection module 2 and is used for controlling each BMC 1 to be connected with a power supply and a fan according to the preset priority cycle through the selection module 2.
The multi-node server may be a 4-node server or an 8-node server, and the present application is not limited thereto.
In order to simplify the management architecture of the multi-node server, in the application, the management system of the multi-node server includes a selection module 2, a processor 3 and BMCs 1 arranged in each node of the multi-node server, and in addition, the management right of the CMC is also transferred to each BMC 1, so that the power and the fan can be monitored through the BMCs 1 in each node, and a user can conveniently inquire the monitoring data of the power and the fan through the BMCs 1.
Specifically, the selection module 2 is connected to each BMC 1, and is further connected to the power supply, the fan, and the processor 3, and in addition, a priority may be preset for each BMC 1, and taking a 4-node server as an example, the management system includes 4 BMCs 1, where the priority may be BMC 1 in node 1, BMC 1 in node 2, BMC 1 in node 3, and BMC 1 in node 4. The processor 3 controls the selection module 2 to control each BMC 1 to be connected with the power supply and the fan according to the preset priority cycle, and when the BMC 1 is connected with the power supply and the fan, the BMC 1 can monitor the power supply and the fan, for example, obtain power supply data of the power supply and fan data of the fan, and can also control the rotating speed of the fan according to the current temperature in the multi-node server.
In the present application, the processor 3 may be, but is not limited to, a CPLD (Complex Programmable logic device), and the Power supply may be a PSU (Power supply unit). There may be one or more fans in the multi-node server, and the number of power supplies may be, but is not limited to, 1.
Therefore, in the scheme, the CMC multi-level architecture is simplified, the management right of the CMC is transferred to each BMC 1, each BMC 1 is used for monitoring and managing the power supply and the fan, the management architecture is simple, the stability of a management system is enhanced, the use scene is enlarged, the CMC is omitted, and the management cost is reduced.
On the basis of the above-described embodiment:
as a preferred embodiment, each BMC 1 is specifically configured to acquire power data of a power supply connected thereto and fan data of a fan, and control the rotation speed of the fan according to temperature data of the multi-node server.
Specifically, when connected to a power supply and a fan, BMC 1 may monitor the power supply and the fan, for example, obtain power supply data of the power supply and fan data of the fan, and also obtain temperature data collected by a temperature sensor in the multi-node server, and control the rotation speed of the fan according to the current temperature data in the multi-node server.
In addition, the power data may include one or more combinations of input voltage, output voltage, fault status, and the like of the power source, and the fan data may include one or more combinations of rotation speed, fault information, and the like.
It can be seen that this approach simplifies the CMC multi-level architecture, but still retains centralized management of power supplies and fans from a management function, reducing complexity and cost.
As a preferred embodiment, each BMC 1 is further configured to display the power data and/or the fan data through the display module when receiving the power data query instruction and/or the fan data query instruction.
Specifically, in the application, the BMC 1 located in each node of the multi-node server can monitor the power supply and the fan, when a user wants to query the power supply data and/or the fan data, the BMC 1 can send a power supply data query instruction and/or a fan data query instruction to the BMC 1, and the BMC 1 displays the power supply data and/or the fan data through the display module when receiving the power supply data query instruction and/or the fan data query instruction.
As a preferred embodiment, the processor 3 is further configured to control the BMC 1 corresponding to the BMC 1 control instruction to be connected to the power supply and the fan when receiving the BMC 1 control instruction.
In this application, processor 3 can be connected with power and fan according to each BMC 1 of priority round robin control that predetermines automatically through selection module 2 to realize BMC 1 to the control of power and fan, in addition, this management system can also control appointed BMC 1 to monitor power and fan according to user's instruction, specifically, when BMC 1 control instruction that user sent is received to processor 3, can control BMC 1 that corresponds with BMC 1 control instruction and be connected with power and fan, so that this BMC 1 monitors power and fan. In practical applications, after the BMC 1 finishes monitoring the power and the fan, the processor 3 may continue to start the sequential control from the next BMC 1 of the BMC 1 according to a preset priority.
Therefore, the management system can automatically control the BMC 1 to monitor the power supply and the fan, can control the appointed BMC 1 to monitor the power supply and the fan according to the instruction of the user, and is flexible in control mode.
As a preferred embodiment, the processor 3 is further connected to a plurality of BMCs 1; the processor 3 is also used for sending a notification instruction to the BMC 1 connected with the power supply and the fan after controlling the BMC 1 to be connected with the power supply and the fan, so that the BMC 1 monitors the power supply and the fan connected with the BMC 1 after receiving the notification instruction.
Considering that the processor 3 may control each BMC 1 to monitor the power and the fan according to a preset priority cycle, in order to improve control reliability, in this application, after controlling one BMC 1 to be connected with the power and the fan, the processor 3 may send a notification instruction to the BMC 1, so that the BMC 1 may monitor the power and the fan connected thereto after receiving the notification instruction. The notification instruction may be specifically an interrupt signal. Therefore, the control mode can realize that each BMC 1 can reliably and stably monitor the power supply and the fan.
As a preferred embodiment, the selection module 2 is an I2C MUX.
In the present application, the selection module 2 may be, but is not limited to, an I2C MUX, and has the advantages of simple structure, low power consumption, and strong interference resistance.
As a preferred embodiment, the processor 3 is further connected to a plurality of BMCs 1; through selecting module 2 and according to presetting priority round robin control each BMC 1 and power and fan be connected, include:
when a monitoring completion message sent by the BMC 1 which monitors the power supply and the fan at present is received, the selection module 2 controls the next BMC 1 to be connected with the power supply and the fan in a round-robin manner according to the preset priority.
In order to ensure that each BMC 1 can complete monitoring of the power supply and the fan completely, in the application, the BMC 1 sends a monitoring completion message to the processor 3 after monitoring of the power supply and the fan is completed, and the processor 3 controls the next BMC 1 to be connected with the power supply and the fan according to a preset priority cycle through the selection module 2 when receiving the monitoring completion message sent by the BMC 1.
In practical application, when monitoring the power supply and the fan, the BMC 1 may send a power supply data acquisition instruction to the power supply and a fan data acquisition instruction to the fan, and after the power supply data sent by the power supply and the fan data sent by the fan are acquired and the rotation speed of the fan is controlled based on the temperature of the multi-node server in the process, the current monitoring may be considered to be completed.
In a preferred embodiment, each BMC 1 is connected to the processor 3 through a GPIO interface.
In the application, each BMC 1 is connected with the processor 3 through a GPIO interface, so that the real-time performance of communication between each BMC 1 and the processor 3 is guaranteed. Of course, each BMC 1 and the processor 3 may be connected through another interface, and the present application is not limited thereto.
As a preferred embodiment, the selection module 2 controls each BMC 1 to be connected with a power supply and a fan according to a preset priority cycle, and includes:
when the current BMC 1 is controlled to be connected with the power supply and the fan, timing is started, and when the timing reaches a preset time length, the next BMC 1 is controlled to be connected with the power supply and the fan through the selection module 2 according to a preset priority round-robin sequence.
In order to simplify the control process, in the application, the processor 3 starts timing when controlling one BMC 1 to be connected with the power supply and the fan, determines that the BMC 1 has finished monitoring the power supply and the fan when the timing reaches a preset time length, and controls the next BMC 1 to be connected with the power supply and the fan according to a preset priority round-robin sequence. The preset time duration can be set according to the time duration of monitoring the fan and the power supply once by the BMC 1 in practical application.
Therefore, the method realizes the round-robin control of the fan and the power supply by each BMC 1, and the control process is simple and reliable.
The invention also provides a multi-node server, which comprises a power supply, a fan and the management system of the multi-node server, wherein the management system of the multi-node server is respectively connected with the power supply and the fan.
For the introduction of the management system in a multi-node server provided by the present invention, please refer to the above-mentioned embodiment of the management system, and the present application is not limited specifically herein.
It should be noted that, in the present specification, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (10)

1. A management system of a multi-node server, the multi-node server comprising a power supply and a fan, the management system of the multi-node server comprising:
the BMC is respectively arranged in each node of the multi-node server and is used for monitoring a power supply and a fan connected with the BMC;
a selection module respectively connected with the power supply, the fan and the plurality of BMCs;
and the processor is connected with the selection module and used for controlling each BMC to be connected with the power supply and the fan according to the round robin of the preset priority through the selection module.
2. The management system of claim 1, wherein each BMC is configured to obtain power data of a power supply connected to the BMC and fan data of a fan, and to control a rotational speed of the fan according to the temperature data of the multi-node server.
3. The management system of a multi-node server of claim 2, wherein each BMC is further configured to display the power data and/or the fan data through a display module when receiving a power data query command and/or a fan data query command.
4. The management system of a multi-node server of claim 1, wherein the processor is further configured to control a BMC corresponding to the BMC control instruction to be connected to the power supply and the fan upon receiving the BMC control instruction.
5. The management system of a multi-node server of claim 1, wherein said processor is further connected to a plurality of said BMCs; the processor is further configured to send a notification instruction to the BMC connected to the power supply and the fan after controlling the BMC to be connected to the power supply and the fan, so that the BMC monitors the power supply and the fan connected to the BMC after receiving the notification instruction.
6. The management system of a multinode server as claimed in claim 1, wherein said selection module is an I2C MUX.
7. The management system of a multi-node server of any of claims 1 to 6, wherein the processor is further connected to a plurality of BMCs; controlling each BMC to be connected with the power supply and the fan according to the round robin of the preset priority through the selection module, wherein the round robin comprises the following steps:
and when a monitoring completion message sent by the BMC which monitors the power supply and the fan at present is received, the selection module controls the next BMC to be connected with the power supply and the fan in a round-robin manner according to the preset priority.
8. The management system of a multi-node server of claim 7, wherein each BMC is connected to the processor via a GPIO interface.
9. The management system of a multi-node server as claimed in any one of claims 1 to 6, wherein the controlling each BMC to be connected to the power supply and the fan according to a preset priority cycle by the selection module comprises:
and starting timing when the current BMC is controlled to be connected with the power supply and the fan, and controlling the next BMC to be connected with the power supply and the fan according to the preset priority round-robin sequence through the selection module when the timing reaches a preset time length.
10. A multi-node server comprising a power supply and a fan, and further comprising a management system of the multi-node server according to any one of claims 1 to 9, the management system of the multi-node server being connected to the power supply and the fan, respectively.
CN202210394747.4A 2022-04-15 2022-04-15 Management system of multi-node server and multi-node server Pending CN114816911A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210394747.4A CN114816911A (en) 2022-04-15 2022-04-15 Management system of multi-node server and multi-node server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210394747.4A CN114816911A (en) 2022-04-15 2022-04-15 Management system of multi-node server and multi-node server

Publications (1)

Publication Number Publication Date
CN114816911A true CN114816911A (en) 2022-07-29

Family

ID=82537353

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210394747.4A Pending CN114816911A (en) 2022-04-15 2022-04-15 Management system of multi-node server and multi-node server

Country Status (1)

Country Link
CN (1) CN114816911A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN120560474A (en) * 2025-07-30 2025-08-29 苏州元脑智能科技有限公司 Multi-node server management system and multi-node server

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109723666A (en) * 2018-11-26 2019-05-07 曙光信息产业股份有限公司 Fan control device and method
CN111459751A (en) * 2020-03-20 2020-07-28 苏州浪潮智能科技有限公司 High-end server management system
CN113608607A (en) * 2021-09-30 2021-11-05 阿里云计算有限公司 Multi-node server control method and multi-node server

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109723666A (en) * 2018-11-26 2019-05-07 曙光信息产业股份有限公司 Fan control device and method
CN111459751A (en) * 2020-03-20 2020-07-28 苏州浪潮智能科技有限公司 High-end server management system
CN113608607A (en) * 2021-09-30 2021-11-05 阿里云计算有限公司 Multi-node server control method and multi-node server

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN120560474A (en) * 2025-07-30 2025-08-29 苏州元脑智能科技有限公司 Multi-node server management system and multi-node server

Similar Documents

Publication Publication Date Title
US10846160B2 (en) System and method for remote system recovery
US9842003B2 (en) Master baseboard management controller election and replacement sub-system enabling decentralized resource management control
US10853052B1 (en) Peripheral device firmware update using rest over IPMI interface firmware shell utility
CN117555760B (en) Server monitoring method and device, substrate controller and embedded system
US20120119577A1 (en) Smart Power Sockets, Boards, and Plugs
CN109617712B (en) Server management method, device, server and readable storage medium
CN102035862A (en) Configuration node fault transfer method and system in SVC cluster
JP7757432B2 (en) Method, device and electronic device for generating hardware interface signals
CN111949320B (en) Methods, systems and servers for providing system data
US7562247B2 (en) Providing independent clock failover for scalable blade servers
WO2015184866A1 (en) Monitoring method and device, and first monitoring unit in power source system
US20180357193A1 (en) Computing device and operation method
CN111488048A (en) Power supply management method and system
CN112462661B (en) Industrial data acquisition controller and control method thereof
US20250139021A1 (en) Computing device and data obtaining method
US9983867B2 (en) Software image provisioning and charging system
CN105549706A (en) Method, device and system for warmly restarting server
CN114816911A (en) Management system of multi-node server and multi-node server
CN113986000A (en) Server power-on method, system, device and medium
CN108989123A (en) A kind of server configuration method and device
CN115756616A (en) Basic Input Output System (BIOS) configuration item configuration method and server
US12189968B1 (en) Recovering mechanical energy from data storage devices
CN118331413B (en) Server management system, method, program product and medium
US10402357B1 (en) Systems and methods for group manager based peer communication
TWI881478B (en) System power synchronization control method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Country or region after: China

Address after: Building 9, No.1, guanpu Road, Guoxiang street, Wuzhong Economic Development Zone, Wuzhong District, Suzhou City, Jiangsu Province

Applicant after: Suzhou Yuannao Intelligent Technology Co.,Ltd.

Address before: Building 9, No.1, guanpu Road, Guoxiang street, Wuzhong Economic Development Zone, Wuzhong District, Suzhou City, Jiangsu Province

Applicant before: SUZHOU LANGCHAO INTELLIGENT TECHNOLOGY Co.,Ltd.

Country or region before: China