[go: up one dir, main page]

CN119484326A - A method for monitoring heterogeneous servers - Google Patents

A method for monitoring heterogeneous servers Download PDF

Info

Publication number
CN119484326A
CN119484326A CN202311005191.6A CN202311005191A CN119484326A CN 119484326 A CN119484326 A CN 119484326A CN 202311005191 A CN202311005191 A CN 202311005191A CN 119484326 A CN119484326 A CN 119484326A
Authority
CN
China
Prior art keywords
node
monitoring
server
monitoring node
control node
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311005191.6A
Other languages
Chinese (zh)
Inventor
王云飞
杨勇胜
付豪
曹纪伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Life Property and Casualty Insurance Co Ltd
Original Assignee
China Life Property and Casualty Insurance Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Life Property and Casualty Insurance Co Ltd filed Critical China Life Property and Casualty Insurance Co Ltd
Priority to CN202311005191.6A priority Critical patent/CN119484326A/en
Publication of CN119484326A publication Critical patent/CN119484326A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/02Standardisation; Integration
    • H04L41/0246Exchanging or transporting network management information using the Internet; Embedding network management web servers in network elements; Web-services-based protocols
    • H04L41/0253Exchanging or transporting network management information using the Internet; Embedding network management web servers in network elements; Web-services-based protocols using browsers or web-pages for accessing management information
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/04Processing captured monitoring data, e.g. for logfile generation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0805Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
    • H04L43/0817Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/50Testing arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/02Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/04Protocols for data compression, e.g. ROHC
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/16Implementation or adaptation of Internet protocol [IP], of transmission control protocol [TCP] or of user datagram protocol [UDP]
    • H04L69/161Implementation details of TCP/IP or UDP/IP stack architecture; Specification of modified or new header fields
    • H04L69/162Implementation details of TCP/IP or UDP/IP stack architecture; Specification of modified or new header fields involving adaptations of sockets based mechanisms

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer Security & Cryptography (AREA)
  • Environmental & Geological Engineering (AREA)
  • Data Mining & Analysis (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention discloses a method for supervising a heterogeneous server, and relates to the technical field of computers. The method comprises the steps of collecting state information of a heterogeneous server, transmitting the state information to a monitoring node, transmitting the state information collected by the monitoring node to a main control node through an agent, transmitting a task to the monitoring node through the main control node when transmitting the task, and transmitting the task transmitted by the monitoring node to the heterogeneous server to execute the task after the task transmitted by the monitoring node passes detection. According to the method and the device, operation and maintenance personnel are prevented from manually inputting or operation and maintenance management and operation and maintenance operation are required to be carried out in different monitoring operation and maintenance software, and the accuracy and the efficiency of operation and maintenance are greatly improved.

Description

Heterogeneous server supervision method
Technical Field
The invention relates to the technical field of computers, in particular to a method for supervising a heterogeneous server.
Background
With the deep development of digital construction, the number of intranet servers is increased, the types and versions of installed operating systems are not uniform, and with the online of new servers, the new servers must be brought into operation and maintenance management in time, so that heterogeneous servers are managed uniformly, and a monitoring management system and method of the heterogeneous servers are required to be realized.
In the process of implementing the present invention, the inventor finds that at least the following problems exist in the prior art:
The daily operation and maintenance work of the server which is incorporated into the operation and maintenance management generally comprises the steps of inputting hardware information and software information of the server, monitoring server resources, collecting the activity conditions of server processes and ports, using the server network, issuing operation and maintenance instructions in batches, wherein the work usually requires operation and maintenance personnel to input the operation and maintenance instructions manually or requires operation and maintenance management and operation and maintenance operations in different monitoring operation and maintenance software.
Disclosure of Invention
The execution main body corresponding to the invention comprises a main control node and a monitoring node. The master control node is used for controlling the operation of the whole system, managing the heterogeneous servers, configuring and issuing commands. The monitoring node is used for monitoring the state and the running condition of the heterogeneous server, and also comprises the steps of timing task execution and expanding task execution. In addition, the monitoring node is also suitable for different CPU architectures and different operating systems. The message communication mode between the main control node and the monitoring node adopts websocket protocol, and the message can adopt protobuff serialization protocol and carry out confusion encryption. The real-time performance, the safety and the high efficiency of the message are ensured. The method can solve the monitoring and management of the server and the application, and the flexible and extensible task execution can complete some customized monitoring tasks.
To achieve the above object, according to one aspect of the embodiments of the present invention, there is provided a method for supervising a heterogeneous server, including:
collecting state information of a heterogeneous server, and transmitting the state information to a monitoring node;
Transmitting the state information acquired by the monitoring node to a main control node through an agent;
When issuing a task, issuing the task to the monitoring node through a main control node;
And after the task issued by the monitoring node passes the detection, issuing the task to the heterogeneous server to execute the task.
Optionally, a security mechanism is adopted when issuing the task, including:
The master control node adopts a security mechanism to authenticate the monitoring node to join the master control node;
the master control node communicates with the monitoring node through compression and encryption.
Optionally, the protocol between the master control node and the monitoring node is websocket, and long-time connection is used for bidirectional communication;
when the main control node and the monitoring node are connected for the first time in an approach, the monitoring node provides basic information of a key, a version and a server;
the basic information of the server comprises an MAC address, a framework, an operating system, memory capacity and disk capacity.
Optionally, the collecting the status information of the heterogeneous server, before transmitting the status information to the monitoring node, includes:
The method comprises the steps of setting a database for recording server data, and collecting monitoring information by a recording server, wherein the monitoring information comprises hardware data, task information, important log information and dynamic configuration;
and outputting the monitoring information into software for external data processing through a stream data pipeline.
Optionally, the collecting the status information of the heterogeneous server, before transmitting the status information to the monitoring node, includes:
receiving a scheduling instruction through a GRPC interface, analyzing a message body which is packaged, and obtaining the instruction;
writing the scheduling instruction into a database for state preservation;
and dispatching to a server where the corresponding monitoring node is located according to the content of the instruction message.
Optionally, the collecting status information of the heterogeneous server, transmitting the status information to a monitoring node, further includes:
setting a message queue for the main control node, and consuming the messages in the message queue in real time;
Taking a message queue as an entrance, wherein an instruction in the message queue is consistent with the instruction message;
and sending the message to a designated monitoring node.
Optionally, acquiring the hardware index of the control node at fixed time;
acquiring the hardware index of the main control node at fixed time;
The hardware index comprises starting time, running time length and GC times.
Optionally, deploying monitoring node software to each heterogeneous server to be monitored by using a super administrator account of the server, so as to obtain all management authorities of the server;
after the monitoring node is started, when connection is established with the control node, encryption is performed by using the MD5, wherein the monitoring node transmits a secret key built in when encrypting data.
Compressing and packaging the encrypted data by protobuff;
And establishing connection between the monitoring node and the control node by using the data packet which is compressed and packaged, and reporting the data packet to the control node after compression and encryption are completed.
Optionally, when transmitting data, the monitoring node transmits a Ping message to the control node, and waits for the control node to reply with the pong message within a certain time for the control node to manage the heartbeat of the monitoring node.
According to another aspect of an embodiment of the present invention, there is provided a heterogeneous server supervision electronic device, including:
One or more processors;
storage means for storing one or more programs,
The one or more programs, when executed by the one or more processors, cause the one or more processors to implement a method of heterogeneous server supervision provided by the present invention.
According to yet another aspect of an embodiment of the present invention, there is provided a computer readable medium having stored thereon a computer program which when executed by a processor implements a method of heterogeneous server supervision provided by the present invention.
Further effects of the above-described non-conventional alternatives are described below in connection with the embodiments.
Drawings
The drawings are included to provide a better understanding of the invention and are not to be construed as unduly limiting the invention. Wherein:
FIG. 1 is a schematic diagram of a system according to the present invention.
FIG. 2 is a schematic diagram of the main flow of a method of heterogeneous server supervision according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of a control node system deployment according to an embodiment of the present invention.
Fig. 4 is a schematic diagram of data compression during implementation of a monitoring node according to a second embodiment of the present invention.
FIG. 5 is an exemplary system architecture diagram in which embodiments of the present invention may be applied;
Fig. 6 is a schematic diagram of a computer system suitable for use in implementing an embodiment of the invention.
Detailed Description
Exemplary embodiments of the present invention will now be described with reference to the accompanying drawings, in which various details of the embodiments of the present invention are included to facilitate understanding, and are to be considered merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
FIG. 1 is a schematic diagram of a system according to the present invention.
The system is divided into two components, namely a monitoring node and a main control node. The monitoring node is responsible for collecting monitoring data of the heterogeneous server and uploading the data to the main control node, and the main control node is responsible for processing the monitoring data and managing the heterogeneous server. The service of the master control node is distributed architecture. And flexibly expanding the main control nodes according to the number of the accessed monitoring nodes. Wherein, a proxy device can be arranged between the main control node and the monitoring node.
Fig. 2 is a schematic diagram of the main flow of a method for heterogeneous server supervision, as shown in fig. 2, and as shown in fig. 2, including:
Step S201, collecting state information of heterogeneous servers, and transmitting the state information to a monitoring node;
step S202, transmitting the state information acquired by the monitoring node to a main control node through an agent;
Step S203, when issuing a task, issuing the task to the monitoring node through the master control node;
And step S204, after the task issued by the monitoring node passes the detection, issuing the task to the heterogeneous server to execute the task.
In the present invention, monitoring for heterogeneous servers includes the following:
The monitoring node acquires the state information of the heterogeneous server through the monitoring agent program, and the monitoring node information acquisition function is to acquire basic information and server state information.
After the monitoring node is started, acquiring basic information to acquire hardware information of the service includes, but is not limited to, information such as main board information, BIOS information, CPU information, memory slot information and container information.
The server state information collection comprises, but is not limited to, the information of an arp cache of a system, information under an analysis system/proc/net/arp, a Linux kernel block device layer stack obtained through analysis, information analysis under an analysis system/sys/fd/bcache/and the like, and additionally, the information of the number of network cards which are already configured and in an active state, cpu use details, the total number of UDP queues rx_queue and tx_queue and the like need to be obtained.
Obtaining more than 4.2 Linux kernel through/proc/net/udp and/proc/net/upd 6, monitoring all consumed memory under user mode, system average load data, disk I/O service condition data, memory consumption data, network tracking and recording connection state, generating a new connection record item for each packet passing through network stack by Linux. The data acquisition is only carried out under the condition of/pro/sys/net/netfilter/existence, and the information of the internet access and the statistical information under the system file/proc/stat comprise the system starting time, the process information, the process interrupt information and the system virtual memory statistical information. Statistical data related to IP, TCP, UDP and ICMP protocols are obtained and collected through analysis under/proc/vmstat, aggregation process statistical information is obtained and collected through analysis under system files/proc/net/netstat, and system disk use conditions, such as disk use rate, are obtained and collected through analysis under/proc. The monitoring data is transmitted to the main control node through the network and stored in the database. To ensure data integrity and security, the monitoring node encrypts and compresses the data.
Managing heterogeneous servers:
and the master control node acquires the state of the monitoring heterogeneous server in real time by collecting the monitoring data, and executes management operation. The operator may send commands, such as start, stop, restart, configure, auto upgrade, to the master node via the interface.
Issuing a task:
In order to monitor the scalability of the functionality, the master node needs to perform task scheduling on the heterogeneous servers. The system command, the script command and the timing task can be transmitted to the monitoring node through the interface of the main control end, the monitoring node can be used for backing up data, cleaning logs and the like, and meanwhile, the monitoring node can also detect whether the transmitted command is a high-risk command or not, and otherwise, execution is refused.
Command execution, script execution functions. And the monitoring node receives the execution message sent by the main control node, executes the command or the script after detecting the high-risk command, and asynchronously reports the data output by the control console to the main control node.
And the timing task function is that a set of timing task manager is maintained in the monitoring node, the timing task manager is realized based on a Map container, can dynamically add and delete the timing tasks, and also provides the function of inquiring the timing task currently running. The timing expression of the timing task supports a standard linux timing expression.
Regarding security assurance:
In order to ensure the security of the system, the master control node needs to adopt a security mechanism, the monitoring node needs to authenticate after joining the master control node, and all message communication of the master control node needs to be compressed and encrypted. The protocol adopted by the main control node and the monitoring node is websocket, and a long-time connection and two-way communication mechanism is adopted. When Websocket is connected for the first time, the key, version and basic information of a server of a monitoring node need to be provided, wherein the basic information comprises an MAC address, a framework, an operating system, memory capacity, disk capacity and the like.
Adaptation with respect to heterogeneous servers
To accommodate different types of heterogeneous servers, the monitoring node needs to support a variety of operating system and CPU architectures, such as CentOS, ubuntu, windows, ARM, x, 86, etc. Different operating system monitoring node programs have been adapted to different operating systems during the packaging process. Different operating systems or architectures deploy monitoring nodes corresponding to different adapted versions. The control node program is mainly adapted to linux centos version.
Self-health adjustment
As the monitoring node software, the CPU, the memory and the disk occupied by the software are optimized to be minimum, and the task execution frequency of the monitoring node is regulated according to the system resources occupied by other programs of the system in the running process, so that the power consumption is reduced.
The following technical effects can be achieved by adopting the embodiment of the invention:
1. the system can monitor the state and the service condition of the heterogeneous server, help operation and maintenance personnel to better know the work load of the server, further carry out optimal configuration on the server, and improve the utilization rate and the efficiency of the server.
2. The system can automatically monitor and manage heterogeneous servers, and lighten the workload of operation and maintenance personnel, thereby improving the working efficiency and reducing the management cost.
3. The system can monitor the state of the server in real time, and alarm and notice when the server fails or is abnormal, so that operation and maintenance personnel can take measures in time, and the influence of the server failure on the system is avoided.
4. The system can monitor the data transmission and storage conditions of the server, find abnormal behaviors and perform early warning and processing, so that the safety and reliability of the data are improved.
5. The system can monitor and manage different types of servers, including different manufacturers, different operating systems, different processors and the like, and supports centralized management in the heterogeneous server environment.
Detailed description of the preferred embodiments,
FIG. 3 is a schematic diagram of a control node system deployment according to an embodiment of the present invention. As shown in fig. 3:
The control node system deployment comprises a data storage module, a scheduling module, a queue module and a message processing module.
And the data storage module is used for integrating mysql to record information such as hardware data, task information, important log information, dynamic configuration and the like acquired by the server and integrating InfluxDB to output monitoring information to software for external data processing through a stream data pipeline.
The scheduling module externally promotes the GRPC interface, for example, receives an instruction through the GRPC interface, analyzes a message body packaged by the protobuf, acquires the instruction, writes the instruction into mysql, stores the state, and schedules the server to which the corresponding monitoring node is located according to the content of the instruction message.
And (2) a queue module, in order to release the coupling degree with an external system, a control node integrates a message queue, consumes the message in the RabbitMQ in real time, takes the message queue as an entrance, and sends the message to a designated monitoring node through a scheduling module, wherein the message is consistent with the instruction message analyzed by the GRPC in the step (1).
And the message processing module is used for initializing websocket server service, initializing and maintaining a certain amount of working cooperation to process asynchronous messages.
When the self-health check module is started, a protocol is started to acquire the internal memory used by the control node per se, the CPU and disk consumption, and the self-health check module also comprises some indexes such as starting time, running time, GC times and the like, and meanwhile, the internal memory of the basic current server, the CPU and disk conditions are acquired, and all data indexes are output to InfluxDb.
When the message returned by the monitoring node is received, all the received messages are put into an internal memory queue maintained internally to be asynchronously processed, and the monitoring node and the progress of the monitoring node are not blocked.
The protocol of asynchronous processing reads the information received by the memory queue according to the writing sequence, firstly judges the type of the information, namely, the information of the monitoring class is decrypted, encrypted and compressed and then output to InfluxDb as monitoring data to be used as other platforms. And the task type message records the state into mysql according to the result of judging the execution of the task type message, and if the task message has the address of the http protocol of the corresponding callback, the callback address is called after the task is completed.
Second embodiment,
Fig. 4 is a schematic diagram of data compression during implementation of a monitoring node according to a second embodiment of the present invention. As shown in fig. 4, regarding the implementation of the monitoring node:
Firstly, a ROOT user of a server needs to be used for deploying monitoring node software to each heterogeneous server to be monitored, so as to acquire all management authorities of the server.
After the monitoring node is started, when connection is established with the control node for the first time, the MD5 of the monitoring node software itself needs to be provided, and a secret key and a version of the monitoring node software are built in when the monitoring node software is packaged, which are used for preventing the monitoring node software from being tampered maliciously.
And establishing connection with the control node through websocket, compressing and encrypting, and reporting to the control node. The message body is packaged by protobuff, and in order to reduce the byte size of the message body and improve the efficiency of serialization and deserialization, the byte streams of all the message bodies need to be subjected to a custom confusion encryption algorithm through built-in keys, and the zlib is used for further data compression after encryption is completed.
And starting a background cooperative distance, timing a Ping message to the control node, and waiting for the control node to reply a pong message within a certain time. If the pong message is not received within a specified time, indicating that the connection may have been interrupted, a re-establishment of the connection is taken.
After the software information is checked to be complete and authenticated, policy configuration parameters are obtained from the control node, such as monitoring items to be monitored, including whether to start monitoring memory usage, hard disk usage or network traffic, etc., default monitoring frequency, default timing tasks, etc.
Relevant hardware information of the server is obtained at one time through a basic information acquisition function, wherein the relevant hardware information comprises main board information, BIOS information, CPU information, memory slot information, container information, MAC address and the like. Reporting to the control node by websocket mode, wherein all information is also encrypted and compressed.
Starting a timing task with the default 30 second frequency of the background cooperative program, monitoring data such as a CPU, a memory, a disk, network indexes and the like, and reporting monitoring information to a control node through a websocket connection of which connection is established.
The timed task management module is started, wherein the execution time of the timed task module is a crontab time format based on the linux itself.
The timing module adds the default extension task to be executed and executes the default extension task within the time specified by the expression.
After the timing task is executed, the result is reported to the control node in a websocket mode.
Fig. 5 illustrates an exemplary system architecture 500 in which a heterogeneous server administration method or heterogeneous server administration apparatus of an embodiment of the present invention may be applied.
As shown in fig. 5, the system architecture 500 may include terminal devices 501, 502, 503, a network 504, and a server 505. The network 504 is used as a medium to provide communication links between the terminal devices 501, 502, 503 and the server 505. The network 504 may include various connection types, such as wired, wireless communication links, or fiber optic cables, among others.
A user may interact with the server 505 via the network 504 using the terminal devices 501, 502, 503 to receive or send messages or the like. Various communication client applications may be installed on the terminal devices 501, 502, 503, such as shopping class applications, web browser applications, search class applications, instant messaging tools, mailbox clients, social platform software, etc. (by way of example only).
The terminal devices 501, 502, 503 may be a variety of electronic devices having a display screen and supporting web browsing, including but not limited to smartphones, tablets, laptop and desktop computers, and the like.
The server 505 may be a server providing various services, such as a background management server (by way of example only) providing support for shopping-type websites browsed by users using the terminal devices 501, 502, 503. The background management server may analyze and process the received data such as the product information query request, and feedback the processing result (e.g., the target push information, the product information—only an example) to the terminal device.
It should be noted that, in the embodiment of the present invention, the heterogeneous server supervision method is generally executed by the server 505, and accordingly, the heterogeneous server supervision device is generally disposed in the server 505.
It should be understood that the number of terminal devices, networks and servers in fig. 5 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation.
Referring now to FIG. 6, there is illustrated a schematic diagram of a computer system 600 suitable for use in implementing an embodiment of the present invention. The terminal device shown in fig. 6 is only an example, and should not impose any limitation on the functions and the scope of use of the embodiment of the present invention.
As shown in fig. 6, the computer system 600 includes a Central Processing Unit (CPU) 601, which can perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM) 602 or a program loaded from a storage section 608 into a Random Access Memory (RAM) 603. In the RAM 603, various programs and data required for the operation of the system 600 are also stored. The CPU 601, ROM 602, and RAM 603 are connected to each other through a bus 604. An input/output (I/O) interface 605 is also connected to bus 604.
Connected to the I/O interface 605 are an input section 606 including a keyboard, a mouse, and the like, an output section 607 including a Cathode Ray Tube (CRT), a Liquid Crystal Display (LCD), and the like, a speaker, and the like, a storage section 608 including a hard disk, and the like, and a communication section 609 including a network interface card such as a LAN card, a modem, and the like. The communication section 609 performs communication processing via a network such as the internet. The drive 610 is also connected to the I/O interface 605 as needed. Removable media 611 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is installed as needed on drive 610 so that a computer program read therefrom is installed as needed into storage section 608.
In particular, according to embodiments of the present disclosure, the processes described above with reference to flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program embodied on a computer readable medium, the computer program comprising program code for performing the method shown in the flow chart. In such an embodiment, the computer program may be downloaded and installed from a network through the communication portion 609, and/or installed from the removable medium 611. The above-described functions defined in the system of the present invention are performed when the computer program is executed by a Central Processing Unit (CPU) 601.
The computer readable medium shown in the present invention may be a computer readable signal medium or a computer readable storage medium, or any combination of the two. The computer readable storage medium can be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples of a computer-readable storage medium may include, but are not limited to, an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In the present invention, however, the computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, with the computer-readable program code embodied therein. Such a propagated data signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination of the foregoing. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams or flowchart illustration, and combinations of blocks in the block diagrams or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The modules involved in the embodiments of the present invention may be implemented in software or in hardware. The described modules may also be provided in a processor, for example, a processor may be described as comprising a sending module, an obtaining module, a determining module and a first processing module. The names of these modules do not in some cases limit the module itself, and for example, the transmitting module may also be described as "a module that transmits a picture acquisition request to a connected server".
As a further aspect, the invention also provides a computer readable medium which may be comprised in the device described in the above embodiments or may be present alone without being fitted into the device. The computer readable medium carries one or more programs which, when executed by a device, cause the device to include:
collecting state information of a heterogeneous server, and transmitting the state information to a monitoring node;
Transmitting the state information acquired by the monitoring node to a main control node through an agent;
And when the task is issued, the task is issued to the monitoring node through the master control node, and after the task issued by the monitoring node passes the detection, the task is issued to the heterogeneous server to execute the task.
The above embodiments do not limit the scope of the present invention. It will be apparent to those skilled in the art that various modifications, combinations, sub-combinations and alternatives can occur depending upon design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention should be included in the scope of the present invention.

Claims (11)

1. A method for heterogeneous server supervision, comprising:
collecting state information of a heterogeneous server, and transmitting the state information to a monitoring node;
Transmitting the state information acquired by the monitoring node to a main control node through an agent;
When issuing a task, issuing the task to the monitoring node through a main control node;
And after the task issued by the monitoring node passes the detection, issuing the task to the heterogeneous server to execute the task.
2. The method of claim 1, wherein the issuing a task employs a security mechanism, comprising:
The master control node adopts a security mechanism to authenticate the monitoring node to join the master control node;
the master control node communicates with the monitoring node through compression and encryption.
3. The method of claim 2, wherein the step of determining the position of the substrate comprises,
The protocol between the main control node and the monitoring node is websocket, and long-time connection is utilized for bidirectional communication;
when the main control node and the monitoring node are connected for the first time in an approach, the monitoring node provides basic information of a key, a version and a server;
the basic information of the server comprises an MAC address, a framework, an operating system, memory capacity and disk capacity.
4. The method of claim 1, wherein the collecting status information of the heterogeneous server, before transmitting the status information to the monitoring node, comprises:
The method comprises the steps of setting a database for recording server data, and collecting monitoring information by a recording server, wherein the monitoring information comprises hardware data, task information, important log information and dynamic configuration;
and outputting the monitoring information into software for external data processing through a stream data pipeline.
5. The method of claim 1, wherein the collecting status information of the heterogeneous server, before transmitting the status information to the monitoring node, comprises:
receiving a scheduling instruction through a GRPC interface, analyzing a message body which is packaged, and obtaining the instruction;
writing the scheduling instruction into a database for state preservation;
and dispatching to a server where the corresponding monitoring node is located according to the content of the instruction message.
6. The method of claim 5, wherein the collecting status information of the heterogeneous server, transmitting the status information to a monitoring node, further comprises:
setting a message queue for the main control node, and consuming the messages in the message queue in real time;
Taking a message queue as an entrance, wherein an instruction in the message queue is consistent with the instruction message;
and sending the message to a designated monitoring node.
7. The method of claim 6, wherein the step of providing the first layer comprises,
Acquiring the hardware index of the control node at fixed time;
acquiring the hardware index of the main control node at fixed time;
The hardware index comprises starting time, running time length and GC times.
8. The method of claim 1, wherein the step of determining the position of the substrate comprises,
Deploying monitoring node software to each heterogeneous server to be monitored by using a super administrator account of the server, and acquiring all management authorities of the server;
after the monitoring node is started, when connection is established with the control node, encryption is performed by using the MD5, wherein the monitoring node transmits a secret key built in when encrypting data.
Compressing and packaging the encrypted data by protobuff;
And establishing connection between the monitoring node and the control node by using the data packet which is compressed and packaged, and reporting the data packet to the control node after compression and encryption are completed.
9. The method of claim 8, wherein the step of determining the position of the first electrode is performed,
When data is transmitted, the monitoring node transmits a Ping message to the control node, and waits for the control node to reply the pong message within a certain time for the control node to manage the heartbeat of the monitoring node.
10. An electronic device for heterogeneous server supervision, comprising:
One or more processors;
storage means for storing one or more programs,
When executed by the one or more processors, causes the one or more processors to implement the method of any of claims 1-9.
11. A computer readable medium, on which a computer program is stored, characterized in that the program, when being executed by a processor, implements the method according to any of claims 1-9.
CN202311005191.6A 2023-08-10 2023-08-10 A method for monitoring heterogeneous servers Pending CN119484326A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311005191.6A CN119484326A (en) 2023-08-10 2023-08-10 A method for monitoring heterogeneous servers

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311005191.6A CN119484326A (en) 2023-08-10 2023-08-10 A method for monitoring heterogeneous servers

Publications (1)

Publication Number Publication Date
CN119484326A true CN119484326A (en) 2025-02-18

Family

ID=94582851

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311005191.6A Pending CN119484326A (en) 2023-08-10 2023-08-10 A method for monitoring heterogeneous servers

Country Status (1)

Country Link
CN (1) CN119484326A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN121000765A (en) * 2025-08-25 2025-11-21 北京国御网络安全技术有限公司 An improved distributed scheduling system and method based on gRPC communication mechanism

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN121000765A (en) * 2025-08-25 2025-11-21 北京国御网络安全技术有限公司 An improved distributed scheduling system and method based on gRPC communication mechanism

Similar Documents

Publication Publication Date Title
CN112583898A (en) Business process arranging method and device and readable medium
US12360776B2 (en) Multi-tenant Java agent instrumentation system
US10776245B2 (en) Analyzing physical machine impact on business transaction performance
US20150067146A1 (en) Custom correlation of a distributed business transaction
CN104636678B (en) The method and system of management and control is carried out under a kind of cloud computing environment to terminal device
CN113900834B (en) Data processing method, device, equipment and storage medium based on Internet of things technology
EP3616061B1 (en) Hyper dynamic java management extension
WO2024125106A1 (en) Data transmission method and apparatus, device, and storage medium
CN107809350A (en) The method and apparatus for obtaining HTTP server performance data
CN113760842A (en) Data processing method and device, electronic equipment and storage medium
US9866455B2 (en) Using status inquiry and status response messages to exchange management information
WO2017074471A1 (en) Tracking contention in a distributed business transaction
US20180324063A1 (en) Cloud-based system for device monitoring and control
US10067862B2 (en) Tracking asynchronous entry points for an application
CN110008092A (en) A kind of secure virtual machine monitoring method, device, equipment and readable storage medium storing program for executing
CN119484326A (en) A method for monitoring heterogeneous servers
CN110674172A (en) Data query method and device
US20240073146A1 (en) Transaction-level network policies for online applications
CN112416698A (en) Monitoring system expansion method and device, storage medium and electronic equipment
CN111176782B (en) Online experiment method and device
US10216926B2 (en) Isolation of untrusted code in operating system without isolation capability
US20230376632A1 (en) Protecting confidential information in online applications
CN117950591A (en) Gateway storage management method and device, electronic device, and storage medium
CN113079055B (en) AGV operation data dynamic acquisition method and device
CN115412921A (en) Traffic processing method and system, storage medium and electronic device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication