[go: up one dir, main page]

CN114928529A - Information system and information system fault detection method - Google Patents

Information system and information system fault detection method Download PDF

Info

Publication number
CN114928529A
CN114928529A CN202210870184.1A CN202210870184A CN114928529A CN 114928529 A CN114928529 A CN 114928529A CN 202210870184 A CN202210870184 A CN 202210870184A CN 114928529 A CN114928529 A CN 114928529A
Authority
CN
China
Prior art keywords
module
information
information system
fault
database
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210870184.1A
Other languages
Chinese (zh)
Inventor
廖秋平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Shengwang Information Technology Co ltd
Original Assignee
Guangzhou Shengwang Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Shengwang Information Technology Co ltd filed Critical Guangzhou Shengwang Information Technology Co ltd
Priority to CN202210870184.1A priority Critical patent/CN114928529A/en
Publication of CN114928529A publication Critical patent/CN114928529A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3006Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3024Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a central processing unit [CPU]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3037Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a memory, e.g. virtual memory, cache
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/02Standardisation; Integration
    • H04L41/0213Standardised network management protocols, e.g. simple network management protocol [SNMP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/142Network analysis or design using statistical or mathematical methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computing Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Quality & Reliability (AREA)
  • General Engineering & Computer Science (AREA)
  • Algebra (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Probability & Statistics with Applications (AREA)
  • Pure & Applied Mathematics (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses an information system and a fault detection method thereof, relating to the technical field of fault detection and solving the problem of fault detection of the information system, wherein the information system comprises information system network equipment with identifiable configuration; a troubleshooting module; for receiving a fault detection notification message, receiving a request for selected device state parameters associated with a fault event from an information system network device using a code division multiple access transmission mode; a data collection module; for receiving identifiable configuration information upon initialization of the information system network device; a first database; the data collection module is used for storing the configuration information of the information system network equipment sent by the data collection module; a second database; for storing failure problems and solutions for information system network devices of known history; and a fault detection module based on the association rule. The invention greatly improves the fault detection capability of the information system.

Description

Information system and information system fault detection method
Technical Field
The present invention relates to the field of fault detection, and more particularly, to an information system and a method for detecting a fault in an information system.
Background
Existing information system network management and maintenance techniques typically require manual intervention when an information system fails, and the network manager typically needs to perform an exchange of hardware components or attempt to load updated software to attempt to resolve the problem when isolating the network device in question. However, network administrators almost always need to obtain additional technical information to manage or troubleshoot network device failures. As networks and information technology become more sophisticated in service and support, the efficiency of information system troubleshooting using web-based information sources becomes less and less efficient, as network managers typically need to manually provide a brief description of device type, serial number, software version, and problems in order to provide sufficient search parameters for a web page to generate queries in a client-supported back-end database. Such queries do not take into account more complex interactions that may affect network device performance, including network topology, device configuration, and dynamic parameters that affect network device performance.
Existing information system network technologies cannot preempt network device failures, for example, if the system modules of a network device are insufficient, the reduction of system modules is usually not detected within a sufficient time to take preventive measures, but instead, it may be detected only when the information system network device fails, and therefore, the network user with the failed device bears unnecessary network service loss burden.
Disclosure of Invention
Aiming at the technical defects, the invention discloses an information system and an information system fault detection method, which can extract the running state parameters of information system network equipment, analyze and process the extracted running state parameters and realize the fault detection of the information system.
In order to realize the technical effects, the invention adopts the following technical scheme:
an information system, comprising:
an information system network device having an identifiable configuration; the information system network device comprises a monitoring agent module which is used for identifying configuration information when the information system network device is initialized;
the monitoring agent module is used for identifying the running state parameter information of the information system network equipment, and the running state parameter information of the information system network equipment is the CPU utilization rate and the memory utilization rate of the information system network equipment; the information system network equipment detects at least one state parameter based on a communication network and generates a fault detection notification message; the notification message comprises an identifier of each process executed by the network device when the notification message is generated and the memory module utilization rate of each corresponding identified process;
a troubleshooting module; the device comprises a fault detection module, a fault elimination module and a fault management module, wherein the fault detection module is used for receiving a fault detection notification message, receiving a selected device state parameter which is requested to be associated with a fault event from an information system network device by adopting a code division multiple access transmission mode, and eliminating fault data information;
a data collection module; for receiving identifiable configuration information upon initialization of the information system network device and collecting together selected device state parameters associated with the failure event; the data collection module is executed by an executable software module with XML analysis capability, and information interaction is carried out through an IP network;
a first database; the data collection module is used for storing the configuration information of the information system network equipment sent by the data collection module; the first database is a parallel database, and the parallel database can realize parallelization of database operation processing;
a second database; the fault problem and solution of the information system network equipment with known history are stored;
a fault detection module based on the association rule; means for associating information system network device configuration information retrieved from the first database with selected device state parameters of the failure event to enable failure detection of the network device and storing the configuration information in a second database;
each information system network device comprises a monitoring agent module, and a plurality of monitoring agent modules are connected through a local server; the local server receives the fault detection notification message transmitted by the fault elimination module and performs information interaction with the data collection modules through the IP network; the fault detection module also comprises a client network database, a report module, a notification module and an association rule management module, receives a fault detection report message from the data collection module, retrieves the correlation of the identified configuration information from the client database, and outputs the correlation of the identified configuration information through the report module; and sending a retrieval command to the association rule management module through the central communication module, carrying out association rule data mining on the first database and the fault detection notification message by the association rule management module, sharing the processed result with the second database, feeding back the result to the association rule management module and the central communication module, and outputting a fault detection result through the notification module.
As a further technical solution of the present invention, the fault detection module includes:
an association rule management module; for identifying selected information system network device operational status parameters; the association rule management module adopts a Levy whale-based optimization algorithm to carry out data mining, and detects the correlation between the operation state parameters of the identifiable and selected information system network equipment and the historical fault problem of the information system network equipment in the second database according to the association rule;
a notification module; a client contact for actively sending out a message identifying the network device; the notification module adopts simplex communication to send fault event notification information to a client contact;
a data update module; the system is used for updating the historical fault problem or solution information of the second database; the data updating module adopts a big data model open source tool Flume technical component to acquire the problem of information system faults, and the Flume technical component is used for acquiring changed data generated in the information system in time and finishing data updating of the second database;
a central communication module; the system is used for controlling information interaction of a plurality of data collection modules, and comprises the following steps:
(1) message routing; the system is used for controlling routing messages among the practice module, the equipment problematic answer module, the network event management module, the inventory management module and the association rule management module; the message routing receives the report message sent by the data collection module, and the message routing adopts duplex communication to complete the control command interaction among the practice module, the equipment trouble-shooting answer module, the network event management module, the inventory management module and the association rule management module;
(2) a practice module; the Levy whale-based optimization algorithm flow is used for outputting fault events of the information system network equipment to the client network database module and obtaining fault information processing based on the association rule management module;
(3) a device trouble-solving module; for responding to the fault event report message by obtaining operational status parameters of the information system network device associated with the fault event report message; the equipment problematic answering module sends a command for inquiring a second database to associate historical fault information to the association rule management module through RS232C port serial port communication, and sends an inquiry result to a client network database;
(4) a network event management module; the information system network equipment operation state parameters are used for managing fault events and non-fault events; the network event management module adopts a Hadoop platform to manage fault events, and a Map-Reduce distributed data processor in the Hadoop platform can quickly process running state parameters of information system network equipment of fault events and non-fault events;
(5) an inventory management module; the system comprises a data collection module, a report module and a report module, wherein the data collection module is used for receiving a report request message from information system network equipment through a corresponding data collection module and transmitting the report request message to the report module through a client network database; the inventory management module manages report messages by adopting a HBase column-oriented storage distributed management method, and receives and sends the report messages through RS232C port serial port communication;
the central communication module is respectively connected with the client network database, the association rule management module and the notification module.
As a further technical scheme of the invention, the association rule management module also associates the hardware configuration of the network equipment of the identifiable information system with the storage information and outputs the equipment parameter setting of the network equipment to the notification module.
As a further technical solution of the present invention, the first database further includes a storing step for providing, by the monitoring agent, identifiable network device hardware configuration information to a corresponding one of the data collection modules through the IP network at the time of initialization, to be stored in the first database.
As a further technical solution of the present invention, the second database includes a storing step, after the storing step of the first database, the hardware configuration information that the network device can recognize is detected from the first database and stored in the second database.
An information system fault detection method, comprising the steps of:
(S1) outputting, by the monitoring agent module, operation state parameter information describing the information system network device, and generating at least one fault event report message, the monitoring agent module outputting the fault event report message to the local server;
(S2) the fault elimination module connected with the local server requests to screen the running state parameter of the information system network device in the fault event report message, and the fault elimination module sends the screened running state parameter of the information system network device to the data collection module in an XML form through the IP network;
(S3) the data collection module receives a fault event report message from the IP network through the RS232C port, the fault event report message including the information system network device operation state parameters, the data collection module sends the information system network device operation state parameters to the fault detection module, and the fault detection module is loaded into the report module according to the client network database;
(S4) the fault detection module retrieves normal operation information of the information system network equipment from the network client database, and the association rule management module performs association analysis on the screened operation state parameters and the normal operation information according to an optimized association rule algorithm based on the whale of Leivy to obtain a fault detection processing result;
(S5) the association rule management module sends the fault detection processing result to the central communication module, the central communication module sends the result to the reporting module, and the reporting module outputs the fault detection result.
As a further technical solution of the present invention, in step (S1), the monitoring agent module further actively sends the fault event notification information to the client contact by using simplex communication, including the following steps:
(S11) the customer contacts database receiving the notification information sent by the notification module;
(S12) the customer contact parsing the troubleshooting module through the versed DAC-R26 decoder to obtain the annunciation information, associating the annunciation information with the customer contact database to the ID of the fault information system network device according to the association rule;
(S13) the customer contact database sending customer contact information and fault information for the identified fault information system network device and resolution information;
(S14) the notification module sends the resolution information to the central communication module using duplex communication, and then to the monitoring agent module of the fault information system network device via the central communication module.
As a further technical solution of the present invention, in step (S3), the data collection module receives a fault event report message from the IP network through the RS232C port, where the fault event report message includes information system network device operating status parameters, the data collection module sends the information system network device operating status parameters to the fault detection module, and the fault detection module is loaded into the reporting module through the client network database, including the following steps:
(S31) the information system network device transmitting a fault event report message including the operation state parameters through a Hello protocol at the time of startup;
(S32) receiving, at the local server, a report message from the information system network device, and then storing the relevant operating state parameters of the network device in a local database;
(S33) the local server selecting a data collection module communication connection through the IP network to send a report message to the data collection module through the RS232C port;
(S34) the data collection module sends the report to the message route through RS232C port, the message route sends the report to the inventory management module through RS232C port, the inventory management module manages the report message by HBase distributed management method of column-oriented storage, and updates the client network database;
(S35) the client network database transmits the report message to the report module through the RS232C port.
As a further technical solution of the present invention, in step (S4), the association rule management module performs association analysis on the screened operation state parameters and operation normal information according to an algorithm based on an optimized association rule of a sperm whale, including the following steps:
(S41) suppose to be oneDRandomly generating a dimension in spaceNA population of fault detection information particles, coefficient vectors when the first database is retrieved using LWOADeciding whether to search globally or locally, vectorAValue and parameter ofaThe contact is close to the contact of the two parts,Ais the interval-a,a]When a random value ofaAt the time of enlargementAAlso increased, the parameteraThe update formula is:
Figure 481364DEST_PATH_IMAGE001
(1)
in the formula (1), the first and second groups,tindicating number of iterations, subscriptmaxRepresents a maximum value; LWO, the algorithm is easy to fall into local optimum during iterative training, and a Levy optimization strategy is introduced to obtain a LWO position update formula as follows:
Figure 288783DEST_PATH_IMAGE002
(2)
in the formula (2), the first and second groups,Xindicating the optimal solution for the location of the fault-detection information particles,
Figure 66859DEST_PATH_IMAGE003
a constant representing the shape of a logarithmic spiral,Levy() A Levy optimization strategy function is represented,pit is indicated that the constant parameter is,lrepresents the interval [0,1 ]]The parameters of (a);
(S42) limiting the positions of the fault detection information particles to {0,1} by using LWO algorithm, and encoding the fault detection information particles into a binary sequence by a sigmoid function;
Figure 756466DEST_PATH_IMAGE004
(3)
in the formula (3), T 1 To T 5 Indicating five types of failure detection information particle populations, in commonI 1 ToI 6 Each population of the six items contains different numbers of items, if the item exists, the item is marked as 1, otherwise, the item is marked as 0;I 1 is 11, representsI 1 This term exists inT 1 Associating within the rule and being a antecedent to the rule;I 2 is coded as 10, representsI 2 This term existsT 1 Associating within the rule and being a back-part of the rule;I 3 the code for this term is 01, meaningI 3 This term does not existT 1 Within the association rules, and therefore without having to consider the issue of context, the entire rule is interpreted asI 1I 2 The association rule of (1);
(S43) applying the binary sequence of fault detection information to the multi-target association rule data mining, including the support degreeSConfidence of the imageCMinimum support minSMinimum confidence minCDegree of liftingLAnd a certainty factorCFWherein:
degree of supportSAB) Is represented in the failure detection information setKMiddle item setABPercentages occurring together:
Figure 671333DEST_PATH_IMAGE005
(3)
in the formula (3), P: (AB) RepresentAAndBsimultaneous detection of information sets in a faultKThe number of occurrences; confidence levelCAB) Is represented in item setAAlready existing, item setBProbability of occurrence:
Figure 615149DEST_PATH_IMAGE006
(4)
in the formula (4), P: (A) RepresentADetecting information sets in a faultKThe number of occurrences; minimum support minSAnd minimum confidence minCSatisfy the degree of supportSAnd confidence levelCIn the set of fault detection information to be minedKIn, ifABDegree of support ofSAnd confidence levelCAre all greater than minSAnd minCThen, thenABIs a strong association rule; degree of liftingLAB) A metric representing an evaluation correlation coefficient:
Figure 81902DEST_PATH_IMAGE007
(5)
in the formula (5), whenL>1, indicates the association ruleABIs in a positive correlation with each other,Acan promoteB(iii) occurrence of (a); when in useL<1, then the association rule is negative relative,Awill inhibitBOccurrence of (1);L=1, it means that the predecessor and successor of the association rule are independent from each other; deterministic factorCFRepresents the probability of occurrence of a back-piece in case of occurrence of a front-piece in the evaluation association rule:
Figure 743959DEST_PATH_IMAGE008
(6)
in the formula (6), the first and second groups of the compound,CF>0, then the association ruleABPositive correlation;CF<0, then the association ruleABA negative correlation;CAB)=S(B) Time-pieceCF=0, then the association ruleABThe front piece and the rear piece are mutually independent; correcting some misleading rules in strong association rules through multi-target evaluation to avoid productionA numerically correct but in fact irrelevant association rule of the failure detection information;
(S44) fault detection information mining is carried out through a multi-target association rule, and a fitness function obtained through weighting is as follows:
Figure 462516DEST_PATH_IMAGE009
(7)
in the formula (7), the first and second groups,frepresenting a fitness function byfAnd judging the relevance of the fault detection information so as to output the information system network equipment fault information with the relevance.
The invention has the beneficial and positive effects that:
unlike conventional techniques, the present invention can provide an arrangement of predictive, reactive, and proactive information systems that enables network devices to perform fault detection with minimal economic loss; the present invention enables automatic detection of arrangements of information system network device operations in which problems associated with network devices can be predicted and resolved before a failure is encountered in the network device; the invention enables the operating state parameters of the network equipment to be automatically delivered to the client contact person, so that the monitoring agent module can actively implement corrective measures before the network equipment fails.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without inventive exercise, wherein:
FIG. 1 is a diagram of an information system framework of the present invention;
FIG. 2 is a flow chart of a method for detecting faults in an information system according to the present invention;
FIG. 3 is a diagram of a rule-based fault detection module embodying the present invention;
FIG. 4 is a flow chart of the notification module actively sending a fault event notification message to a client contact in accordance with the present invention;
fig. 5 is a flow chart of a reception report update client network database in a network device according to the present invention.
Detailed Description
The preferred embodiments of the present invention will be described in conjunction with the accompanying drawings, it being understood that the embodiments described herein are merely illustrative and explanatory of the invention, and are not restrictive thereof;
as shown in fig. 1, an information system, comprising:
an information system network device having an identifiable configuration; the information system network device comprises a monitoring agent module which is used for identifying configuration information when the information system network device is initialized;
each information system network device includes a monitoring agent module, for example implemented as a software management module residing within an internal operating system of the information system network device, also referred to as a fault manager, wherein upon initialization of the information system network device, the monitoring agent module provides recognizable configuration information to a local server to communicate the configuration information and associated topology information to a database in a centralized module. The local server, also referred to as a service device, connects the data collection module and the troubleshooting module, which are collections of text files having extensible markup language, XML, based operational commands configured to specify the operation of the application and XML based logical commands for specifying the logic of the application according to the corresponding application. The monitoring agent module also outputs a notification message in response to any hardware change or any software configuration change, enabling the database to retain updated information about the configuration of the corresponding information system network. As described in further detail below, the monitoring agent module is also configured to monitor device operation with respect to monitored parameters, such as Central Processing Unit (CPU) utilization and memory usage that may indicate system module exhaustion, notifying the local server in the event that a corresponding information system network device is about to encounter an information system failure.
A troubleshooting module; the device comprises a fault detection module, a fault elimination module and a fault management module, wherein the fault detection module is used for receiving a fault detection notification message, receiving a selected device state parameter which is requested to be associated with a fault event from an information system network device by adopting a code division multiple access transmission mode, and eliminating fault data information;
a data collection module; for receiving identifiable configuration information upon initialization of the information system network device and collecting together selected device state parameters associated with the failure event; the information system includes four data collection modules for collecting information system network device operational status parameters selected from the troubleshooting modules, each data collection module capable of receiving selected information system network device operational status parameters from within a respective network region, whereby the distributed data collection modules can collect network device information from global clients for centralized processing and analysis. The data collection modules are executed by executable software modules with XML parsing capability, information interaction is carried out through the IP network, and in order to optimize safety, at least one firewall server can be deployed between each data collection module and the IP network, and at least one firewall server can be deployed between each local server and the IP network.
A first database; the data collection module is used for storing the configuration information of the information system network equipment sent by the data collection module; the first database is a parallel database, and the parallel database executes database tasks in parallel through a plurality of processing nodes, so that the performance and the availability of the whole database system are improved, and the parallelization of the operation processing of the database is realized;
a second database; for storing failure problems and solutions for information system network devices of known history;
a fault detection module based on the association rule; means for associating information system network device configuration information retrieved from the first database with selected device state parameters of the failure event to enable failure detection of the network device and storing the configuration information in a second database;
each information system network device comprises a monitoring agent module, and a plurality of monitoring agent modules are connected through a local server; the local server receives the fault detection notification message transmitted by the fault elimination module and performs information interaction with the data collection modules through the IP network; the fault detection module also comprises a client network database, a report module, a notification module and an association rule management module, receives a fault detection report message from the data collection module, retrieves the correlation of the identified configuration information from the client database, and outputs the correlation of the identified configuration information through the report module; and sending a retrieval command to the association rule management module through the central communication module, carrying out association rule data mining on the first database and the fault detection notification message by the association rule management module, sharing the processed result with the second database, feeding back the result to the association rule management module and the central communication module, and outputting a fault detection result through the notification module.
In a specific embodiment, as shown in fig. 2, the fault detection module includes:
an association rule management module; for identifying selected information system network device operational status parameters; the association rule management module adopts a Levy whale-based optimization algorithm to carry out data mining, and detects the correlation between the operation state parameters of the identifiable and selected information system network equipment and the historical fault problem of the information system network equipment in the second database according to the association rule;
a notification module; a client contact for actively sending out a message identifying the network device; the notification module adopts simplex communication to send fault event notification information to a client contact; the customer contact includes a customer contact database configured to store information including a customer ID, a network device ID, detailed device hardware information, a device software version, a list of enabled device software features, an IP address, a username and password, a topology, and a geographic location, a customer contact manager interface, and a customer information center interface. The information stored in the customer network database is organized using a structured definition language, such as an XML markup based network definition language, which enables text based markup to be used to describe the topology, thereby enabling a model to be reconstructed from the customer network database that accurately describes the customer network topology.
The client network database is also configured to store data associated with the detected network events, including prescribed states or state transactions that may be associated with a given information system network device. In particular, the network event management module is configured to store network event related information associated with respective designated information system network devices within the client network database based on network event messages received from the message routes. Thus, upon detection of a prescribed network event affecting an information system network device, the data collection module can retrieve the device state associated with the network event congestion condition and forward the network event related information and related device state attributes to the data collection module for storage by the message routing network event management module.
A data update module; the system is used for updating the historical fault problem or solution information of the second database; the data updating module adopts a big data model open source tool, a Flume technical component for acquiring the information system fault problem, and the Flume technical component is used for acquiring the changed data generated in the information system in time and finishing the data updating of the second database;
a central communication module; the system is used for controlling information interaction of a plurality of data collection modules, and comprises the following steps:
(1) message routing; the system is used for controlling routing messages among the practice module, the equipment problematic answer module, the network event management module, the inventory management module and the association rule management module; message routing is implemented using a message broker of TIBCO software corporation, which receives the report message from the data collection module, typically in the form of an XML tag within the message, which specifies the nature of the message. The message routing adopts duplex communication to complete the control command interaction among the practice module, the equipment trouble-shooting module, the network event management module, the inventory management module and the association rule management module; the message router also provides bidirectional transactions, i.e., messages generated in the failure detection module and sent to the data collection module are forwarded by the message router back to the corresponding data collection module for delivery to the data collection module; depending on the implementation, the message routing may also be a message generated by the notification module. Each module within the central communication module is configured to perform prescribed application operations associated with network management; thus, message routing, upon receipt of a message from a data collection module, forwards the message to the appropriate module for execution.
(2) A practice module; the system comprises a client network database module, a Levy whale optimization algorithm flow, a correlation rule management module and a data processing module, wherein the client network database module is used for receiving a fault event of information system network equipment;
(3) a device trouble-solving module; for responding to the fault event report message by obtaining operational status parameters of the information system network device associated with the fault event report message; the equipment problematic answering module sends a command for inquiring a second database to associate historical fault information to the association rule management module through RS232C port serial port communication, and sends an inquiry result to a client network database;
(4) a network event management module; the information system network equipment operation state parameters are used for managing fault events and non-fault events; the network event management module adopts a Hadoop platform to manage the fault event, and a Map-Reduce distributed data processor in the Hadoop platform can quickly process the running state parameters of the information system network equipment of the fault event and the non-fault event;
(5) an inventory management module; the system comprises a data collection module, a report module and a report module, wherein the data collection module is used for receiving a report request message from information system network equipment through a corresponding data collection module and transmitting the report request message to the report module through a client network database; the inventory management module manages report messages by adopting a HBase column-oriented storage distributed management method, and receives and sends the report messages through RS232C port serial port communication;
in another embodiment, the association rule management module further comprises associating the identifiable information system network device hardware configuration with the stored information, outputting the device parameter settings of the network device to the advertisement module.
In another embodiment, the first database further comprises a storing step for providing recognizable network device hardware configuration information to a corresponding one of the data collection modules by the monitoring agent through the IP network at initialization for storage in the first database.
In another embodiment, the second database includes a storing step of detecting that the network device identifiable hardware configuration information is stored in the second database from the first database after the first database storing step.
In a specific embodiment, as shown in fig. 3, a method for detecting a failure in an information system includes the following steps:
(S1) outputting, by the monitoring agent module, operation state parameter information describing the information system network device, and generating at least one fault event report message, the monitoring agent module outputting the fault event report message to the local server;
(S2) the fault elimination module connected with the local server requests to screen the running state parameter of the information system network device in the fault event report message, and the fault elimination module sends the screened running state parameter of the information system network device to the data collection module in an XML form through the IP network;
(S3) the data collection module receives a fault event report message from the IP network through the RS232C port, the fault event report message including the information system network device operation state parameters, the data collection module sends the information system network device operation state parameters to the fault detection module, and the fault detection module is loaded into the report module according to the client network database;
(S4) the fault detection module retrieves normal operation information of the information system network equipment from the network client database, and the association rule management module performs association analysis on the screened operation state parameters and the normal operation information according to an optimized association rule algorithm based on the whale of Leivy to obtain a fault detection processing result;
(S5) the association rule management module sends the fault detection processing result to the central communication module, the central communication module sends the result to the reporting module, and the reporting module outputs the fault detection result.
In a specific embodiment, as shown in fig. 4, in step (S1), the monitoring agent module further actively sends a fault event notification message to the client contact by using simplex communication, including the following steps:
(S11) the customer contacts database receiving the notification information sent by the notification module;
(S12) the customer contact parsing the troubleshooting module through the versed DAC-R26 decoder to obtain the annunciation information, associating the annunciation information with the customer contact database to the ID of the fault information system network device according to the association rule; in this step, the customer contact periodically parses the troubleshooting database through the versed DAC-R26 decoder to locate updated information related to the last performed correlation and correlates the updated information with the customer network database to identify the failed network device based on the configuration information stored in the customer network database.
(S13) the customer contact database transmitting the customer contact information and the identified fault information of the fault information system network device and the resolution information; in this step, the contact information is retrieved from the customer contact database and the customer contact information and corresponding information identifying the malfunctioning network device and updated resolution information are sent. The notification module sends the fault detection information and the solution information to the network equipment of the information system to be subjected to the fault, and the monitoring agent module can examine the fault detection information and the solution information and acquire any active information required for realizing the fault in the network equipment of the information system.
(S14) the notification module sends the resolution information to the central communication module using duplex communication, and then to the monitoring agent module of the fault information system network device via the central communication module.
In a specific embodiment, as shown in fig. 5, in the step (S3), the data collection module receives a failure event report message from the IP network through the RS232C port, where the failure event report message includes the operation status parameters of the information system network devices, the data collection module sends the operation status parameters of the information system network devices to the failure detection module, and the failure detection module is loaded into the reporting module through the client network database, which includes the following steps:
(S31) the information system network device, upon startup, transmitting a fault event report message including the operation state parameters through a Hello protocol;
(S32) receiving, at the local server, a report message from the information system network device, and then storing relevant operating state parameters of the network device in a local database;
(S33) the local server selecting a data collection module communication connection through the IP network to send a report message to the data collection module through the RS232C port;
(S34) the data collection module sends a report to a message route through the RS232C port, the message route sends the report to the inventory management module through the RS232C port, the inventory management module manages the report message by using the HBase column-oriented storage distributed management method, and updates the client network database;
(S35) the customer network database transmits the report message to the report module through the RS232C port.
In a specific embodiment, in the step (S4), the association rule management module performs association analysis on the screened operation state parameters and the operation normal information according to an association rule optimization algorithm based on the sperm whale, and the method includes the following steps:
(S41) suppose that at oneDRandomly generating a dimension in spaceNWhen the population of fault detection information particles is searched for the first database by adopting LWO, the coefficient vectorADeciding whether to search globally or locally, vectorAValue and parameter ofaThe contact is close to the contact between the two parts,Ais the interval-a,a]When a random value ofaAt the time of enlargementAAnd the parameter is increased accordinglyaThe update formula is:
Figure 41265DEST_PATH_IMAGE010
(1)
in the formula (1), the first and second groups,tindicating number of iterations, subscriptmaxRepresents the maximum value; LWO, the algorithm is easy to fall into local optimum during iterative training, and the position update formula obtained by introducing the Levy optimization strategy LWO is as follows:
Figure 554286DEST_PATH_IMAGE011
(2)
in the formula (2), the first and second groups of the compound,Xindicating the optimal solution for the location of the fault-detection information particles,
Figure 969218DEST_PATH_IMAGE003
a constant representing the shape of the logarithmic spiral,Levy() A Levy optimization strategy function is represented,pthe parameters of the normal state are represented,lrepresents the interval [0,1]The parameters of (1);
(S42) limiting the positions of the fault detection information particles to {0,1} by using LWO algorithm, and encoding the fault detection information particles into a binary sequence by a sigmoid function;
Figure 491466DEST_PATH_IMAGE012
(3)
in the formula (3), T 1 To T 5 Represents five types of fault detection information particle populations, and has totalI 1 ToI 6 Each population of the six items contains different numbers of items, if the item exists, the item is marked as 1, otherwise, the item is marked as 0;I 1 is 11, representsI 1 This term exists inT 1 Associating within the rule and being a antecedent to the rule;I 2 is 10, representsI 2 This term existsT 1 Associating within the rule and being a back-part of the rule;I 3 the code for this term is 01, meaningI 3 This term does not existT 1 Within the association rules, and therefore without having to consider the issue of context, the entire rule is interpreted asI 1I 2 The association rules of (2);
(S43) applying the binary sequence of fault detection information to the multi-target association rule data mining, including the support degreeSConfidence of the imageCMinimum support minSMinimum confidence minCDegree of liftingLAnd a certainty factorCFWherein:
degree of supportSAB) Is represented in the failure detection information setKMiddle item setABPercentage of occurrences taken together:
Figure 924722DEST_PATH_IMAGE013
(3)
in the formula (3), P: (AB) To representAAndBsimultaneous detection of information sets in a faultKThe number of occurrences; confidence levelCAB) Representing in a collection of itemsAAlready existing, item setBProbability of occurrence:
Figure 874223DEST_PATH_IMAGE014
(4)
in the formula (4), P: (A) To representADetecting information sets in a faultKThe number of occurrences; minimum support minSAnd minimum confidence minCSatisfy the degree of supportSAnd confidence levelCLower limit of (1) in the set of fault detection information to be minedKIn, ifABDegree of support ofSAnd confidence levelCAre all greater than minSAnd minCThen, thenABIs a strong association rule; degree of liftingLAB) Metric representing the evaluation correlation coefficient:
Figure 507942DEST_PATH_IMAGE015
(5)
in the formula (5), whenL>1, the association ruleABIs in a positive correlation with each other,Acan promoteBOccurrence of (1); when in useL<1, then the association rule isThe negative number of the correlation is negative,Awill inhibitB(iii) occurrence of (a);Lif =1, it indicates that the predecessor and successor of the association rule are independent from each other; certainty factorCFRepresents the probability of occurrence of a back-piece in case of occurrence of a front-piece in the evaluation association rule:
Figure 833881DEST_PATH_IMAGE016
(6)
in the formula (6), the first and second groups,CF>0, then the association ruleABPositive correlation;CF<0, then the association ruleABNegative correlation;CAB)=S(B) Time-pieceCF=0, then the association ruleABThe front piece and the rear piece are mutually independent; some misleading rules in the strong association rules are corrected through multi-objective evaluation, and the generation of the association rules which are numerically correct but actually irrelevant in fault detection information is avoided;
(S44) fault detection information mining is carried out through a multi-target association rule, and a fitness function obtained through weighting is as follows:
Figure 121643DEST_PATH_IMAGE017
(7)
in the formula (7), the first and second groups,frepresenting a fitness function byfAnd judging the relevance of the fault detection information, thereby outputting the information system network equipment fault information with the relevance.
Although specific embodiments of the present invention have been described above, it will be understood by those skilled in the art that these specific embodiments are merely illustrative and that various omissions, substitutions and changes in the form and details of the methods and systems described above may be made by those skilled in the art without departing from the spirit and scope of the invention; for example, it is within the scope of the present invention to combine the steps of the methods described above to perform substantially the same function in substantially the same way to achieve substantially the same result; accordingly, the scope of the invention is to be limited only by the following claims.

Claims (9)

1. An information system, characterized by: the information system includes:
an information system network device having an identifiable configuration; the information system network equipment comprises a monitoring agent module which is used for identifying configuration information when the information system network equipment is initialized;
the monitoring agent module is used for identifying the running state parameter information of the information system network equipment, and the running state parameter information of the information system network equipment is the CPU utilization rate and the memory utilization rate of the information system network equipment; the information system network equipment detects at least one state parameter based on a communication network and generates a fault detection notification message; the notification message comprises an identifier of each process executed by the network device when the notification message is generated and the memory module utilization rate of each corresponding identified process;
a troubleshooting module; the device comprises a fault elimination module, a Telnet module, an SNMP module and an SSH module, wherein the fault elimination module comprises an XML operation command designated for a fault elimination application program, and a logic command Telnet module, the SNMP module and the SSH module, and provides data collection capability so as to respond to the notification message from the network equipment according to transmission service information;
a data collection module; for receiving identifiable configuration information upon initialization of the information system network device and collecting together selected device state parameters associated with the failure event; the data collection module is executed by an executable software module with XML analysis capability, and information interaction is carried out through an IP network;
a first database; the data collection module is used for storing the configuration information of the information system network equipment sent by the data collection module; the first database is a parallel database, and the parallel database can realize the parallelization of the operation processing of the database;
a second database; for storing failure problems and solutions for information system network devices of known history;
a fault detection module based on the association rule; means for associating information system network device configuration information retrieved from the first database with selected device state parameters of the failure event to enable failure detection of the network device and storing the configuration information in a second database;
each information system network device comprises a monitoring agent module, and a plurality of monitoring agent modules are connected through a local server; the local server receives the fault detection notification message transmitted by the fault elimination module and performs information interaction with the data collection modules through the IP network; the fault detection module also comprises a client network database, a report module, a notification module and an association rule management module, receives a fault detection report message from the data collection module, retrieves the correlation of the identified configuration information from the client database, and outputs the correlation of the identified configuration information through the report module; and sending a retrieval command to the association rule management module through the central communication module, carrying out association rule data mining on the first database and the fault detection notification message by the association rule management module, sharing the processed result with the second database, feeding back the result to the association rule management module and the central communication module, and outputting a fault detection result through the notification module.
2. An information system according to claim 1, characterized in that: the fault detection module includes:
an association rule management module; for identifying the selected information system network device operational status parameter; the association rule management module adopts a Levy whale-based optimization algorithm to carry out data mining, and detects the correlation between the operation state parameters of the selected identifiable information system network equipment and the historical fault problem of the information system network equipment in the second database according to the association rule;
a notification module; a client contact for actively sending out a message identifying the network device; the notification module adopts simplex communication to send fault event notification information to a client contact;
a data update module; the system is used for updating the historical fault problem or solution information of the second database; the data updating module adopts a big data model open source tool Flume technical component to acquire the problem of information system faults, and the Flume technical component is used for acquiring changed data generated in the information system in time and finishing data updating of the second database;
a central communication module; for controlling information interaction among a plurality of data collection modules, the central communication module comprising:
(1) message routing; the system is used for controlling routing messages among the practice module, the equipment problematic answer module, the network event management module, the inventory management module and the association rule management module; the message routing receives the report message sent by the data collection module, and the message routing adopts duplex communication to complete the control command interaction among the practice module, the equipment trouble-shooting answer module, the network event management module, the inventory management module and the association rule management module;
(2) a practice module; the Levy whale-based optimization algorithm flow is used for outputting fault events of the information system network equipment to the client network database module and obtaining fault information processing based on the association rule management module;
(3) a device trouble-solving module; for responding to the fault event report message by obtaining operational status parameters of the information system network device associated with the fault event report message; the equipment problematic answering module sends a command for inquiring a second database to associate historical fault information to the association rule management module through RS232C port serial port communication, and sends an inquiry result to a client network database;
(4) a network event management module; the information system network equipment operation state parameters are used for managing fault events and non-fault events; the network event management module adopts a Hadoop platform to manage the fault event, and a Map-Reduce distributed data processor in the Hadoop platform can quickly process the running state parameters of the information system network equipment of the fault event and the non-fault event;
(5) an inventory management module; the system comprises a data collection module, a report module and a report module, wherein the data collection module is used for receiving a report request message from the information system network equipment through a corresponding data collection module and transmitting the report request message to the report module through a client network database; the inventory management module manages report messages by adopting a HBase column-oriented storage distributed management method, and receives and sends the report messages through RS232C port serial port communication;
the central communication module is respectively connected with the client network database, the association rule management module and the notification module.
3. An information system according to claim 2, characterized in that: the association rule management module also associates the identifiable information system network device hardware configuration with the stored information and outputs the device parameter settings of the network device to the notification module.
4. An information system according to claim 2, characterized in that: the first database further includes a storing step for providing identifiable network device hardware configuration information to a corresponding one of the data collection modules by the monitoring agent over the IP network for storage in the first database at initialization.
5. An information system according to claim 4, characterized in that: the second database includes a storing step after the storing step of the first database, the hardware configuration information identifiable by the network device is detected from the first database and stored in the second database.
6. An information system fault detection method is characterized in that: the method comprises the following steps:
(S1) outputting, by the monitoring agent module, operation state parameter information describing the information system network device, and generating at least one fault event report message, the monitoring agent module outputting the fault event report message to the local server;
(S2) the troubleshooting module connected with the local server requests to screen the running state parameters of the information system network equipment in the failure event report message, and the troubleshooting module sends the screened running state parameters of the information system network equipment to the data collection module in an XML form through the IP network;
(S3) the data collection module receives a fault event report message from the IP network through the RS232C port, the fault event report message including the information system network device operation state parameters, the data collection module sends the information system network device operation state parameters to the fault detection module, and the fault detection module is loaded into the report module according to the client network database;
(S4) the fault detection module retrieves normal operation information of the information system network equipment from the network client database, and the association rule management module performs association analysis on the screened operation state parameters and the normal operation information according to an optimized association rule algorithm based on the whale of Leivy to obtain a fault detection processing result;
(S5) the association rule management module sends the fault detection processing result to the central communication module, the central communication module sends the result to the reporting module, and the reporting module outputs the fault detection result.
7. The information system fault detection method of claim 6, wherein: in step (S1), the monitoring agent module further actively sends fault event notification information to the client contact using simplex communication, including the steps of:
(S11) the client contacts database receiving the notification information sent by the notification module;
(S12) the customer contact parsing the troubleshooting module through the versed DAC-R26 decoder to obtain the annunciation information, associating the annunciation information with the customer contact database to the ID of the fault information system network device according to the association rule;
(S13) the customer contact database sending customer contact information and fault information for the identified fault information system network device and resolution information;
(S14) the notification module transmits the resolution information to the central communication module using duplex communication, and then to the monitoring agent module of the fault information system network device via the central communication module.
8. The information system fault detection method of claim 6, wherein: in step (S3), the data collection module receives a fault event report message from the IP network through the RS232C port, the fault event report message including information system network device operation status parameters, the data collection module sends the information system network device operation status parameters to the fault detection module, and the fault detection module is loaded into the reporting module through the client network database, including the following steps:
(S31) the information system network device transmitting a fault event report message including the operation state parameters through a Hello protocol at the time of startup;
(S32) receiving, at the local server, a report message from the information system network device, and then storing the relevant operating state parameters of the network device in a local database;
(S33) the local server selecting a data collection module communication connection through the IP network to send a report message to the data collection module through the RS232C port;
(S34) the data collection module sends a report to a message route through the RS232C port, the message route sends the report to the inventory management module through the RS232C port, the inventory management module manages the report message by using the HBase column-oriented storage distributed management method, and updates the client network database;
(S35) the client network database transmits the report message to the report module through the RS232C port.
9. The information system fault detection method of claim 6, wherein: in step (S4), the association rule management module performs association analysis on the screened operation state parameters and the operation normal information according to an optimized association rule algorithm based on whale, including the following steps:
(S41) suppose that at oneDRandomly generating a dimension in spaceNA population of fault detection information particles, using LWO pairsCoefficient vector when the first database is searchedADeciding whether to search globally or locally, vectorAValue and parameter ofaThe contact is close to the contact of the two parts,Ais the interval-a,a]Random value of (2) whenaAt the time of enlargementAAnd the parameter is increased accordinglyaThe update formula is:
Figure 659247DEST_PATH_IMAGE001
(1)
in the formula (1), the first and second groups,tindicating number of iterations, subscriptmaxRepresents a maximum value; LWO, the algorithm is easy to fall into local optimum during iterative training, and a Levy optimization strategy is introduced to obtain a LWO position update formula as follows:
Figure 183769DEST_PATH_IMAGE002
(2)
in the formula (2), the first and second groups,Xindicating the optimal solution for the location of the fault-detection information particles,
Figure 469257DEST_PATH_IMAGE004
a constant representing the shape of a logarithmic spiral,Levy() A Levy optimization strategy function is represented,pit is indicated that the constant parameter is,lrepresents the interval [0,1]The parameters of (1);
(S42) limiting the positions of the fault detection information particles to {0,1} by using LWO algorithm, and encoding the fault detection information particles into a binary sequence by a sigmoid function;
Figure 737427DEST_PATH_IMAGE005
(3)
in the formula (3), T 1 To T 5 Represents five types of fault detection information particle populations, and has totalI 1 ToI 6 Each population of the six items contains different numbers of items, if the item exists, the item is marked as 1, otherwise, the item is marked as 0;I 1 ofCode is 11, representsI 1 This term exists inT 1 Associating within the rule and being a antecedent to the rule;I 2 is 10, representsI 2 This term existsT 1 Associating within the rule and being a back-part of the rule;I 3 the code for this term is 01, meaningI 3 This term does not existT 1 Within the association rule, therefore, the problem of the front and back pieces does not need to be considered, and the whole rule is interpreted asI 1I 2 The association rule of (1);
(S43) applying the binary sequence of fault detection information to the multi-target association rule data mining, including the support degreeSConfidence of the imageCMinimum support minSMinimum confidence minCDegree of liftingLAnd a certainty factorCFWherein:
degree of supportSAB) Is represented in the failure detection information setKMiddle item setABPercentage of occurrences taken together:
Figure 542704DEST_PATH_IMAGE006
(3)
in the formula (3), P: (AB) RepresentAAndBsimultaneous detection of information sets in a faultKThe number of occurrences; confidence levelCAB) Is represented in item setAAlready existing, item setBProbability of occurrence:
Figure 503706DEST_PATH_IMAGE007
(4)
in the formula (4), P: (A) RepresentADetecting information sets in a faultKThe number of occurrences; minimum support minSAnd minimum confidence minCSatisfy the degree of supportSAnd confidence levelCLower limit of (1) in the set of fault detection information to be minedKIn, ifABDegree of support ofSAnd confidence levelCAre all greater than minSAnd minCThen, thenABIs a strong association rule; degree of liftingLAB) A metric representing an evaluation correlation coefficient:
Figure 73228DEST_PATH_IMAGE008
(5)
in the formula (5), whenL>1, the association ruleABIs in a positive correlation with each other,Acan promoteB(iii) occurrence of (a); when in useL<1, then the association rule is negatively related,Awill inhibitBOccurrence of (1);Lif =1, it indicates that the predecessor and successor of the association rule are independent from each other; certainty factorCFRepresents the probability of occurrence of a back-piece in case of occurrence of a front-piece in the evaluation association rule:
Figure 627313DEST_PATH_IMAGE009
(6)
in the formula (6), the first and second groups,CF>0, then the association ruleABPositive correlation;CF<0, then the association ruleABNegative correlation;CAB)=S(B) Time of flightCF=0, then the association ruleABThe front piece and the rear piece are mutually independent; some misleading rules existing in the strong association rules are corrected through multi-target evaluation, and the generation of association rules which are numerically correct but actually irrelevant in fault detection information is avoided;
(S44) fault detection information mining is carried out through a multi-target association rule, and a fitness function obtained through weighting is as follows:
Figure 801942DEST_PATH_IMAGE010
(7)
in the formula (7), the first and second groups,frepresenting a fitness function byfAnd judging the relevance of the fault detection information, thereby outputting the information system network equipment fault information with the relevance.
CN202210870184.1A 2022-07-22 2022-07-22 Information system and information system fault detection method Pending CN114928529A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210870184.1A CN114928529A (en) 2022-07-22 2022-07-22 Information system and information system fault detection method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210870184.1A CN114928529A (en) 2022-07-22 2022-07-22 Information system and information system fault detection method

Publications (1)

Publication Number Publication Date
CN114928529A true CN114928529A (en) 2022-08-19

Family

ID=82815801

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210870184.1A Pending CN114928529A (en) 2022-07-22 2022-07-22 Information system and information system fault detection method

Country Status (1)

Country Link
CN (1) CN114928529A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116170362A (en) * 2023-02-22 2023-05-26 北京乐研科技股份有限公司 Gateway equipment rapid inspection method and system based on ARM platform
CN116502128A (en) * 2023-05-09 2023-07-28 盐城工学院 Bearing fault diagnosis method based on improved association rule
CN118143740A (en) * 2024-05-13 2024-06-07 常州市泰德精机科技有限公司 Spindle detection method and system of numerical control machine tool
CN119476654A (en) * 2025-01-15 2025-02-18 湖南真通智用人工智能科技有限公司 Multi-device fault dispatching method, device, equipment and medium for voice control system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7080141B1 (en) * 2002-04-12 2006-07-18 Cisco Technology, Inc. Arrangement for automated fault detection and fault resolution of a network device
CN106992877A (en) * 2017-03-08 2017-07-28 中国人民解放军国防科学技术大学 Network Fault Detection and restorative procedure based on SDN frameworks

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7080141B1 (en) * 2002-04-12 2006-07-18 Cisco Technology, Inc. Arrangement for automated fault detection and fault resolution of a network device
CN106992877A (en) * 2017-03-08 2017-07-28 中国人民解放军国防科学技术大学 Network Fault Detection and restorative procedure based on SDN frameworks

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
张艾欣: "基于改进鲸鱼优化算法的多目标关联规则挖掘方法", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116170362A (en) * 2023-02-22 2023-05-26 北京乐研科技股份有限公司 Gateway equipment rapid inspection method and system based on ARM platform
CN116170362B (en) * 2023-02-22 2024-01-30 北京乐研科技股份有限公司 Gateway equipment rapid inspection method and system based on ARM platform
CN116502128A (en) * 2023-05-09 2023-07-28 盐城工学院 Bearing fault diagnosis method based on improved association rule
CN118143740A (en) * 2024-05-13 2024-06-07 常州市泰德精机科技有限公司 Spindle detection method and system of numerical control machine tool
CN119476654A (en) * 2025-01-15 2025-02-18 湖南真通智用人工智能科技有限公司 Multi-device fault dispatching method, device, equipment and medium for voice control system
CN119476654B (en) * 2025-01-15 2025-04-11 湖南真通智用人工智能科技有限公司 Multi-device fault dispatching method, device, equipment and medium for voice control system

Similar Documents

Publication Publication Date Title
CN114928529A (en) Information system and information system fault detection method
US7509415B2 (en) Arrangement for automated fault detection and fault resolution of a network device
Zhang et al. BFTCloud: A byzantine fault tolerance framework for voluntary-resource cloud computing
US8051330B2 (en) Fault location in telecommunications networks using bayesian networks
US8583779B2 (en) Root cause analysis approach with candidate elimination using network virtualization
US12271287B2 (en) Method and system for recommending runbooks for detected events
US8423826B2 (en) Method, apparatus and system for displaying result of failure root cause analysis
US7275017B2 (en) Method and apparatus for generating diagnoses of network problems
US9450819B2 (en) Autonomic network sentinels
US10728085B1 (en) Model-based network management
CN100452017C (en) Method and apparatus for triage of network alarms
US20080270225A1 (en) Apparatus and Methods for Determining Availability and Performance of Entities Providing Services in a Distributed System Using Filtered Service Consumer Feedback
US7469287B1 (en) Apparatus and method for monitoring objects in a network and automatically validating events relating to the objects
US20080228873A1 (en) Method and system for generic application liveliness monitoring for business resiliency
CN119473638A (en) A multi-service system interface integration and dynamic host automatic management method and system
CN116723154B (en) Route distribution method and system based on load balancing
CN117891641A (en) Fault object positioning method and device, storage medium and electronic device
CN119484247B (en) Automatic operation and maintenance method, device, server and storage medium
CN118779182B (en) Fault self-healing method, device, equipment and storage medium
US20250247283A1 (en) Responsible incident prediction
KR102719678B1 (en) Method and apparatus for identifying network failure type on software-defined network environment based on machine learning
Yamauchi et al. Deep Reinforcement Learning based Command Control System for Automating Fault Diagnosis
US20250016042A1 (en) Enhanced event-driven diagnostics for communication networks
US20250193077A1 (en) Method and apparatus for determining optimized network configuration
CN119052059A (en) Fault analysis method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20220819

RJ01 Rejection of invention patent application after publication