JP2021069009A

JP2021069009A - Information processing system, information processing system control method, information processing device, and program

Info

Publication number: JP2021069009A
Application number: JP2019192939A
Authority: JP
Inventors: 周岡本; Shu Okamoto; 大地小野寺; Daichi Onodera
Original assignee: NS Solutions Corp
Current assignee: NS Solutions Corp
Priority date: 2019-10-23
Filing date: 2019-10-23
Publication date: 2021-04-30
Anticipated expiration: 2039-10-23
Also published as: JP7263206B2

Abstract

【課題】複数の監視手段を利用した監視対象の状態の監視をより好適な態様で実現可能とする。【解決手段】本発明に係る情報処理システムは、互いに異なる監視条件に基づき状態の監視を行う複数の監視手段それぞれから当該監視の結果に応じたメッセージを取得する取得手段と、前記複数の監視手段のうち少なくとも１以上の監視手段から取得された前記メッセージに基づきイベントデータを生成する生成手段と、所定の抽出条件に基づき前記イベントデータを抽出する抽出手段と、前記イベントデータの抽出結果に応じた出力情報の出力を制御する出力制御手段と、を備える。【選択図】図１PROBLEM TO BE SOLVED: To realize monitoring of a state of a monitoring target using a plurality of monitoring means in a more preferable manner. An information processing system according to the present invention includes an acquisition means for acquiring a message according to a result of the monitoring from each of a plurality of monitoring means for monitoring a state based on different monitoring conditions, and the plurality of monitoring means. According to the generation means for generating event data based on the message acquired from at least one of the monitoring means, the extraction means for extracting the event data based on the predetermined extraction conditions, and the extraction result of the event data. An output control means for controlling the output of output information is provided. [Selection diagram] Fig. 1

Description

本開示は、情報処理システム、情報処理システムの制御方法、情報処理装置、及びプログラムに関する。 The present disclosure relates to an information processing system, a control method of the information processing system, an information processing device, and a program.

システムやコンピュータの状態の監視に各種監視ツールが利用される場合がある。このような監視ツールは、例えば、サーバ、ネットワーク、アプリケーション等のような監視対象の状態を監視し、異常と推測される事象の発生を検知すると、メッセージの出力等によりユーザ（例えば、管理者等）に対して情報の報知を行う。これにより、ユーザは、異常の発生を認識して、当該異常に対して迅速に対応を行うことが可能となる。例えば、特許文献１には、システムやコンピュータの状態の監視に係る技術の一例が開示されている。 Various monitoring tools may be used to monitor the status of systems and computers. Such a monitoring tool monitors the status of a monitoring target such as a server, network, application, etc., and when it detects the occurrence of an event presumed to be abnormal, it outputs a message or the like to a user (for example, an administrator or the like). ) Is notified of information. As a result, the user can recognize the occurrence of the abnormality and quickly respond to the abnormality. For example, Patent Document 1 discloses an example of a technique for monitoring the state of a system or a computer.

特に、近年では、所謂クラウドシステム等のような、従来に比べてより規模の大きいシステムが導入されるケースもあり、監視ツールを利用することでシステムの管理に係るユーザの負担をより軽減する技術が注目されている。 In particular, in recent years, there have been cases where a system larger than the conventional one, such as a so-called cloud system, has been introduced, and a technology that further reduces the burden on the user related to system management by using a monitoring tool. Is attracting attention.

特開２００２−２５２６１４号公報JP-A-2002-252614

ところで、近年では、監視ツールとしては多様なものが提案されており、これらの中には、監視対象（例えば、サーバ、ネットワーク、アプリケーション等）や監視方法の異なる監視ツールも含まれている。そのため、監視ツールごとに、監視対象の種別や監視に係る条件（例えば、制約）等に応じて、監視対象の状態の監視に係る機能や性能が異なる場合もある。 By the way, in recent years, various monitoring tools have been proposed, and among these, monitoring tools having different monitoring targets (for example, servers, networks, applications, etc.) and monitoring methods are also included. Therefore, for each monitoring tool, the function and performance related to monitoring the state of the monitored target may differ depending on the type of monitored target, the conditions related to monitoring (for example, restrictions), and the like.

特に、規模の大きいシステムにおいては、より多くのサーバ、ネットワーク、及びアプリケーション等が連携して動作することで相互に影響を及ぼすような場合もある。このような状況下では、監視対象の種別や監視に係る条件等に応じた監視ツールごとの機能差や性能差の影響がより顕著に顕在化する場合もある。このような背景から、複数の監視手段（例えば、監視ツール）を利用した監視対象の状態の監視を可能とする技術の実現が求められている。 In particular, in a large-scale system, there are cases where more servers, networks, applications, and the like operate in cooperation with each other to affect each other. Under such circumstances, the effects of functional differences and performance differences for each monitoring tool according to the type of monitoring target and the conditions related to monitoring may become more noticeable. Against this background, there is a demand for the realization of a technique that enables monitoring of the state of a monitored object using a plurality of monitoring means (for example, a monitoring tool).

そこで、本開示では、複数の監視手段を利用した監視対象の状態の監視をより好適な態様で実現可能とする技術を提案する。 Therefore, the present disclosure proposes a technique that makes it possible to monitor the state of a monitored object using a plurality of monitoring means in a more preferable manner.

本発明に係る情報処理システムは、互いに異なる監視条件に基づき状態の監視を行う複数の監視手段それぞれから当該監視の結果に応じたメッセージを取得する取得手段と、前記複数の監視手段のうち少なくとも１以上の監視手段から取得された前記メッセージに基づきイベントデータを生成する生成手段と、所定の抽出条件に基づき前記イベントデータを抽出する抽出手段と、前記イベントデータの抽出結果に応じた出力情報の出力を制御する出力制御手段と、を備える。 The information processing system according to the present invention includes an acquisition means for acquiring a message according to the result of the monitoring from each of a plurality of monitoring means for monitoring the state based on different monitoring conditions, and at least one of the plurality of monitoring means. A generation means that generates event data based on the message acquired from the above monitoring means, an extraction means that extracts the event data based on predetermined extraction conditions, and output of output information according to the extraction result of the event data. It is provided with an output control means for controlling the above.

複数の監視手段を利用した監視対象の状態の監視をより好適な態様で実現することが可能となる。 It is possible to monitor the state of the monitored object using a plurality of monitoring means in a more preferable manner.

図１は、監視システムの機能構成の一例を示したブロック図である。FIG. 1 is a block diagram showing an example of the functional configuration of the monitoring system. 図２は、情報処理装置のハードウェア構成の一例を示した図である。FIG. 2 is a diagram showing an example of the hardware configuration of the information processing device. 図３は、監視システムの処理の一例を示したフローチャートである。FIG. 3 is a flowchart showing an example of processing of the monitoring system. 図４は、監視ツールごとに出力されるメッセージの一例を示した図である。FIG. 4 is a diagram showing an example of a message output for each monitoring tool. 図５は、フィルタによる状態判定の条件の規定方法の一例を示した図である。FIG. 5 is a diagram showing an example of a method of defining the conditions for determining the state by the filter.

以下に添付図面を参照しながら、本開示の好適な実施の形態について詳細に説明する。なお、本明細書及び図面において、実質的に同一の機能構成を有する構成要素については、同一の符号を付することにより重複説明を省略する。 Preferred embodiments of the present disclosure will be described in detail below with reference to the accompanying drawings. In the present specification and the drawings, components having substantially the same functional configuration are designated by the same reference numerals, so that duplicate description will be omitted.

＜概要＞
まず、本実施形態に係る監視システムの概要について説明する。なお、以降の説明では、本実施形態に係る監視システムを、他のシステムと区別するために、便宜上「監視システム１００」とも称する。監視システム１００は、複数の監視部（例えば、監視ツール）を利用して、監視対象となるシステムの状態を監視し、当該監視の結果に応じた情報を所定の出力先に出力する。なお、本開示では、監視対象となるシステムは、複数の装置により実現されるシステムであってもよく、１つの装置により実現されるシステムであってよいものとする。 <Overview>
First, the outline of the monitoring system according to the present embodiment will be described. In the following description, the monitoring system according to the present embodiment will be referred to as "monitoring system 100" for convenience in order to distinguish it from other systems. The monitoring system 100 monitors the state of the system to be monitored by using a plurality of monitoring units (for example, a monitoring tool), and outputs information according to the result of the monitoring to a predetermined output destination. In the present disclosure, the system to be monitored may be a system realized by a plurality of devices, or may be a system realized by one device.

監視システム１００が利用する複数の監視部には、互いに異なる監視条件に基づき所望の監視対象の状態を監視する２以上の監視部が含まれてもよい。具体的な一例として、監視システム１００は、監視対象の種別や監視方法が互いに異なる２以上の監視部を利用することで、所望の監視対象の状態の監視を行ってもよい。
また、監視部として既存のものが利用されてもよい。具体的な一例として、監視システム１００は、複数の監視部のうちの少なくとも一部として、ＯＳＳ（ＯｐｅｎＳｏｕｒｃｅＳｏｆｔｗａｒｅ）として公開されている監視ツールを利用してもよい。 The plurality of monitoring units used by the monitoring system 100 may include two or more monitoring units that monitor the state of a desired monitoring target based on different monitoring conditions. As a specific example, the monitoring system 100 may monitor the state of a desired monitoring target by using two or more monitoring units having different types of monitoring targets and monitoring methods.
Further, an existing monitoring unit may be used. As a specific example, the monitoring system 100 may use a monitoring tool published as OSS (Open Source Software) as at least a part of the plurality of monitoring units.

監視システム１００は、複数の監視部それぞれによる監視結果を組み合わせることで、監視対象となるシステムの状態を推定することも可能である。これにより、個々の監視部による監視結果のみでは推定することが困難な状態を推定することが可能となる。 The monitoring system 100 can also estimate the state of the system to be monitored by combining the monitoring results of each of the plurality of monitoring units. This makes it possible to estimate a state that is difficult to estimate only from the monitoring results of individual monitoring units.

具体的な一例として、監視部Ａが、監視結果に応じて、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）の負荷が閾値以上となったことを示すメッセージが出力したものとする。この監視結果のみでは、システムに対して何らかの異常が発生したことで、ＣＰＵの負荷が増大しているものと推測される可能性がある。一方で、バックアップジョブ等のように実行が予定されている通常の処理の中には、実行に伴う負荷が比較的大きいものもある。そのため、例えば、監視部Ａが上記メッセージを出力する直前に、他の監視部Ｂが、監視結果に応じて、バックアップジョブ開始のイベントを検出したことを示すメッセージを出力していたものとする。この場合には、監視部Ａ及び監視部Ｂそれぞれから出力されたメッセージに基づき、バックアップジョブの実行に伴いＣＰＵの負荷が増大しているため、監視部Ａの監視結果が正常な動作によるものであると認識することが可能となる。 As a specific example, it is assumed that the monitoring unit A outputs a message indicating that the load of the CPU (Central Processing Unit) has exceeded the threshold value according to the monitoring result. From this monitoring result alone, it may be presumed that the load on the CPU is increasing due to the occurrence of some abnormality in the system. On the other hand, some of the normal processes scheduled to be executed, such as backup jobs, have a relatively large load associated with the execution. Therefore, for example, it is assumed that immediately before the monitoring unit A outputs the above message, another monitoring unit B outputs a message indicating that an event for starting a backup job has been detected according to the monitoring result. In this case, the load on the CPU increases with the execution of the backup job based on the messages output from each of the monitoring unit A and the monitoring unit B, so that the monitoring result of the monitoring unit A is due to normal operation. It becomes possible to recognize that there is.

また、他の一例として、監視部Ａが、監視結果に応じて、Ｗｅｂサーバのプロセスが停止したことを示すメッセージを出力したものとする。この監視結果のみでは、システムに対して何らかの異常が発生したことで、Ｗｅｂサーバのプロセスが停止したものと推測される可能性がある。これに対して、例えば、監視部Ａが上記メッセージを出力する直前に、他の監視部Ｂが、監視結果に応じて、クラウドプロバイダにてＷｅｂサーバのスケールインが実施されたことを示すイベントを検出したことを示すメッセージを出力していたものとする。Ｗｅｂサーバのスケールインはリクエスト数の増減にあわせて不定期に発生するが、その際にＷｅｂサーバのプロセスが停止する場合がある。そのため、この場合には、監視部Ａ及び監視部Ｂそれぞれから出力されたメッセージに基づき、Ｗｅｂサーバのスケールインにより当該Ｗｅｂサーバのプロセスが停止しているため、監視部Ａの監視結果が正常な動作によるものであると認識することが可能となる。 Further, as another example, it is assumed that the monitoring unit A outputs a message indicating that the process of the Web server has stopped according to the monitoring result. From this monitoring result alone, it may be presumed that the process of the Web server has stopped due to some abnormality occurring in the system. On the other hand, for example, immediately before the monitoring unit A outputs the above message, another monitoring unit B sends an event indicating that the scale-in of the Web server has been executed by the cloud provider according to the monitoring result. It is assumed that a message indicating that the detection has been output has been output. The scale-in of the Web server occurs irregularly according to the increase or decrease in the number of requests, but at that time, the process of the Web server may be stopped. Therefore, in this case, based on the messages output from each of the monitoring unit A and the monitoring unit B, the process of the Web server is stopped due to the scale-in of the Web server, so that the monitoring result of the monitoring unit A is normal. It becomes possible to recognize that it is due to the operation.

監視システム１００は、上記に例示したように、複数の監視部それぞれから監視結果に応じて出力される情報（メッセージ）を、所定のフォーマットで規定された、状態検出の対象とする事象ごとのイベントデータを生成する。
なお、メッセージとは、各監視部が監視結果に応じて出力する情報のうち、特に文字情報で表現されたものを示している。また、イベントデータは、監視システム１００が、各監視部から出力されるメッセージに含まれる少なくとも一部の情報を、所望の事象に対応するイベントに対応付けて生成するデータである。
監視システム１００は、イベントデータの生成に際し、複数の情報（メッセージ）を関連付けることで、１つのイベントデータを生成してもよい。また、監視システム１００は、１つのメッセージから複数の情報を抽出し、個々の情報についてイベントデータを生成してもよい。以上のようにして、監視システム１００では、状態検出に係る検出単位ごとにイベントデータが生成されることとなる。 As illustrated above, the monitoring system 100 specifies information (messages) output from each of the plurality of monitoring units according to the monitoring results in a predetermined format, and is an event for each event targeted for state detection. Generate data.
The message indicates, among the information output by each monitoring unit according to the monitoring result, the information expressed in character information in particular. Further, the event data is data generated by the monitoring system 100 in association with at least a part of the information included in the message output from each monitoring unit in association with the event corresponding to the desired event.
The monitoring system 100 may generate one event data by associating a plurality of information (messages) when generating the event data. Further, the monitoring system 100 may extract a plurality of pieces of information from one message and generate event data for each piece of information. As described above, in the monitoring system 100, event data is generated for each detection unit related to the state detection.

そして、監視システム１００は、生成した一連のイベントデータを、当該イベントデータの生成に係るフォーマットに応じたフィルタに基づき、正常な状態を示すイベントデータと、異常な状態を示すイベントデータと、に振り分ける。これにより、監視部ごとに固有のルール（例えば、フィルタの規定に係るルール等）に依存せずに、一連のイベントデータを所望の条件に基づき振り分けることが可能となる。
また、監視システム１００は、所定の条件に基づき抽出される２以上のイベントデータの組み合わせに基づき、正常及び異常のいずれかを判定してもよい。このような構成により、監視対象の種別や監視に係る条件等に応じて監視ツールごとの機能差や性能差を活かした監視を実施することができ、監視対象の状態をより正確に把握することが可能となる。 Then, the monitoring system 100 divides the generated series of event data into event data indicating a normal state and event data indicating an abnormal state based on a filter according to the format related to the generation of the event data. .. As a result, a series of event data can be distributed based on desired conditions without depending on a rule unique to each monitoring unit (for example, a rule related to the regulation of the filter).
Further, the monitoring system 100 may determine either normal or abnormal based on a combination of two or more event data extracted based on a predetermined condition. With such a configuration, it is possible to carry out monitoring that makes use of the functional differences and performance differences of each monitoring tool according to the type of monitoring target and the conditions related to monitoring, and to grasp the status of the monitoring target more accurately. Is possible.

そこで、以降では、本実施形態に係る監視システム１００について、特に上述した特徴を実現するための機能構成や処理に着目してより詳細に説明する。 Therefore, the monitoring system 100 according to the present embodiment will be described in more detail below, paying particular attention to the functional configuration and processing for realizing the above-mentioned features.

＜機能構成＞
図１を参照して、本実施形態に係る監視システム１００を含むシステム１の機能構成の一例について説明する。図１は、システム１の機能構成の一例を示したブロック図である。システム１は、１以上の監視対象１９０と、監視システム１００と、を含む。監視システム１００は、システム１のうち、１以上の監視対象１９０それぞれの状態の監視を実現するための構成に相当する。監視対象１９０は、監視システム１００による状態監視の監視対象を模式的に示したものである。なお、システム１のうち、少なくとも監視システム１００に相当する部分が、本実施形態に係る「情報処理システム」の一例に相当する。 <Functional configuration>
An example of the functional configuration of the system 1 including the monitoring system 100 according to the present embodiment will be described with reference to FIG. FIG. 1 is a block diagram showing an example of the functional configuration of the system 1. The system 1 includes one or more monitoring targets 190 and a monitoring system 100. The monitoring system 100 corresponds to a configuration for realizing monitoring of the state of each of one or more monitoring targets 190 in the system 1. The monitoring target 190 schematically shows a monitoring target for condition monitoring by the monitoring system 100. In addition, at least the part corresponding to the monitoring system 100 in the system 1 corresponds to an example of the "information processing system" according to the present embodiment.

本実施形態において、監視システム１００が状態監視の対象とする監視対象１９０は、ハードウェア的な構成であってもよいし、ソフトウェア的な構成であってもよい。ハードウェア的な構成としての監視対象１９０の一例としては、サーバ、ネットワークスイッチ、ストレージ等のようなハードウェアが挙げられる。また、ソフトウェア的な構成としての監視対象１９０の一例として、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）、ハイパーバイザ、ミドルウェア、データベース、アプリケーション、コンテナ等が挙げられる。また、所謂ネットワークについても、監視対象１９０に含まれ得る。この場合には、物理的なネットワークと、１以上の物理的なネットワークを含むように規定された論理的なネットワークと、のいずれも監視対象１９０に含まれ得る。
また、サーバ等の情報処理装置を監視対象とする場合においても、当該情報処理装置が所謂ヴァーチャルマシンを利用することでソフトウェア的に実現されていてもよい。この場合には、ヴァーチャルマシンを動作させるハードウェアとしての情報処理装置と、ヴァーチャルマシン上でソフトウェア的に動作する情報処理装置と、のいずれも監視対象１９０に含まれ得る。また、このような状況下では、例えば、複数のハードウェア的な情報処理装置を、ソフトウェア的に１つの情報処理装置として見立てて運用するような場合も想定され得る。このような場合においても、複数のハードウェア的な情報処理装置それぞれと、ソフトウェア的に運用される情報処理装置と、のいずれも監視対象１９０に含まれ得る。
また、図１に示す複数の監視対象１９０のそれぞれは、互いに種別の異なる監視対象であってもよい。 In the present embodiment, the monitoring target 190 targeted by the monitoring system 100 for status monitoring may have a hardware configuration or a software configuration. An example of the monitoring target 190 as a hardware configuration includes hardware such as a server, a network switch, and a storage. Further, as an example of the monitoring target 190 as a software configuration, an OS (Operating System), a hypervisor, middleware, a database, an application, a container, and the like can be mentioned. The so-called network can also be included in the monitoring target 190. In this case, both the physical network and the logical network defined to include one or more physical networks may be included in the monitored object 190.
Further, even when an information processing device such as a server is to be monitored, the information processing device may be realized by software by using a so-called virtual machine. In this case, both the information processing device as hardware for operating the virtual machine and the information processing device for software operation on the virtual machine can be included in the monitoring target 190. Further, under such a situation, for example, it can be assumed that a plurality of hardware-like information processing devices are operated as if they are software-like one information processing device. Even in such a case, each of the plurality of hardware-like information processing devices and the information processing device operated by software can be included in the monitoring target 190.
Further, each of the plurality of monitoring targets 190 shown in FIG. 1 may be monitoring targets of different types.

監視システム１００は、複数の監視部１１０と、イベント生成部１２１と、メッセージ記憶部１２２と、イベント抽出部１２３と、ルール記憶部１２４と、イベント記憶部１２５と、出力制御部１２６と、出力部１２７とを含む。なお、図１に示す例では、複数の監視部１１０として、特徴の異なる監視部１１０ａ及び１１０ｂが含まれるものとする。 The monitoring system 100 includes a plurality of monitoring units 110, an event generation unit 121, a message storage unit 122, an event extraction unit 123, a rule storage unit 124, an event storage unit 125, an output control unit 126, and an output unit. Includes 127 and. In the example shown in FIG. 1, it is assumed that the monitoring units 110a and 110b having different characteristics are included as the plurality of monitoring units 110.

監視部１１０は、所定の監視条件に基づき１以上の監視対象１９０の状態を監視する。監視部１１０ａ及び１１０ｂは、互いに異なる監視条件に基づき１以上の監視対象１９０の状態を監視してもよい。また、監視部１１０ａ及び１１０ｂのそれぞれは、互いに異なる監視対象１９０（例えば、互いに種別の異なる監視対象１９０）の状態を監視してもよい。 The monitoring unit 110 monitors the status of one or more monitoring targets 190 based on predetermined monitoring conditions. The monitoring units 110a and 110b may monitor the status of one or more monitoring targets 190 based on different monitoring conditions. Further, each of the monitoring units 110a and 110b may monitor the states of different monitoring targets 190 (for example, different types of monitoring targets 190).

具体的な一例として、少なくとも一部の監視部１１０は、監視対象１９０の状態のうち、特に監視対象１９０の性能に関する状態を監視してもよい。性能に関する状態の一例としては、ＣＰＵの使用率、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）等のようなワーキングエリアの使用率、補助記憶装置等のような各種データを記憶させる領域の使用率、ネットワークの通信速度等が挙げられる。
また、他の一例として、少なくとも一部の監視部１１０は、監視対象１９０において所定の記憶領域のファイルに逐次記録される履歴を監視することで、当該監視対象１９０の状態を監視してもよい。この場合には、監視部１１０は、例えば、監視対象１９０において所定の種別の履歴が記録されたことを検出した場合に、当該履歴の検出結果に応じて当該監視対象１９０の状態を判定してもよい。
もちろん、上記はあくまで一例であり、必ずしも監視部１１０による監視対象１９０の状態の監視に係る監視条件を限定するものではない。例えば、アプリケーション、コンテナ、ハイパーバイザ、ＯＳ、ハードウェア、及びネットワーク等のいずれの種別の監視対象１９０の状態を監視するかに応じて監視条件が決定されてもよい。また、監視部１１０として利用される監視ツールに応じて監視条件が決定される場合もある。
また、監視部１１０による監視対象１９０の監視方法についても、当該監視対象１９０の種別に応じて適宜変更されてもよい。例えば、監視部１１０は、他の装置を監視する場合には、ネットワークを介して当該装置から各種情報を収集することで、当該情報に基づき当該装置の監視を行ってもよい。 As a specific example, at least a part of the monitoring unit 110 may monitor the state of the monitored object 190, particularly the state related to the performance of the monitored object 190. Examples of performance-related states include CPU usage rate, working area usage rate such as RAM (Random Access Memory), usage rate of areas for storing various data such as auxiliary storage devices, and network communication speed. And so on.
Further, as another example, at least a part of the monitoring units 110 may monitor the state of the monitored target 190 by monitoring the history sequentially recorded in the file of the predetermined storage area in the monitored target 190. .. In this case, for example, when the monitoring unit 110 detects that a history of a predetermined type has been recorded in the monitoring target 190, the monitoring unit 110 determines the state of the monitoring target 190 according to the detection result of the history. May be good.
Of course, the above is only an example, and does not necessarily limit the monitoring conditions related to the monitoring of the state of the monitored object 190 by the monitoring unit 110. For example, the monitoring conditions may be determined depending on which type of monitoring target 190, such as an application, a container, a hypervisor, an OS, hardware, or a network, is to be monitored. In addition, monitoring conditions may be determined according to the monitoring tool used as the monitoring unit 110.
Further, the monitoring method of the monitoring target 190 by the monitoring unit 110 may be appropriately changed according to the type of the monitoring target 190. For example, when monitoring another device, the monitoring unit 110 may monitor the device based on the information by collecting various information from the device via the network.

監視部１１０は、監視対象１９０の状態の監視結果に応じた情報を所定の出力先に出力する。なお、以降では、本実施形態に係る監視システム１００の特徴をより分かりやすくするために、監視部１１０は、監視対象１９０の状態の監視結果に応じた情報が文字情報により示されたメッセージを所定の出力先に出力するものとする。
監視部１１０から出力されたメッセージは、後述するイベント生成部１２１により利用される。そのため、例えば、監視部１１０は、上記監視結果に応じたメッセージをイベント生成部１２１に直接出力してもよい。また、他の一例として、監視部１１０は、上記監視結果に応じたメッセージを所定の記憶領域に記憶させてもよい。メッセージ記憶部１２２は、監視部１１０から出力されたメッセージを記憶する記憶領域を示している。この場合には、イベント生成部１２１は、メッセージ記憶部１２２に記憶されたメッセージを抽出して利用してもよい。 The monitoring unit 110 outputs information according to the monitoring result of the state of the monitoring target 190 to a predetermined output destination. In the following, in order to make the features of the monitoring system 100 according to the present embodiment easier to understand, the monitoring unit 110 specifies a message in which information corresponding to the monitoring result of the state of the monitoring target 190 is indicated by character information. It shall be output to the output destination of.
The message output from the monitoring unit 110 is used by the event generation unit 121, which will be described later. Therefore, for example, the monitoring unit 110 may directly output a message according to the monitoring result to the event generation unit 121. Further, as another example, the monitoring unit 110 may store a message according to the monitoring result in a predetermined storage area. The message storage unit 122 indicates a storage area for storing a message output from the monitoring unit 110. In this case, the event generation unit 121 may extract and use the message stored in the message storage unit 122.

なお、複数の監視部１１０のうち、一部の監視部１１０による監視対象１９０の状態の監視に係る条件が、「第１の監視条件」の一例に相当し、他の一部の監視部１１０による監視対象１９０の状態の監視に係る条件が、「第２の監視条件」の一例に相当する。また、第１の監視条件に基づく監視結果に応じて対応する監視部１１０から出力されるメッセージが、「第１のメッセージ」の一例に相当する。同様に、第２の監視条件に基づく監視結果に応じて対応する監視部１１０から出力されるメッセージが、「第２のメッセージ」の一例に相当する。
より具体的な一例として、監視対象１９０の状態の監視に係る条件のうち、特に、監視対象１９０の性能に関する監視条件が、「第１の監視条件」の一例に相当する。また、監視対象１９０の状態の監視に係る条件のうち、特に、監視対象１９０において逐次記録される履歴に関する監視条件が、「第２の監視条件」の一例に相当する。
なお、第１の監視条件に基づき監視対象１９０の状態を監視する監視部１１０を、便宜上「第１の監視手段」と称する場合がある。同様に、第２の監視条件に基づき監視対象１９０の状態を監視する監視部１１０を、便宜上「第２の監視手段」と称する場合がある。 Of the plurality of monitoring units 110, the conditions related to the monitoring of the state of the monitored target 190 by some of the monitoring units 110 correspond to an example of the "first monitoring condition", and some of the other monitoring units 110. The condition relating to the monitoring of the state of the monitoring target 190 by the above corresponds to an example of the "second monitoring condition". Further, the message output from the monitoring unit 110 corresponding to the monitoring result based on the first monitoring condition corresponds to an example of the "first message". Similarly, the message output from the monitoring unit 110 corresponding to the monitoring result based on the second monitoring condition corresponds to an example of the "second message".
As a more specific example, among the conditions relating to the monitoring of the state of the monitoring target 190, the monitoring condition relating to the performance of the monitoring target 190 corresponds to an example of the "first monitoring condition". Further, among the conditions relating to the monitoring of the state of the monitoring target 190, the monitoring condition relating to the history sequentially recorded in the monitoring target 190 corresponds to an example of the "second monitoring condition".
The monitoring unit 110 that monitors the state of the monitoring target 190 based on the first monitoring condition may be referred to as a "first monitoring means" for convenience. Similarly, the monitoring unit 110 that monitors the state of the monitored object 190 based on the second monitoring condition may be referred to as a "second monitoring means" for convenience.

イベント生成部１２１は、監視部１１０が監視対象１９０の状態の監視結果に応じて出力するメッセージに基づき、監視システム１００が状態検出の対象とする事象ごとにイベントデータを生成する。例えば、イベント生成部１２１は、個々の監視部１１０が出力するメッセージごとにイベントデータを生成してもよい。また、他の一例として、イベント生成部１２１は、複数のメッセージに基づき１つのイベントデータを生成してもよい。また、イベント生成部１２１は、１つのメッセージに含まれる複数の情報を抽出し、当該情報ごとにイベントデータを生成してもよい。 The event generation unit 121 generates event data for each event that the monitoring system 100 targets for state detection, based on a message output by the monitoring unit 110 according to the monitoring result of the state of the monitoring target 190. For example, the event generation unit 121 may generate event data for each message output by each monitoring unit 110. Further, as another example, the event generation unit 121 may generate one event data based on a plurality of messages. Further, the event generation unit 121 may extract a plurality of information included in one message and generate event data for each of the information.

イベント生成部１２１は、イベントデータの生成に際し、メッセージの出力元となる監視部１１０に関わらず、共通のフォーマットを使用する。
一方で、複数の監視部１１０それぞれから出力されるメッセージについては、出力元となる監視部１１０ごとに異なるフォーマットで記載されている場合がある。そのため、イベント生成部１２１は、複数の監視部１１０それぞれから出力されるメッセージを、当該監視部１１０に対応付けられたフォーマットに基づき解析することでメッセージ中の情報を抽出し、抽出した情報に基づきイベントデータを生成してもよい。
また、１つのメッセージに対して複数の事象に関する情報が含まれる場合もある。このような場合には、イベント生成部１２１は、対象となるメッセージを、当該メッセージの出力元となる監視部１１０に対応付けられたフォーマットに基づき解析することで、複数の事象それぞれに対応する情報を個別に抽出することが可能である。この場合には、前述したように、イベント生成部１２１は、抽出した情報それぞれについてイベントデータを生成してもよい。
また、イベント生成部１２１は、複数のメッセージそれぞれから抽出した情報に基づき、当該複数のメッセージ間に関連性があることを推定または認識した場合には、当該複数のメッセージを関連付けて１つのイベントデータを生成してもよい。
なお、各監視部１１０に対応付けられたフォーマットに関する情報については、イベント生成部１２１が参照可能な記憶領域（例えば、後述するルール記憶部１２４）に記憶させておけばよい。また、監視部１１０に対応付けられたフォーマット（すなわち、監視部１１０ごとに固有のフォーマット）が、「第１のフォーマット」の一例に相当する。 The event generation unit 121 uses a common format when generating event data, regardless of the monitoring unit 110 that is the output source of the message.
On the other hand, the messages output from each of the plurality of monitoring units 110 may be described in different formats for each monitoring unit 110 that is the output source. Therefore, the event generation unit 121 extracts the information in the message by analyzing the message output from each of the plurality of monitoring units 110 based on the format associated with the monitoring unit 110, and based on the extracted information. Event data may be generated.
In addition, one message may contain information about a plurality of events. In such a case, the event generation unit 121 analyzes the target message based on the format associated with the monitoring unit 110 that is the output source of the message, thereby providing information corresponding to each of the plurality of events. Can be extracted individually. In this case, as described above, the event generation unit 121 may generate event data for each of the extracted information.
Further, when the event generation unit 121 estimates or recognizes that the plurality of messages are related based on the information extracted from each of the plurality of messages, the event generation unit 121 associates the plurality of messages with one event data. May be generated.
Information about the format associated with each monitoring unit 110 may be stored in a storage area (for example, a rule storage unit 124 described later) that can be referred to by the event generation unit 121. Further, the format associated with the monitoring unit 110 (that is, the format unique to each monitoring unit 110) corresponds to an example of the "first format".

また、イベント生成部１２１は、複数の監視部１１０それぞれから出力される一連のメッセージのうち、一部のメッセージを抽出してイベントの生成に利用してもよい。換言すると、イベント生成部１２１は、必ずしも複数の監視部１１０それぞれから出力される一連のメッセージ全てについてイベントデータを生成しなくてもよい。この場合には、イベント生成部１２１は、複数の監視部１１０それぞれから出力される一連のメッセージを所定の条件に基づきフィルタリングすることで、イベントの生成に利用するメッセージを抽出してもよい。なお、メッセージの抽出に係る条件については、イベントの生成にメッセージを利用する監視部１１０の種別や、イベントの生成対象となる事象に応じて適宜設定してもよい。 Further, the event generation unit 121 may extract a part of the series of messages output from each of the plurality of monitoring units 110 and use them for event generation. In other words, the event generation unit 121 does not necessarily have to generate event data for all the series of messages output from each of the plurality of monitoring units 110. In this case, the event generation unit 121 may extract the message to be used for event generation by filtering a series of messages output from each of the plurality of monitoring units 110 based on a predetermined condition. The conditions for extracting the message may be appropriately set according to the type of the monitoring unit 110 that uses the message for generating the event and the event for which the event is to be generated.

なお、イベント生成部１２１によるイベントの生成に係る処理については、具体的な例とあわせて詳細を別途後述する。そして、イベント生成部１２１は、生成したイベントをイベント抽出部１２３に出力する。 The details of the process related to the event generation by the event generation unit 121 will be described later together with a specific example. Then, the event generation unit 121 outputs the generated event to the event extraction unit 123.

イベント抽出部１２３は、イベント生成部１２１から生成されたイベントデータを取得する。イベント抽出部１２３は、取得した一連のイベントデータを所定の抽出条件に基づきフィルタリングすることで、少なくとも一部のイベントデータを抽出する。具体的な一例として、イベント抽出部１２３は、生成された一連のイベントデータから、所定の文字列を含むイベントデータを抽出してもよい。イベントデータの抽出に係る抽出条件を示す情報については、イベント抽出部１２３が参照可能な所定の記憶領域にあらかじめ記憶させておけばよい。例えば、ルール記憶部１２４が、上述したイベントデータの抽出に係る抽出条件を示す情報を記憶してもよい。ルール記憶部１２４は、イベント生成部１２１やイベント抽出部１２３が利用する各種情報やデータを記憶する記憶領域である。 The event extraction unit 123 acquires the event data generated from the event generation unit 121. The event extraction unit 123 extracts at least a part of the event data by filtering the acquired series of event data based on a predetermined extraction condition. As a specific example, the event extraction unit 123 may extract event data including a predetermined character string from the generated series of event data. The information indicating the extraction conditions related to the extraction of the event data may be stored in advance in a predetermined storage area that can be referred to by the event extraction unit 123. For example, the rule storage unit 124 may store information indicating extraction conditions related to the above-mentioned extraction of event data. The rule storage unit 124 is a storage area for storing various information and data used by the event generation unit 121 and the event extraction unit 123.

また、イベント抽出部１２３は、所定の抽出条件に基づき関連性を有することが推定される２以上のイベントデータを関連付けて抽出してもよい。
具体的な一例として、イベント抽出部１２３は、概要として前述した例のように、ＣＰＵの負荷の増加を示すイベントデータと、バックアップジョブ開始のイベントに関するイベントデータとを関連付けてもよい。これにより、個々のイベントデータのみでは監視対象１９０が異常な状態にあると判定されるような状況下においても、相互に関連付けられた複数のイベントデータに基づき、監視対象１９０が正常な状態にあると判定することも可能となる。 Further, the event extraction unit 123 may correlate and extract two or more event data that are presumed to have relevance based on a predetermined extraction condition.
As a specific example, the event extraction unit 123 may associate event data indicating an increase in CPU load with event data related to a backup job start event, as described above as an outline. As a result, even in a situation where it is determined that the monitored target 190 is in an abnormal state based on individual event data alone, the monitored target 190 is in a normal state based on a plurality of event data associated with each other. It is also possible to determine that.

なお、イベント抽出部１２３によるイベントデータの抽出に係る処理については、具体的な例とあわせて詳細を別途後述する。そして、イベント抽出部１２３は、抽出したイベントデータをイベント記憶部１２５に記憶させる。例えば、イベント記憶部１２５は、抽出されたイベントデータを記憶するための記憶領域である。 The details of the process related to the extraction of event data by the event extraction unit 123 will be described later together with a specific example. Then, the event extraction unit 123 stores the extracted event data in the event storage unit 125. For example, the event storage unit 125 is a storage area for storing the extracted event data.

出力制御部１２６は、イベント記憶部１２５に記憶されたイベントデータ（すなわち、イベント抽出部１２３により抽出されたイベントデータ）を読み出し、当該イベントデータに応じた情報を所定の出力先に出力する。例えば、出力制御部１２６は、読み出したイベントデータに応じた情報を所定の出力部（例えば、後述する出力部１２７）に出力させることで、ユーザに対して情報の報知を行ってもよい。また、他の一例として、出力制御部１２６は、読み出したイベントデータを、ネットワークを介して他の装置に転送してもよい。これにより、当該他の装置は、取得したイベントデータを利用して各種処理を実行することが可能となる。
また、出力制御部１２６は、イベント記憶部１２５に記憶されたイベントデータの読み出しに際し、所定の条件に応じて少なくとも一部のイベントデータを読み出してもよい。具体的な一例として、出力制御部１２６は、ユーザによる対応をうながす画面の表示のために監視対象１９０の状態として「異常」を示すイベントデータをイベント記憶部１２５から読み出してもよい。
また、出力制御部１２６は、互いに関連付けられた複数のイベントデータを読み出してもよい。この場合には、出力制御部１２６は、当該複数のイベントデータの組み合わせに応じた情報を所定の出力部に出力させてもよい。具体的な一例として、出力制御部１２６は、当該複数のイベントデータの組み合わせに基づく監視対象１９０の状態（例えば、正常または異常）に応じた情報を所定の出力部に出力させてもよい。 The output control unit 126 reads the event data stored in the event storage unit 125 (that is, the event data extracted by the event extraction unit 123), and outputs the information corresponding to the event data to a predetermined output destination. For example, the output control unit 126 may notify the user of the information by outputting the information corresponding to the read event data to a predetermined output unit (for example, the output unit 127 described later). Further, as another example, the output control unit 126 may transfer the read event data to another device via the network. As a result, the other device can execute various processes using the acquired event data.
Further, the output control unit 126 may read at least a part of the event data according to a predetermined condition when reading the event data stored in the event storage unit 125. As a specific example, the output control unit 126 may read event data indicating "abnormality" as the state of the monitoring target 190 from the event storage unit 125 in order to display a screen prompting the user to respond.
Further, the output control unit 126 may read out a plurality of event data associated with each other. In this case, the output control unit 126 may output information according to the combination of the plurality of event data to a predetermined output unit. As a specific example, the output control unit 126 may output information according to the state (for example, normal or abnormal) of the monitoring target 190 based on the combination of the plurality of event data to a predetermined output unit.

出力部１２７は、各種情報を出力するための構成であり、ユーザに対する各種情報の提示に利用される。具体的な一例として、出力部１２７は、ディスプレイ等の表示デバイスにより実現されてもよい。この場合には、出力部１２７は、各種表示情報を表示させることで、ユーザに対して情報を提示する。
また、他の一例として、出力部１２７は、音声や電子音等の音響を出力するスピーカ等の音響出力デバイスにより実現されてもよい。この場合には、出力部１２７は、音声や電信等の音響を出力することで、ユーザに対して情報を提示する。
このように、出力部１２７として適用されるデバイスは、ユーザに対して情報を提示するために利用する媒体に応じて適宜変更されてもよい。 The output unit 127 has a configuration for outputting various information, and is used for presenting various information to the user. As a specific example, the output unit 127 may be realized by a display device such as a display. In this case, the output unit 127 presents the information to the user by displaying various display information.
Further, as another example, the output unit 127 may be realized by an acoustic output device such as a speaker that outputs an acoustic sound such as a voice or an electronic sound. In this case, the output unit 127 presents information to the user by outputting sound such as voice or telegraph.
As described above, the device applied as the output unit 127 may be appropriately changed depending on the medium used for presenting the information to the user.

なお、図１に示す機能構成はあくまで一例であり、必ずしもシステム１（特に、監視システム１００）の機能構成を限定するものではない。例えば、監視システム１００の各機能構成が１つの装置により実現されてもよい。なお、この場合には、当該１つの装置が、本実施形態に係る「情報処理装置」の一例に相当する。
また、複数の装置を協働させることで、監視システム１００の各機能構成が実現されてもよい。この場合には、例えば、監視システム１００の各機能構成のうち少なくとも一部の機能構成の処理が複数の装置により実行されることで、当該処理に係る負荷が当該複数の装置に分散されてもよい。また、他の一例として、監視システム１００の各機能構成のうち一部の機能構成と他の機能構成とが、ネットワークを介して接続された互いに異なる装置により実現されてもよい。具体的な一例として、イベント生成部１２１及びイベント抽出部１２３に相当する機能構成と、出力制御部１２６に相当する機能構成と、が互いに異なる装置により実現されてもよい。 The functional configuration shown in FIG. 1 is merely an example, and does not necessarily limit the functional configuration of the system 1 (particularly, the monitoring system 100). For example, each functional configuration of the monitoring system 100 may be realized by one device. In this case, the one device corresponds to an example of the "information processing device" according to the present embodiment.
Further, each functional configuration of the monitoring system 100 may be realized by coordinating a plurality of devices. In this case, for example, even if the processing of at least a part of the functional configurations of the monitoring system 100 is executed by the plurality of devices, the load related to the processing is distributed to the plurality of devices. Good. Further, as another example, some of the functional configurations of the monitoring system 100 and other functional configurations may be realized by different devices connected via a network. As a specific example, the functional configuration corresponding to the event generation unit 121 and the event extraction unit 123 and the functional configuration corresponding to the output control unit 126 may be realized by different devices.

また、監視システム１００の各機能構成のうち一部の機能構成が、監視システム１００の外部に設けられていてもよい。具体的な一例として、複数の監視部１１０のうち少なくとも一部の監視部１１０が、監視システム１００の外部に設けられており、ネットワークを介して監視システム１００に接続されていてもよい。 Further, some of the functional configurations of the monitoring system 100 may be provided outside the monitoring system 100. As a specific example, at least a part of the monitoring units 110 among the plurality of monitoring units 110 may be provided outside the monitoring system 100 and connected to the monitoring system 100 via a network.

＜ハードウェア構成＞
図２は、本実施形態に係る監視システム１００を実現するために利用される情報処理装置２００のハードウェア構成の一例を示した図である。図２に示すように、本実施形態に係る情報処理装置２００は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）２１０と、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）２２０と、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）２３０とを含む。また、情報処理装置２００は、補助記憶装置２４０と、出力装置２５０と、入力装置２６０と、ネットワークＩ／Ｆ２７０とを含む。ＣＰＵ２１０と、ＲＯＭ２２０と、ＲＡＭ２３０と、補助記憶装置２４０と、出力装置２５０と、入力装置２６０と、ネットワークＩ／Ｆ２７０とは、バス２８０を介して相互に接続されている。 <Hardware configuration>
FIG. 2 is a diagram showing an example of the hardware configuration of the information processing apparatus 200 used to realize the monitoring system 100 according to the present embodiment. As shown in FIG. 2, the information processing device 200 according to the present embodiment includes a CPU (Central Processing Unit) 210, a ROM (Read Only Memory) 220, and a RAM (Random Access Memory) 230. The information processing device 200 includes an auxiliary storage device 240, an output device 250, an input device 260, and a network I / F 270. The CPU 210, ROM 220, RAM 230, auxiliary storage device 240, output device 250, input device 260, and network I / F 270 are connected to each other via a bus 280.

ＣＰＵ２１０は、情報処理装置２００の各種動作を制御する中央演算装置である。例えば、ＣＰＵ２１０は、情報処理装置２００全体の動作を制御してもよい。ＲＯＭ２２０は、ＣＰＵ２１０で実行可能な制御プログラムやブートプログラムなどを記憶する。ＲＡＭ２３０は、ＣＰＵ２１０の主記憶メモリであり、ワークエリア又は各種プログラムを展開するための一時記憶領域として用いられる。 The CPU 210 is a central processing unit that controls various operations of the information processing device 200. For example, the CPU 210 may control the operation of the entire information processing device 200. The ROM 220 stores a control program, a boot program, and the like that can be executed by the CPU 210. The RAM 230 is the main storage memory of the CPU 210, and is used as a work area or a temporary storage area for developing various programs.

補助記憶装置２４０は、各種データや各種プログラムを記憶する。補助記憶装置２４０は、ＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）や、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）に代表される不揮発性メモリ等のような、各種データを一時的または持続的に記憶可能な記憶デバイスにより実現される。 The auxiliary storage device 240 stores various data and various programs. The auxiliary storage device 240 is realized by a storage device that can temporarily or continuously store various data such as an HDD (Hard Disk Drive) and a non-volatile memory typified by an SSD (Solid State Drive). ..

出力装置２５０は、各種情報を出力する装置であり、ユーザに対する各種情報の提示に利用される。本実施形態では、出力装置２５０は、ディスプレイ等の表示デバイスにより実現される。出力装置２５０は、各種表示情報を表示させることで、ユーザに対して情報を提示する。ただし、他の例として、出力装置２５０は、音声や電子音等の音を出力する音響出力デバイスにより実現されてもよい。この場合には、出力装置２５０は、音声や電信等の音を出力することで、ユーザに対して情報を提示する。また、出力装置２５０として適用されるデバイスは、ユーザに対して情報を提示するために利用する媒体に応じて適宜変更されてもよい。なお、出力装置２５０が、各種情報の提示に利用される「出力部」の一例に相当する。 The output device 250 is a device that outputs various types of information, and is used for presenting various types of information to the user. In the present embodiment, the output device 250 is realized by a display device such as a display. The output device 250 presents information to the user by displaying various display information. However, as another example, the output device 250 may be realized by an acoustic output device that outputs sounds such as voice and electronic sounds. In this case, the output device 250 presents information to the user by outputting sounds such as voice and telegraph. Further, the device applied as the output device 250 may be appropriately changed depending on the medium used for presenting information to the user. The output device 250 corresponds to an example of an "output unit" used for presenting various types of information.

入力装置２６０は、ユーザからの各種指示の受け付けに利用される。本実施形態では、入力装置２６０は、マウス、キーボード、タッチパネル等の入力デバイスを含む。ただし、他の例として、入力装置２６０は、マイクロフォン等の集音デバイスを含み、ユーザが発話した音声を集音してもよい。この場合には、集音された音声に対して音響解析や自然言語処理等の各種解析処理が施されることで、この音声が示す内容がユーザからの指示として認識される。また、入力装置２６０として適用されるデバイスは、ユーザからの指示を認識する方法に応じて適宜変更されてもよい。また、入力装置２６０として複数種類のデバイスが適用されてもよい。 The input device 260 is used for receiving various instructions from the user. In the present embodiment, the input device 260 includes an input device such as a mouse, a keyboard, and a touch panel. However, as another example, the input device 260 may include a sound collecting device such as a microphone and collect sound spoken by the user. In this case, the collected voice is subjected to various analysis processes such as acoustic analysis and natural language processing, so that the content indicated by the voice is recognized as an instruction from the user. Further, the device applied as the input device 260 may be appropriately changed depending on the method of recognizing the instruction from the user. Further, a plurality of types of devices may be applied as the input device 260.

ネットワークＩ／Ｆ２７０は、外部の装置とのネットワークを介した通信に利用される。なお、ネットワークＩ／Ｆ２７０として適用されるデバイスは、通信経路の種別や適用される通信方式に応じて適宜変更されてもよい。 The network I / F270 is used for communication with an external device via a network. The device applied as the network I / F270 may be appropriately changed according to the type of communication path and the applicable communication method.

ＣＰＵ２１０は、ＲＯＭ２２０又は補助記憶装置２４０に記憶されたプログラムをＲＡＭ２３０に展開し、このプログラムを実行することで、図１に示す監視システム１００の機能構成や、図３に示すフローチャートで示された処理が実現される。 The CPU 210 expands the program stored in the ROM 220 or the auxiliary storage device 240 into the RAM 230, and by executing this program, the functional configuration of the monitoring system 100 shown in FIG. 1 and the processing shown in the flowchart shown in FIG. Is realized.

＜処理＞
図３を参照して、監視システム１００の処理の一例について説明する。図３は、本実施形態に係る監視システム１００の処理の一例を示したフローチャートである。 <Processing>
An example of processing of the monitoring system 100 will be described with reference to FIG. FIG. 3 is a flowchart showing an example of processing of the monitoring system 100 according to the present embodiment.

Ｓ１０１において、イベント生成部１２１は、複数の監視部１１０それぞれから、監視対象１９０の状態の監視結果に応じたメッセージを取得する。 In S101, the event generation unit 121 acquires messages from each of the plurality of monitoring units 110 according to the monitoring result of the state of the monitoring target 190.

Ｓ１０２において、イベント生成部１２１は、各監視部１１０から取得したメッセージに基づき、監視システム１００が状態検出の対象とする事象ごとにイベントデータを生成する。 In S102, the event generation unit 121 generates event data for each event for which the monitoring system 100 targets state detection, based on the message acquired from each monitoring unit 110.

Ｓ１０３において、イベント抽出部１２３は、イベント生成部１２１により生成された一連のイベントデータを所定の抽出条件に基づきフィルタリングすることで、少なくとも一部のイベントデータを抽出する。具体的な一例として、イベント抽出部１２３は、生成された一連のイベントデータから、所定の文字列を含むイベントデータを抽出してもよい。また、イベント抽出部１２３は、所定の抽出条件に基づき関連性を有することが推定される２以上のイベントデータを関連付けて抽出してもよい。 In S103, the event extraction unit 123 extracts at least a part of the event data by filtering a series of event data generated by the event generation unit 121 based on a predetermined extraction condition. As a specific example, the event extraction unit 123 may extract event data including a predetermined character string from the generated series of event data. Further, the event extraction unit 123 may correlate and extract two or more event data that are presumed to have relevance based on a predetermined extraction condition.

Ｓ１０４において、出力制御部１２６は、イベント抽出部１２３により抽出されたイベントデータに応じた情報が所定の出力先に出力されるように制御する。例えば、出力制御部１２６は、イベント抽出部１２３により抽出されたイベントを出力部１２７に出力させることで、ユーザに対して情報の報知を行ってもよい。また、他の一例として、出力制御部１２６は、読み出したイベントデータを、ネットワークを介して他の装置に転送してもよい。
また、出力制御部１２６は、互いに関連付けられた複数のイベントデータを読み出してもよい。この場合には、出力制御部１２６は、当該複数のイベントデータの組み合わせに応じた情報を所定の出力先に出力してもよい。 In S104, the output control unit 126 controls so that the information corresponding to the event data extracted by the event extraction unit 123 is output to a predetermined output destination. For example, the output control unit 126 may notify the user of information by causing the output unit 127 to output the event extracted by the event extraction unit 123. Further, as another example, the output control unit 126 may transfer the read event data to another device via the network.
Further, the output control unit 126 may read out a plurality of event data associated with each other. In this case, the output control unit 126 may output information according to the combination of the plurality of event data to a predetermined output destination.

監視システム１００は、図３に示す一連の処理を、所定の契機ごとに実行する。具体的な一例として、監視システム１００は、所定の期間ごとに定期的に、図３に示す一連の処理を実行してもよい。また、他の一例として、監視システム１００は、所定のトリガに連動して、図３に示す一連の処理を実行してもよい。 The monitoring system 100 executes a series of processes shown in FIG. 3 at predetermined triggers. As a specific example, the monitoring system 100 may execute a series of processes shown in FIG. 3 periodically at predetermined intervals. Further, as another example, the monitoring system 100 may execute a series of processes shown in FIG. 3 in conjunction with a predetermined trigger.

＜実施例＞
続いて、監視システム１００の実施例として、監視部１１０から出力されるメッセージに基づく、イベントデータの生成及び抽出に係る処理の一例について、具体的な例を挙げて説明する。なお、本実施例では、複数の監視部１１０として、Ｚａｂｂｉｘと称されるＯＳＳ（ＯｐｅｎＳｏｕｒｃｅＳｏｆｔｗａｒｅ）の監視ツールと、Ｐｒｏｍｅｔｈｅｕｓと称されるＯＳＳの監視ツールと、が利用される場合の一例について説明する。 <Example>
Subsequently, as an embodiment of the monitoring system 100, an example of processing related to generation and extraction of event data based on a message output from the monitoring unit 110 will be described with specific examples. In this embodiment, an example will be described in which an OSS (Open Source Software) monitoring tool called Zabbix and an OSS monitoring tool called Prometheus are used as the plurality of monitoring units 110. To do.

監視システム１００では、複数の監視部１１０それぞれから出力されるメッセージに基づきイベントデータが生成され、当該イベントデータに基づき監視対象１９０の状態の監視が行われる。具体的には、監視システム１００では、所望の監視対象１９０の異常な動作を検出する場合に、上記イベントデータの規定に係るフォーマットに基づき、所望の事象に対応するイベントデータを抽出することが可能である。そのため、監視システム１００に依れば、各監視部１１０が出力するメッセージのフォーマットに依存せずに、所望の事象に対応するイベントデータを抽出することが可能である。 In the monitoring system 100, event data is generated based on the messages output from each of the plurality of monitoring units 110, and the state of the monitored target 190 is monitored based on the event data. Specifically, in the monitoring system 100, when detecting an abnormal operation of a desired monitoring target 190, it is possible to extract event data corresponding to a desired event based on the format according to the above-mentioned event data regulation. Is. Therefore, according to the monitoring system 100, it is possible to extract event data corresponding to a desired event without depending on the format of the message output by each monitoring unit 110.

（イベントデータの生成）
ここで、各監視部１１０が出力するメッセージのフォーマットに依存せずに、所望の事象に対応するイベントデータの抽出を可能とするための、当該イベントデータの生成に係る処理の一例について具体的な例を挙げて説明する。 (Generation of event data)
Here, a specific example of a process related to the generation of the event data in order to enable the extraction of the event data corresponding to the desired event without depending on the format of the message output by each monitoring unit 110 is specified. An example will be given.

Ｚａｂｂｉｘでは、例えば、サーバの再起動を示すメッセージを抽出する場合に、当該抽出に係る条件式を以下のように表現する。 In Zabbix, for example, when extracting a message indicating the restart of the server, the conditional expression related to the extraction is expressed as follows.

また、Ｐｒｏｍｅｔｈｅｕｓでは、例えば、ＣＰＵ使用率が９０％を超過したことを示すメッセージを抽出する場合に、当該抽出に係る条件式を以下のように表現する。 Further, in Prometheus, for example, when extracting a message indicating that the CPU usage rate exceeds 90%, the conditional expression related to the extraction is expressed as follows.

これらの監視ツールにおいては、特定の条件を満たす場合に異常ではないものとしてメッセージの出力を抑制するためには、例えば、上記に示した監視ツールごとの条件式の記述方法を理解したうえで、制御に係るスクリプト等を記述または修正することとなる。すなわち、監視ツールごとに、当該監視ツールで用いられている項目名や構文の理解がユーザに求められる。そのため、複数の監視ツールが利用される状況下では、ユーザに対して、監視ツールごとに使用されている項目名や構文の理解が求められることとなり、ユーザへの負担が増加する場合がある。 In these monitoring tools, in order to suppress the output of a message as if it is not abnormal when a specific condition is satisfied, for example, after understanding the description method of the conditional expression for each monitoring tool shown above, Scripts related to control will be described or modified. That is, for each monitoring tool, the user is required to understand the item names and syntax used in the monitoring tool. Therefore, in a situation where a plurality of monitoring tools are used, the user is required to understand the item name and syntax used for each monitoring tool, which may increase the burden on the user.

これに対して、監視ツール群とは別にフィルタを用いる場合には、監視ツールが出力するメッセージに対してフィルタにより状態判定の条件を追加することが可能である。この場合には、当該条件については、メッセージの構成とフィルタツールの構文とにより記述することが可能であり、この際に、メッセージの出力元となる監視ツールにおける条件式の構文の理解は求められない。 On the other hand, when a filter is used separately from the monitoring tool group, it is possible to add a condition for status determination by the filter to the message output by the monitoring tool. In this case, the condition can be described by the message structure and the syntax of the filter tool. At this time, it is required to understand the syntax of the conditional expression in the monitoring tool that is the output source of the message. Absent.

例えば、図４は、監視ツールごとに出力されるメッセージの一例を示した図である。
具体的には、図４（ａ）は、サーバの再起動が行われた場合に、Ｚａｂｂｉｘが出力するメッセージの一例を示している。「ｄａｔｅ」は、メッセージが出力された日時を示すタイムスタンプである。「ｈｏｓｔ」は、監視対象となるサーバのホスト名を示している。「ｓｕｂｊｅｃｔ」は、メッセージの種別を示している。「ｍｅｓｓａｇｅ」は、メッセージの本文を示している。「ｓｅｖｅｒｉｔｙ」は、重要度や緊急度の指標を示している。
また、図４（ｂ）は、ＣＰＵ使用率が９０％を超過した場合に、Ｐｒｏｍｅｔｈｅｕｓが出力するメッセージの一例を示している。「ｄａｔｅ」は、メッセージが出力された日時を示すタイムスタンプである。「ｎｏｄｅ」は、監視対象となるサーバのホスト名を示している。「ｓｕｍｍａｒｙ」は、メッセージの種別を示している。「ｓｔａｔｕｓ」は、監視対象の状態を示している。「ｓｅｖｅｒｉｔｙ」は、重要度や緊急度の指標を示している。 For example, FIG. 4 is a diagram showing an example of a message output for each monitoring tool.
Specifically, FIG. 4A shows an example of a message output by Zabbix when the server is restarted. “Date” is a time stamp indicating the date and time when the message was output. “Host” indicates the host name of the server to be monitored. “Subject” indicates the type of message. "Message" indicates the body of the message. "Severity" indicates an index of importance or urgency.
Further, FIG. 4B shows an example of a message output by Prometheus when the CPU usage rate exceeds 90%. “Date” is a time stamp indicating the date and time when the message was output. “Node” indicates the host name of the server to be monitored. "Summary" indicates the type of message. “Status” indicates the status of the monitoring target. "Severity" indicates an index of importance or urgency.

また、図５は、監視ツールが出力するメッセージに対してフィルタにより状態判定の条件を追加する場合のスクリプトの一例を示した図である。具体的には、図５に示す例は、図４に例示したメッセージの出力を前提として、各メッセージが出力された時刻が２３時から２４時の間の場合には、正常な動作と判断する条件を規定したスクリプトの一例を示している。なお、図５はあくまで一例であり、利用するフィルタツールにおける構文に応じて少なくとも一部の記述方法が適宜変更されてもよい。 Further, FIG. 5 is a diagram showing an example of a script in which a condition for determining a state is added by a filter to a message output by a monitoring tool. Specifically, the example shown in FIG. 5 presupposes the output of the message illustrated in FIG. 4, and if the time when each message is output is between 23:00 and 24:00, the condition for determining normal operation is set. An example of the specified script is shown. Note that FIG. 5 is just an example, and at least a part of the description method may be appropriately changed depending on the syntax of the filter tool to be used.

具体的には、図５に示す例では、監視対象となるサーバのホスト名が、各ツールにおける当該ホスト名を示す項目（それぞれ「ｈｏｓｔ」及び「ｎｏｄｅ」）から取得されている。また、メッセージの種別を表す情報（文字列）が、各ツールにおける当該メッセージの種別を示す項目（それぞれ「ｓｕｂｊｅｃｔ」及び「ｓｕｍｍａｒｙ」）から取得されている。以上のようにして取得された情報が所定の条件を満たす場合に、さらに、メッセージが出力された日時を示す情報が、当該日時を示す項目（いずれも「ｄａｔｅ」）から取得される。そのうえで、メッセージが出力された時間が２３時から２４時の間か否かに応じて、正常及び異常のいずれかが判定されている。
なお、図５を参照するとわかるように、項目名やメッセージの内容については監視ツールごとに異なる可能性があるが、条件判定のための条件式については各監視ツールに依存せずに記述することが可能である。 Specifically, in the example shown in FIG. 5, the host name of the server to be monitored is acquired from the items (“host” and “node”, respectively) indicating the host name in each tool. In addition, information (character string) indicating the type of the message is acquired from the items (“subject” and “summary”, respectively) indicating the type of the message in each tool. When the information acquired as described above satisfies a predetermined condition, information indicating the date and time when the message was output is further acquired from the item indicating the date and time (both are "date"). Then, either normal or abnormal is determined depending on whether or not the time when the message is output is between 23:00 and 24:00.
As can be seen by referring to FIG. 5, the item name and the content of the message may differ depending on the monitoring tool, but the conditional expression for determining the condition should be described without depending on each monitoring tool. Is possible.

本実施形態に係る監視システム１００では、上述した特性を利用することで、監視ツールごとに固有のルールに依存せずに、各監視ツールが監視結果に応じて出力する情報それぞれを、共通のルールに基づき一元的に管理可能とする。 In the monitoring system 100 according to the present embodiment, by utilizing the above-mentioned characteristics, the information output by each monitoring tool according to the monitoring result can be set to a common rule without depending on the rules unique to each monitoring tool. It will be possible to manage centrally based on.

具体的には、イベント生成部１２１は、上述した特性を利用することで、各監視部１１０から出力されるメッセージから所望の情報を抽出し、所定のフォーマットに基づき抽出した情報を利用してイベントデータを生成する。なお、イベントデータの規定に係るフォーマットが、「第２のフォーマット」の一例に相当する。 Specifically, the event generation unit 121 extracts desired information from the message output from each monitoring unit 110 by using the above-mentioned characteristics, and uses the information extracted based on a predetermined format to perform an event. Generate data. The format according to the definition of event data corresponds to an example of the "second format".

以下にイベントデータの一例について、（Ａ）〜（Ｄ）として示す種別ごとに列挙する。
（Ａ）履歴に関するメッセージで何らかの事象が発生したことを表すもの
・バックアップジョブが開始した
・ソフトウェアで特定のエラーが発生した
・ネットワーク機器のポートの接続が切断された
・サーバが再起動した
・クラウドストレージのリソースが追加された
（Ｂ）履歴に関するメッセージでユーザが何らかの操作を行ったことを表すもの
・管理者がサーバにリモートログインした
・管理者がユーザアカウント作成操作を実行した
・利用者がクラウドにネットワーク追加操作を実行した
・利用者がクラウド上のサーバ削除操作を実行した
（Ｃ）性能や状態に関するメッセージで事前に定義した条件に該当したことを表すもの
・ＣＰＵ使用率が９０％を超えた
・Ｗｅｂサーバへの接続が３回連続で失敗した
・バックアップジョブの実行が３時間を超過しても続いている
・プロセスが実行状態から停止状態に変化した
（Ｄ）性能や状態に関するメッセージで現在の状態を表すもの
・サーバが現在停止中である
・プロセスが現在停止状態である
・現在のＣＰＵ使用率が２０％である
・バックアップジョブが実行中である An example of event data is listed below for each type shown as (A) to (D).
(A) A message related to history indicating that something has happened-A backup job has started-A specific error has occurred in the software-The port of the network device has been disconnected-The server has restarted-The cloud Storage resource added (B) History message indicating that the user has performed some operation-The administrator has remotely logged in to the server-The administrator has executed the user account creation operation-The user is in the cloud -The user executed the server deletion operation on the cloud. (C) The message regarding the performance and status indicates that the conditions defined in advance were met.-The CPU usage rate exceeds 90%.・ Connection to the Web server failed 3 times in a row ・ Backup job execution continues even after 3 hours ・ Process changed from running state to stopped state (D) Message about performance and status Represents the current state-The server is currently stopped-The process is currently stopped-The current CPU usage is 20% -The backup job is running

また、イベント抽出部１２３は、イベントデータの生成に係るフォーマットに応じた所定の抽出条件に基づき、生成された一連のイベントデータから、少なくとも一部のイベントデータを抽出する。
なお、前述したように、イベント抽出部１２３は、所定の抽出条件に基づき関連性を有することが推定される２以上のイベントデータを関連付けて抽出してもよい。このように複数のイベントデータが関連付けられることで、当該複数のイベントデータに基づき、監視対象１９０の状態をより正確に判定することが可能となる。 In addition, the event extraction unit 123 extracts at least a part of the event data from the generated series of event data based on the predetermined extraction conditions according to the format related to the generation of the event data.
As described above, the event extraction unit 123 may correlate and extract two or more event data that are presumed to have relevance based on predetermined extraction conditions. By associating a plurality of event data in this way, it is possible to more accurately determine the state of the monitored target 190 based on the plurality of event data.

以下に、イベントデータの抽出に係る条件と、抽出結果に応じたる監視対象１９０の状態の判定と、の一例（換言すると、フィルタの適用例）について、特に２以上のイベントデータを組み合わせて利用する場合に着目して具体的な例を挙げて説明する。なお、以下に示す例では、上述した（Ａ）〜（Ｄ）のイベントデータのうち２以上のイベントデータを組み合わせる場合に着目して、対応する監視対象１９０の状態の判定方法の一例について説明する。 Below, for an example (in other words, an application example of a filter) of the conditions related to the extraction of event data and the determination of the state of the monitored object 190 according to the extraction result, in particular, two or more event data are used in combination. A specific example will be given to explain by focusing on the case. In the example shown below, an example of a method for determining the state of the corresponding monitoring target 190 will be described focusing on the case where two or more event data out of the above-mentioned event data (A) to (D) are combined. ..

−フィルタの適用例１
（Ａ）として示すイベントデータが「異常」な状態を示す場合においても、（Ｂ）として示すイベントデータとの組み合わせに応じて「正常」な状態にあると判定されてもよい。
具体的な一例として、「ネットワーク機器のポートの接続が切れた」ことを示すイベントに対応するイベントデータは、単体では監視対象１９０が「異常」な状態にあることを示している。一方で、過去３分以内に「クラウドでユーザがネットワーク削除操作を行った」ことを示すイベントに対応するイベントデータが生成されている場合には、当該操作に起因してネットワーク機器のポートの接続が切れたものと推定することが可能である。すなわち、この場合には、監視対象１９０が「正常」な状態にあると判定することが可能である。 -Filter application example 1
Even when the event data shown as (A) shows an "abnormal" state, it may be determined that the event data is in the "normal" state according to the combination with the event data shown as (B).
As a specific example, the event data corresponding to the event indicating that "the port of the network device has been disconnected" indicates that the monitored target 190 is in an "abnormal" state by itself. On the other hand, if event data corresponding to the event indicating that "the user has performed a network deletion operation in the cloud" has been generated within the past 3 minutes, the connection of the network device port is caused by the operation. It is possible to presume that has expired. That is, in this case, it is possible to determine that the monitored object 190 is in the "normal" state.

−フィルタの適用例２
（Ｃ）として示すイベントデータが「異常」な状態を示す場合においても、（Ａ）として示すイベントデータとの組み合わせに応じて「正常」な状態にあると判定されてもよい。
具体的な一例として、「サーバのＣＰＵ使用率が９０％を超えた」ことを示すイベントに対応するイベントデータは、単体では監視対象１９０が「異常」な状態にあることを示している。一方で、過去１０分以内に「サーバが再起動された」ことを示すイベントに対応するイベントデータが生成されている場合には、当該再起動に起因してサーバのＣＰＵ使用率が９０％を超えたものと推定することが可能である。すなわち、この場合には、監視対象１９０が「正常」な状態にあると判定することが可能である。 -Filter application example 2
Even when the event data shown as (C) shows an "abnormal" state, it may be determined that the event data is in the "normal" state according to the combination with the event data shown as (A).
As a specific example, the event data corresponding to the event indicating that "the CPU usage rate of the server has exceeded 90%" indicates that the monitored target 190 is in an "abnormal" state by itself. On the other hand, if event data corresponding to the event indicating "the server has been restarted" has been generated within the past 10 minutes, the CPU usage rate of the server will be 90% due to the restart. It is possible to presume that it has exceeded. That is, in this case, it is possible to determine that the monitored object 190 is in the "normal" state.

−フィルタの適用例３
（Ａ）として示すイベントデータが「正常」な状態を示す場合においても、（Ｄ）として示すイベントデータとの組み合わせに応じて「異常」な状態にあると判定されてもよい。
具体的な一例として、「サーバ上で特定のエラーが発生した」ことを示すイベントに対応するイベントデータは、当該エラーが直ちに影響を及ぼさない場合には「正常」な状態にあると判定することが可能である。一方で、当該サーバが依存している他のサーバについて、直近で「サーバが停止中である」ことを示すイベントに対応するイベントデータが生成されている場合には、上記エラーがシステム全体に影響を及ぼす可能性がある。すなわち、この場合には、監視対象１９０が「異常」な状態にあると判定されてもよい。 -Filter application example 3
Even when the event data shown as (A) shows a "normal" state, it may be determined that the event data is in an "abnormal" state depending on the combination with the event data shown as (D).
As a specific example, the event data corresponding to the event indicating that "a specific error has occurred on the server" is determined to be in the "normal" state if the error does not have an immediate effect. Is possible. On the other hand, if event data corresponding to the event indicating "the server is stopped" is generated for the other server on which the server depends, the above error affects the entire system. May affect. That is, in this case, it may be determined that the monitored object 190 is in an "abnormal" state.

−フィルタの適用例４
（Ｃ）として示すイベントデータが「異常」な状態を示す場合においても、（Ｂ）として示すイベントデータとの組み合わせに応じて「正常」な状態にあると判定されてもよい。
具体的な一例として、「プロセスが起動状態から停止状態に変化した」ことを示すイベントに対応するイベントデータは、単体では監視対象１９０が「異常」な状態にあることを示している。一方で、過去３分以内に「利用者がサーバ追加操作を行った」ことを示すイベントに対応するイベントデータが生成されている場合には、当該サーバ追加操作に起因してプロセスが起動状態から停止状態に変化したものと推定することが可能である。すなわち、この場合には、監視対象１９０が「正常」な状態にあると判定することが可能である。 -Filter application example 4
Even when the event data shown as (C) shows an "abnormal" state, it may be determined that the event data is in the "normal" state according to the combination with the event data shown as (B).
As a specific example, the event data corresponding to the event indicating that "the process has changed from the started state to the stopped state" indicates that the monitored target 190 is in the "abnormal" state by itself. On the other hand, if event data corresponding to the event indicating that "the user has performed the server addition operation" is generated within the past 3 minutes, the process is started from the started state due to the server addition operation. It is possible to presume that the state has changed to a stopped state. That is, in this case, it is possible to determine that the monitored object 190 is in the "normal" state.

−フィルタの適用例５
（Ｂ）として示すイベントデータが「異常」な状態を示す場合においても、（Ｄ）として示すイベントデータとの組み合わせに応じて「正常」な状態にあると判定されてもよい。
具体的な一例として、「管理者がＤＢサーバにリモートログインした」ことを示すイベントに対応するイベントデータは、ＤＢサーバへのリモートログインが運用ルール上禁止している場合には、単体では監視対象１９０が「異常」な状態にあることを示している。一方で、直近で「作業用サーバが停止中である」場合に、例外としてＤＢサーバへのリモートログインが運用ルール上許可されている場合もある。このような場合には、直近で「作業用サーバが停止中である」ことを示すイベントに対応するイベントデータが生成されている場合には、監視対象１９０が「正常」な状態にあると判定することが可能である。 -Filter application example 5
Even when the event data shown as (B) shows an "abnormal" state, it may be determined that the event data is in the "normal" state according to the combination with the event data shown as (D).
As a specific example, the event data corresponding to the event indicating that "the administrator has remotely logged in to the DB server" is monitored by itself when remote login to the DB server is prohibited by the operation rules. It indicates that 190 is in an "abnormal" state. On the other hand, in the case where "the work server is stopped" most recently, remote login to the DB server may be permitted as an exception according to the operation rules. In such a case, if the event data corresponding to the event indicating that "the work server is stopped" is generated most recently, it is determined that the monitored target 190 is in the "normal" state. It is possible to do.

−フィルタの適用例６
（Ｃ）として示すイベントデータが「異常」な状態を示す場合においても、（Ｄ）として示すイベントデータとの組み合わせに応じて「正常」な状態にあると判定されてもよい。
具体的な一例として、「サーバのメモリ使用率が８０％を超えた」ことを示すイベントに対応するイベントデータは、単体では監視対象１９０が「異常」な状態にあることを示している。一方で、直近で「バックアップジョブが実行中である」ことを示すイベントに対応するイベントデータが生成されている場合には、当該バックアップジョブの実行が起因してサーバのメモリ使用率が８０％を超えたものと推定することが可能である。すなわち、この場合には、監視対象１９０が「正常」な状態にあると判定することが可能である。 -Filter application example 6
Even when the event data shown as (C) shows an "abnormal" state, it may be determined that the event data is in the "normal" state according to the combination with the event data shown as (D).
As a specific example, the event data corresponding to the event indicating that "the memory usage rate of the server exceeds 80%" indicates that the monitored target 190 is in an "abnormal" state by itself. On the other hand, if event data corresponding to the event indicating that "the backup job is being executed" has been generated most recently, the memory usage rate of the server will be 80% due to the execution of the backup job. It is possible to presume that it has exceeded. That is, in this case, it is possible to determine that the monitored object 190 is in the "normal" state.

また、あるイベントが発生した場合に、条件判定のために、当該イベントに対応するイベントデータに対して、過去に発生した別のイベントに対応するイベントデータを関連付けるための条件の一例を以下に示す。
・イベントの発生タイミングを基点として、過去の一定期間内に発生した所定の種類のイベントの有無を検索する
・イベントの発生タイミングを基点として、以降の一定期間内に発生した所定の種類のイベントの有無を監視する
・（Ｄ）に示す状態にあることを示すイベントデータのうち最新のものを保持する
・（Ｃ）に示す状態に変化したことを示すイベントデータのうち最新のものを保持する
なお、上記はあくまで一例であり、必ずしも所望のイベントに対応するイベントデータに対して、過去に発生した別のイベントに対応するイベントデータを関連付ける方法を限定するものではない。 In addition, when a certain event occurs, an example of the condition for associating the event data corresponding to another event that occurred in the past with the event data corresponding to the event for condition determination is shown below. ..
-Search for the presence or absence of a predetermined type of event that occurred within a certain period in the past based on the event occurrence timing-For a predetermined type of event that occurred within a certain period thereafter based on the event occurrence timing Monitor the presence / absence ・ Hold the latest event data indicating that the state is shown in (D) ・ Hold the latest event data indicating that the state has changed to the state shown in (C) , The above is only an example, and does not necessarily limit the method of associating the event data corresponding to another event that has occurred in the past with the event data corresponding to the desired event.

＜補足＞
実施形態として上述した例はあくまで一例であり、必ずしも監視システム１００を実現するための構成や、当該監視システム１００の処理を限定するものではない。すなわち、上述した技術思想を逸脱しない範囲で、監視システム１００の構成や処理の一部が適宜変更されてもよい。 <Supplement>
The above-mentioned example as an embodiment is merely an example, and does not necessarily limit the configuration for realizing the monitoring system 100 and the processing of the monitoring system 100. That is, a part of the configuration and processing of the monitoring system 100 may be appropriately changed without departing from the above-mentioned technical idea.

例えば、実施形態として上述した例では、監視部１１０から出力される情報として主にメッセージ（文字情報）を想定して、当該メッセージに基づきイベントデータを生成し、当該イベントデータに基づき監視対象１９０の状態を判定する場合に着目して説明した。
一方で、監視部１１０が出力する情報に応じて監視対象１９０の状態を判定することが可能であれば、当該情報の種別は特に限定されず、当該情報に基づきイベントデータを生成して監視対象１９０の状態の判定に利用することが可能である。 For example, in the above-described example as the embodiment, a message (character information) is mainly assumed as the information output from the monitoring unit 110, event data is generated based on the message, and the monitoring target 190 is based on the event data. The explanation focused on the case of determining the state.
On the other hand, if the state of the monitoring target 190 can be determined according to the information output by the monitoring unit 110, the type of the information is not particularly limited, and event data is generated based on the information to be monitored. It can be used to determine the state of 190.

具体的な一例として、監視部１１０として各種センサが適用されてもよい。この場合には、イベント生成部１２１は、例えば、監視部１１０（センサ）から所定の状態の検知結果に応じて出力される信号に基づき、イベントデータを生成してもよい。より具体的な一例として、イベント生成部１２１は、監視対象１９０の状態に応じて監視部１１０から出力される信号のレベルや位相が変化する場合には、当該信号のレベルや位相が示す当該監視対象１９０の状態に応じたイベントデータを生成すればよい。 As a specific example, various sensors may be applied as the monitoring unit 110. In this case, the event generation unit 121 may generate event data based on, for example, a signal output from the monitoring unit 110 (sensor) according to the detection result of a predetermined state. As a more specific example, when the level or phase of the signal output from the monitoring unit 110 changes according to the state of the monitoring target 190, the event generation unit 121 indicates the monitoring indicated by the level or phase of the signal. Event data may be generated according to the state of the target 190.

また、他の一例として、監視部１１０が、監視対象１９０の状態の監視結果に応じたコードを、監視結果に応じた情報として出力してもよい。このような場合においても、イベント生成部１２１は、例えば、各コードがどの状態に対応しているかが既知であれば、上記コードに基づき、監視部１１０による監視結果を特定してイベントデータを生成することが可能である。 Further, as another example, the monitoring unit 110 may output a code corresponding to the monitoring result of the state of the monitoring target 190 as information according to the monitoring result. Even in such a case, the event generation unit 121 generates event data by specifying the monitoring result by the monitoring unit 110 based on the above code, for example, if it is known which state each code corresponds to. It is possible to do.

また、他の一例として、監視部１１０が、監視結果に応じた情報として、監視対象１９０の状態の監視結果を符号化したデータを出力してもよい。このような場合においても、イベント生成部１２１は、例えば、情報の符号化方法が既知であれば、上記符号化後のデータに基づき、監視部１１０による監視結果を特定してイベントデータを生成することが可能である。
具体的な一例として、イベント生成部１２１は、監視対象１９０の状態の監視結果として取得される情報の一覧が既知であれば、これらの情報それぞれを既知の符号化方法に基づき符号化したデータをあらかじめ保持しておいてもよい。そのうえで、イベント生成部１２１は、監視部１１０から監視結果に応じて出力される符号化後のデータを、あらかじめ保持しておいた符号化後のデータそれぞれと照合することで、監視対象１９０の状態を特定してもよい。
また、他の一例として、イベント生成部１２１は、監視部１１０から監視結果に応じて出力される符号化後のデータを、既知の符号化方法に対応する復号方法により復号し、復号後の情報に基づき監視対象１９０の状態を特定してもよい。
これにより、イベント生成部１２１は、監視部１１０から監視結果に応じた情報として符号化後のデータを取得する場合においても、当該監視結果に応じたイベントデータを生成することが可能となる。 Further, as another example, the monitoring unit 110 may output data in which the monitoring result of the state of the monitoring target 190 is encoded as information according to the monitoring result. Even in such a case, for example, if the information coding method is known, the event generation unit 121 identifies the monitoring result by the monitoring unit 110 and generates event data based on the coded data. It is possible.
As a specific example, if the list of information acquired as the monitoring result of the state of the monitoring target 190 is known, the event generation unit 121 obtains data in which each of the information is encoded based on a known coding method. It may be held in advance. Then, the event generation unit 121 collates the encoded data output from the monitoring unit 110 according to the monitoring result with each of the encoded data held in advance, whereby the state of the monitoring target 190 is set. May be specified.
Further, as another example, the event generation unit 121 decodes the encoded data output from the monitoring unit 110 according to the monitoring result by a decoding method corresponding to a known encoding method, and the decoded information. The state of the monitoring target 190 may be specified based on the above.
As a result, the event generation unit 121 can generate event data according to the monitoring result even when the encoded data is acquired from the monitoring unit 110 as information according to the monitoring result.

また、図１に示す例では複数の監視部１１０それぞれにより互いに異なる監視条件に応じた監視対象１９０の状態の監視が行われている。一方で、複数の監視部１１０の役割を１つの装置が担ってもよい。この場合には、当該装置は、互いに異なる監視条件それぞれに基づく監視対象１９０の状態の監視結果に応じた情報をそれぞれ出力すればよい。 Further, in the example shown in FIG. 1, the state of the monitored object 190 is monitored by each of the plurality of monitoring units 110 according to different monitoring conditions. On the other hand, one device may play the role of a plurality of monitoring units 110. In this case, the apparatus may output information according to the monitoring result of the state of the monitored object 190 based on the different monitoring conditions.

また、上述した監視システム１００の各機能構成を実現するためのコンピュータプログラムを作製し、パーソナルコンピュータ等に実装することが可能である。また、このようなコンピュータプログラムが格納された、コンピュータで読み取り可能な記録媒体も提供することが可能である。記録媒体としては、例えば、磁気ディスク、光ディスク、光磁気ディスク、フラッシュメモリ等が挙げられる。また、上記のコンピュータプログラムは、記録媒体を用いずに、例えばネットワークを介して配信されてもよい。また、上記コンピュータプログラムを実行させるコンピュータの数は特に限定されない。例えば、上記コンピュータプログラムを、複数のコンピュータ（例えば、複数のサーバ等）が互いに連携して実行してもよい。また、他の一例として、上記コンピュータが複数のプロセッサを備える場合には、上記コンピュータプログラムを、この複数のプロセッサが互いに連携して実行してもよい。 Further, it is possible to create a computer program for realizing each functional configuration of the monitoring system 100 described above and implement it on a personal computer or the like. It is also possible to provide a computer-readable recording medium in which such a computer program is stored. Examples of the recording medium include magnetic disks, optical disks, magneto-optical disks, flash memories, and the like. Further, the above computer program may be distributed via, for example, a network without using a recording medium. Further, the number of computers for executing the above computer program is not particularly limited. For example, the computer program may be executed by a plurality of computers (for example, a plurality of servers) in cooperation with each other. Further, as another example, when the computer includes a plurality of processors, the computer program may be executed by the plurality of processors in cooperation with each other.

＜むすび＞
以上説明したように、本実施形態に係る情報処理システム（例えば、監視システム１００）は、互いに異なる監視条件に基づき状態の監視を行う複数の監視手段それぞれから当該監視の結果に応じたメッセージを取得する。また、上記情報処理システムは、上記複数の監視手段のうち少なくとも１以上の監視手段から取得された上記メッセージに基づきイベントデータを生成する。また、上記情報処理システムは、所定の抽出条件に基づき上記イベントデータを抽出する。そのうえで、上記情報処理システムは、上記イベントデータの抽出結果に応じた出力情報の出力を制御する。 <Conclusion>
As described above, the information processing system (for example, the monitoring system 100) according to the present embodiment acquires a message according to the result of the monitoring from each of a plurality of monitoring means for monitoring the state based on different monitoring conditions. To do. In addition, the information processing system generates event data based on the message acquired from at least one of the plurality of monitoring means. In addition, the information processing system extracts the event data based on predetermined extraction conditions. Then, the information processing system controls the output of output information according to the extraction result of the event data.

以上のような構成により、各監視手段が監視結果に応じて出力する情報を、監視手段ごとに固有のルールに依存せずに、共通のルールに基づき一元的に管理することが可能となる。これにより、各監視手段が監視結果に応じて出力する情報に基づく監視対象の状態を、共通のルールにより規定された条件（換言すると、フィルタ）に基づき判定することが可能となる。すなわち、本実施形態に係る情報処理システムに依れば、複数の監視手段を利用した監視対象の状態の監視をより好適な態様で実現することが可能となる。 With the above configuration, it is possible to centrally manage the information output by each monitoring means according to the monitoring result based on the common rules without depending on the rules unique to each monitoring means. As a result, the state of the monitoring target based on the information output by each monitoring means according to the monitoring result can be determined based on the conditions (in other words, the filter) defined by the common rule. That is, according to the information processing system according to the present embodiment, it is possible to realize monitoring of the state of the monitored object using a plurality of monitoring means in a more preferable manner.

また、上述した特性から、本実施形態に係る情報処理システムでは、互いに異なる複数の監視手段による監視結果を、共通のルールに基づき組み合わせることで、監視対象の状態を判定することも可能となる。これにより、個々の監視手段による監視結果のみを利用する場合に比べて、監視対象の状態を、より運用に即した態様で正確に判定することが可能となる。 Further, from the above-mentioned characteristics, in the information processing system according to the present embodiment, it is possible to determine the state of the monitoring target by combining the monitoring results by a plurality of different monitoring means based on a common rule. As a result, the state of the monitoring target can be accurately determined in a more operational manner as compared with the case where only the monitoring results by the individual monitoring means are used.

１００監視システム
１１０監視部
１２１イベント生成部
１２２メッセージ記憶部
１２３イベント抽出部
１２４ルール記憶部
１２５イベント記憶部
１２６出力制御部
１２７出力部
１９０監視対象 100 Monitoring system 110 Monitoring unit 121 Event generation unit 122 Message storage unit 123 Event extraction unit 124 Rule storage unit 125 Event storage unit 126 Output control unit 127 Output unit 190 Monitoring target

Claims

An acquisition means for acquiring a message according to the result of the monitoring from each of a plurality of monitoring means for monitoring the status based on different monitoring conditions, and an acquisition means.
A generation means that generates event data based on the message acquired from at least one of the plurality of monitoring means, and a generation means.
An extraction means that extracts the event data based on predetermined extraction conditions, and
An output control means that controls the output of output information according to the extraction result of the event data, and
Information processing system equipped with.

The predetermined extraction conditions include extraction conditions corresponding to two or more monitoring conditions that are different from each other.
The extraction means associates two or more event data extracted based on the extraction conditions corresponding to the two or more monitoring conditions, respectively.
The output control means controls the output of the output information according to two or more associated event data.
The information processing system according to claim 1.

The first monitoring means among the plurality of monitoring means monitors the state based on the first monitoring condition regarding the performance of the monitored target.
The acquisition means acquires the first message according to the result of monitoring based on the first monitoring condition.
The information processing system according to claim 1 or 2.

The second monitoring means among the plurality of monitoring means monitors the state based on the second monitoring condition regarding the history sequentially recorded in the monitoring target.
The acquisition means acquires a second message according to the result of monitoring based on the second monitoring condition.
The information processing system according to any one of claims 1 to 3.

The generation means extracts information of a predetermined type from the message acquired from the monitoring means based on the first format associated with the monitoring means, and obtains the event data based on the extracted information. The information processing system according to any one of claims 1 to 4, which is generated.

The information processing system according to claim 5, wherein the generation means generates the event data by applying the predetermined type of information extracted from the message to the second format.

The information processing system according to claim 6, wherein the extraction conditions are set based on the second format.

The information processing system according to any one of claims 1 to 7, further comprising at least a part of the plurality of monitoring means.

It is a control method for information processing systems.
An acquisition step of acquiring a message according to the result of the monitoring from each of a plurality of monitoring means that monitor the status based on different monitoring conditions, and an acquisition step.
A generation step of associating the messages acquired from at least two or more monitoring means among the plurality of monitoring means to generate event data, and a generation step.
An extraction step for extracting the event data based on predetermined extraction conditions, and
An output control step that controls the output of output information according to the extraction result of the event data, and
Information processing system control methods, including.

An acquisition means for acquiring a message according to the result of the monitoring from each of a plurality of monitoring means for monitoring the status based on different monitoring conditions, and an acquisition means.
A generation means that generates event data based on the message acquired from at least one of the plurality of monitoring means, and a generation means.
An extraction means that extracts the event data based on predetermined extraction conditions, and
An output control means that controls the output of output information according to the extraction result of the event data, and
Information processing device.

On the computer
An acquisition step of acquiring a message according to the result of the monitoring from each of a plurality of monitoring means that monitor the status based on different monitoring conditions, and an acquisition step.
A step of generating event data based on the message acquired from at least one of the plurality of monitoring means, and a step of generating event data.
An extraction step for extracting the event data based on predetermined extraction conditions, and
An output control step that controls the output of output information according to the extraction result of the event data, and
A program to execute.