JPWO2018061070A1

JPWO2018061070A1 - Computer system and analysis source data management method

Info

Publication number: JPWO2018061070A1
Application number: JP2018541740A
Authority: JP
Inventors: 中島　淳; 淳中島; 伸手塚; 田口　雄一; 雄一田口
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2016-09-27
Filing date: 2016-09-27
Publication date: 2019-06-24
Anticipated expiration: 2036-09-27
Also published as: WO2018061070A1; JP6680897B2

Abstract

管理計算機のプロセッサは、記憶装置に格納された分析ソースデータを読み込んでデータ格納領域に格納し、分析ソースデータに対して分析を行った結果を分析結果データとして出力し、読み込んだ分析ソースデータの所在をデータ所在情報に格納し、分析結果データと分析ソースデータを関連付けて分析結果生成元情報に格納し、分析ソースデータがデータ格納領域から削除されたときにはデータ所在情報を更新し、分析結果データを含むデータ取得要求を受け付けて、分析結果データから分析結果生成元情報を参照して分析結果データに対応する分析ソースデータを特定し、特定した分析ソースデータでデータ所在情報を検索し、分析ソースデータの所在を特定する。 The processor of the management computer reads the analysis source data stored in the storage device and stores it in the data storage area, outputs the analysis result of the analysis source data as analysis result data, and reads the analysis source data read. The location is stored in the data location information, the analysis result data and the analysis source data are associated and stored in the analysis result generation source information, and when the analysis source data is deleted from the data storage area, the data location information is updated, and the analysis result data Receiving the data acquisition request including the data, referring to the analysis result generation source information from the analysis result data, specifying analysis source data corresponding to the analysis result data, searching data location information with the specified analysis source data, and analyzing source Identify the location of the data.

Description

本発明は、分析システムにおけるデータ探索方法に関する。 The present invention relates to a data search method in an analysis system.

クラウドコンピューティング（以下、クラウド）の普及に伴い、顧客のデータセンタにおける業務システムのデータをクラウド上に集約して、各種分析サービスが登場している。顧客は当該分析サービスを利用して分析レポートを作成し、業務システムの改善の判断等を行う。近年では、分析サービスにおいて、地理的に分散した、複数のデータセンタ、及びＩｏＴ（Internet of Things）デバイスの情報を収集するエッジコンピュータ等、各所からの情報を含めた分析を行うこともある。 With the spread of cloud computing (hereinafter referred to as "cloud"), various analysis services have appeared by aggregating business system data in customer data centers on the cloud. The customer uses the analysis service to create an analysis report, and makes decisions on improvement of the business system. In recent years, analysis services may include analysis from various places such as geographically dispersed data centers, edge computers that collect information of IoT (Internet of Things) devices, and so forth.

分析サービスにおいて、一度生成した分析結果に対して、追加の情報を加えた上で再度分析したいケースが存在する。具体例を挙げると、例えば、電力会社の持つ、顧客情報（年齢、性別、職業等）から、エネルギー利用量の予測分析を実施し、料金プラン(分析レポートX)を提示していた。分析を行った３カ月後に、分析者が、電力使用状況等の情報、顧客行動情報もあわせて、利用量の予測分析を実施し、料金プランを提示したいと考え、３ヶ月前からの顧客情報（年齢、性別、職業等）も再度利用してデータ分析を実施する、といったケースである。 In the analysis service, there are cases where it is desirable to add additional information to the analysis result once generated and to analyze it again. As a specific example, for example, from the customer information (age, gender, occupation, etc.) possessed by the power company, the forecasting analysis of the energy usage was performed, and the rate plan (analysis report X) was presented. Three months after the analysis, the analyst, together with information such as power usage status, customer behavior information, perform forecasting analysis of usage amount, and wants to present a rate plan, and customer information from three months ago In this case, data analysis is performed using (age, gender, occupation, etc.) again.

データウェアハウスにおいて、ソースとなるデータ(分析前のデータ)と、データ変換実施後のデータ(分析結果)の関連を保持しておき、データ変換実施後のデータから、ソースとなるデータを探索可能とする技術が特許文献１に開示されている。 In the data warehouse, the relationship between source data (data before analysis) and data after data conversion (analysis result) is held, and source data can be searched from data after data conversion Patent Document 1 discloses a technique for making it.

国際公開第２０１４／０６２２７７号International Publication No. 2014/062277

クラウド上の分析サービスにおいて、分析に利用したデータ（顧客システムの情報等）は、契約上の理由（例えば、記憶領域の容量制限）等で分析の終了後にクラウド上から削除されることが多い。このように前記従来例では、既に分析したソースデータの情報が分析用のクラウド上に存在しないため、分析結果のソースデータを利用できず、既存の分析データと同一のソースデータを利用して、迅速に追加のレポートを生成することができない、という問題があった。 In the analysis service on the cloud, data used for analysis (customer system information and the like) is often deleted from the cloud after analysis is completed due to a contract reason (for example, storage capacity limitation). As described above, in the conventional example, since the information of the already analyzed source data does not exist on the analysis cloud, the source data of the analysis result can not be used, and the same source data as the existing analysis data is used. There was a problem that it was not possible to generate additional reports quickly.

本発明は、プロセッサとメモリを含む管理計算機と、前記管理計算機に接続された１以上の記憶装置と、を有する計算機システムであって、前記プロセッサは、前記記憶装置に格納された分析ソースデータを読み込んで所定のデータ格納領域に格納し、前記プロセッサは、前記データ格納領域の分析ソースデータに対して所定の分析を行った結果を分析結果データとして出力し、前記プロセッサは、前記読み込んだ分析ソースデータの所在をデータ所在情報に格納し、前記プロセッサは、前記分析結果データを生成する際に利用した前記分析ソースデータを、当該分析結果データと関連付けて分析結果生成元情報に格納し、前記プロセッサは、前記利用した分析ソースデータの所在が変更または削除されたときには、前記データ所在情報を更新し、前記プロセッサは、分析結果データの情報を含むデータ取得要求を受け付けて、前記分析結果データの情報から前記分析結果生成元情報を参照して、当該分析結果データを生成する際に利用した分析ソースデータを特定し、前記プロセッサは、前記特定した分析ソースデータで前記データ所在情報を検索し、当該分析ソースデータの所在を特定する。 The present invention is a computer system having a management computer including a processor and a memory, and one or more storage devices connected to the management computer, the processor including analysis source data stored in the storage device. The processor reads and stores it in a predetermined data storage area, and the processor outputs the result of performing a predetermined analysis on the analysis source data of the data storage area as analysis result data, and the processor reads the analysis source read in The location of data is stored in the data location information, and the processor associates the analysis source data used when generating the analysis result data with the analysis result data, and stores the analysis source data in analysis result generation source information, the processor When the location of the used analysis source data is changed or deleted, the data location information is updated. The processor receives a data acquisition request including information of analysis result data, and refers to the analysis result generation source information from the information of the analysis result data, and uses the analysis used when generating the analysis result data. The source data is identified, and the processor searches the data location information in the identified analysis source data to identify the location of the analysis source data.

本発明によれば、分析結果データから、分析ソースデータを辿れるようにすることで、追加の分析レポートを迅速に生成することが可能となる。 According to the present invention, tracing analysis source data from analysis result data enables quick generation of additional analysis reports.

本発明の第１の実施例を示し、データセンタにおける処理の一例を示すブロック図である。FIG. 5 is a block diagram showing a first embodiment of the present invention and showing an example of processing in a data center. 本発明の第１の実施例を示し、データセンタの計算機システムの一例を示すブロック図である。FIG. 2 is a block diagram showing a first embodiment of the present invention and showing an example of a computer system of a data center. 本発明の第１の実施例を示し、分析用クラウドの一例を示すブロック図である。FIG. 2 is a block diagram showing an example of an analysis cloud according to the first embodiment of the present invention. 本発明の第１の実施例を示し、ゲートウェイ計算機の一例を示すブロック図である。FIG. 2 is a block diagram showing an example of a gateway computer according to the first embodiment of this invention. 本発明の第１の実施例を示し、分析結果生成元テーブルの一例を示す図である。FIG. 7 shows the first embodiment of the present invention, and shows an example of an analysis result generation source table. 本発明の第１の実施例を示し、データ所在情報テーブルの一例を示す図である。FIG. 7 shows the first embodiment of the present invention, and shows an example of a data location information table. 本発明の第１の実施例を示し、データテーブルの一例を示す図である。FIG. 7 shows the first embodiment of the present invention, and is a diagram showing an example of a data table. 本発明の第１の実施例を示し、データ取得情報テーブルの一例を示す図である。FIG. 7 shows the first embodiment of the present invention, and is a diagram showing an example of a data acquisition information table. 本発明の第１の実施例を示し、データレプリケーション管理テーブルの一例を示す図である。FIG. 7 shows the first embodiment of the present invention, and shows an example of a data replication management table. 本発明の第１の実施例を示し、データバックアップ管理テーブルの一例を示す図である。FIG. 7 shows the first embodiment of the present invention, and shows an example of a data backup management table. 本発明の第１の実施例を示し、ＥＴＬ処理情報テーブルの一例を示す図である。FIG. 7 shows the first embodiment of the present invention, and shows an example of an ETL processing information table. 本発明の第１の実施例を示し、分析ソースデータトレース処理の一例を示すフローチャートである。It is a flow chart which shows a 1st example of the present invention, and shows an example of analysis source data trace processing. 本発明の第１の実施例を示し、分析ソースデータのメタデータ生成及び格納処理の一例を示すフローチャートである。It is a flow chart which shows a 1st example of the present invention, and shows an example of metadata generation and storage processing of analysis source data. 本発明の第１の実施例を示し、分析結果データのメタデータ生成及び格納処理の一例を示すフローチャートである。It is a flowchart which shows a 1st Example of this invention, and shows an example of a metadata production | generation and storage process of analysis result data. 本発明の第１の実施例を示し、分析ソースデータのメタデータ更新処理の一例を示すフローチャートである。It is a flowchart which shows a 1st Example of this invention, and shows an example of the metadata update process of analysis source data. 本発明の第２の実施例を示し、データセンタの計算機システムの一例を示すブロック図である。FIG. 8 is a block diagram showing a second embodiment of the present invention and showing an example of a computer system of a data center.

以下、本発明の一実施形態について添付図面を用いて説明する。 Hereinafter, an embodiment of the present invention will be described with reference to the attached drawings.

幾つかの実施例を、図面を参照して説明する。なお、以下に説明する実施例は特許請求の範囲にかかる発明を限定するものではなく、また実施例の中で説明されている諸要素及びその組み合わせの全てが発明の解決手段に必須であるとは限らない。これらの図面において、複数の図を通じて同一の符号は同一の構成要素を示している。なお、以後の説明では「ａａａテーブル」等の表現にて本発明の情報を説明するが、これら情報はテーブル等のデータ構造以外で表現されていてもよい。そのため、データ構造に依存しないことを示すために「ａａａテーブル」等について「ａａａ情報」と呼ぶことがある。さらに、各情報の内容を説明する際に、「識別情報」、「識別子」、「名称」、「ＩＤ」という表現を用いるが、これらについてはお互いに置換が可能である。 Several embodiments will be described with reference to the drawings. The embodiments described below do not limit the invention according to the claims, and all of the elements described in the embodiments and their combinations are essential for the solution means of the invention. There is no limit. In these drawings, the same reference numerals indicate the same components throughout the drawings. In the following description, the information of the present invention will be described by the expression such as "aaa table", but such information may be expressed by other than the data structure such as the table. Therefore, the "aaa table" or the like may be called "aaa information" to indicate that it does not depend on the data structure. Furthermore, when describing the content of each information, although the expressions "identification information", "identifier", "name", and "ID" are used, they can be mutually substituted.

以後の説明では「プログラム」を主語として説明を行う場合があるが、プログラムはプロセッサによって実行されることで定められた処理をメモリ及び通信ポート（通信デバイス、管理Ｉ／Ｆ、データＩ／Ｆ）を用いながら行うため、プロセッサを主語とした説明としてもよい。また、プログラムを主語として開示された処理は管理サーバ等の計算機、情報処理装置が行う処理としてもよい。また、プログラムの一部または全ては専用ハードウェアによって実現されてもよい。また、各種プログラムはプログラム配布サーバや、計算機が読み取り可能な記憶メディアによって各計算機にインストールされてもよい。 In the following description, although the description may be made with "program" as the subject, the program may execute processing defined by the processor as a memory and communication port (communication device, management I / F, data I / F) In the description, the processor may be used as the subject. Further, the processing disclosed with the program as the subject may be processing performed by a computer such as a management server or an information processing apparatus. Also, part or all of the program may be realized by dedicated hardware. Also, various programs may be installed on each computer by a program distribution server or a storage medium readable by a computer.

以後、計算機システムを管理し、本発明の表示用情報を表示する一つ以上の計算機の集合を管理システムと呼ぶことがある。管理サーバが表示用情報を表示する場合は管理サーバが管理システムである、また、管理サーバと表示用計算機との組み合わせも管理システムである。また、管理処理の高速化や高信頼化のために複数の計算機で管理サーバと同等の処理を実現してもよく、この場合は当該複数の計算機（表示を表示用計算機が行う場合は表示用計算機も含め）が管理システムである。 Hereinafter, a set of one or more computers that manage computer systems and display the display information of the present invention may be referred to as a management system. When the management server displays the display information, the management server is the management system, and the combination of the management server and the display computer is also the management system. In addition, in order to speed up the management processing and increase reliability, processing equivalent to that of the management server may be realized by a plurality of computers, and in this case, the plurality of computers (for display when the display computer performs display) A computer is also a management system.

本実施例に係る計算機システムについて説明する。 A computer system according to the present embodiment will be described.

図１は本実施例の概略を示す図である。個々の構成要素の説明は後述する。データセンタ５の業務サーバ３０００上で稼働する業務システムにおいて利用するデータを、ＥＴＬ（Extract／Transform／Load）処理の実施後に分析用クラウド６に転送し、分析処理を実施する環境を前提とする。 FIG. 1 is a schematic view of the present embodiment. Description of each component will be described later. Assume that an environment in which data used in a business system operating on the business server 3000 of the data center 5 is transferred to an analysis cloud 6 after execution of ETL (Extract / Transform / Load) processing, and analysis processing is performed.

分析処理の実施後、過去に分析で利用したデーを参照したい場合に、分析ソースデータトレースプログラム９１５０は、（１）分析結果生成元テーブル（分析結果生成元情報）９１１０を参照して分析結果（分析結果データ）から当該分析に利用したデータ（分析ソースデータ）を特定し、（２）データ所在情報テーブル９１２０を参照して分析データの所在を特定し、（３）データ取得情報テーブル９１４０を参照してデータ取得方法を特定することで、過去に分析で利用したデータを取得可能とする。これらの処理を実施するため、データセンタ５上で保持しているＥＴＬ処理やデータ転送に関するメタデータを分析用クラウド６と共有する。 After performing analysis processing, when you want to refer to the data used for analysis in the past, the analysis source data trace program 9150 refers to (1) analysis result generation source table (analysis result generation source information) 9110 for analysis results ( Data (analysis source data) used for the analysis is specified from analysis result data), (2) location of analysis data is specified with reference to data location information table 9120, and (3) data acquisition information table 9140 is referenced. By specifying the data acquisition method, it is possible to acquire data used in analysis in the past. In order to carry out these processes, metadata related to ETL processes and data transfer held on the data center 5 are shared with the analysis cloud 6.

加えて、フィールド７のＩｏＴ機器１０、例えば工場における製造ロボットの情報や、工場内を撮影するカメラの情報などのデータを、フィールド７のゲートウェイ計算機２０において収集し、フィールド７内のデータ格納領域に格納し、分析用クラウド６にデータを転送し、分析用クラウド６上のメッセージブローカー３０によりデータテーブル（データ格納領域）９１３０に格納される構成を対象にしても良い。この場合においても前述の（１）から（３）の処理は同様となる。 In addition, data such as information on the IoT device 10 in field 7, for example, information on manufacturing robots in a factory, and information on a camera for photographing the inside of a factory are collected in the gateway computer 20 in field 7 and A configuration may be used in which the data is stored, transferred to the analysis cloud 6, and stored in the data table (data storage area) 9130 by the message broker 30 on the analysis cloud 6. Also in this case, the above-described processes (1) to (3) are the same.

ここで、フィールド７とは、例えば機械部品等を生産する１以上の工場を示し、１つ以上のＩｏＴ機器１０と１つ以上の管理計算機１０００及びゲートウェイ計算機２０を含む。 Here, the field 7 indicates, for example, one or more factories that produce machine parts and the like, and includes one or more IoT devices 10, one or more management computers 1000, and a gateway computer 20.

本実施例にかかわる計算機システムは、図２に示す１台以上の管理計算機（管理装置）１０００を含む１つ以上のデータセンタ５と、図３に示す１台以上の管理計算機９０００を含む一つ以上のデータセンタとしての分析用クラウド６または一つ以上のフィールド７から構成される。 The computer system according to this embodiment includes one or more data centers 5 including one or more management computer (management apparatus) 1000 shown in FIG. 2 and one including one or more management computer 9000 shown in FIG. It consists of the analysis cloud 6 or one or more fields 7 as the above data center.

図２は実施例１にかかわる計算機システムにおける、１台以上の管理計算機１０００を含むデータセンタ５の一例を示すブロック図である。 FIG. 2 is a block diagram showing an example of the data center 5 including one or more management computers 1000 in the computer system according to the first embodiment.

本実施例１では、１台以上の管理計算機１０００に加え、１台以上の業務サーバ３０００と、１台以上のストレージ装置２０００と、１台以上のＥＴＬサーバ４０００を含む例を示す。本実施例１で説明する全ての機能を管理計算機１０００が保有するなどしても良く、図示の構成に限定されない。 The first embodiment shows an example including one or more business servers 3000, one or more storage apparatuses 2000, and one or more ETL servers 4000 in addition to one or more management computers 1000. All functions described in the first embodiment may be held by the management computer 1000, for example, and the present invention is not limited to the illustrated configuration.

ストレージ装置２０００と、業務サーバ３０００と、ＥＴＬサーバ４０００は、ＳＡＮ（ＳｔｏｒａｇｅＡｒｅａＮｅｔｗｏｒｋ）などのデータ通信用のネットワーク６０００（具体的にはファイバチャネル等）を介して互いに接続される。 The storage apparatus 2000, the business server 3000, and the ETL server 4000 are connected to each other via a network 6000 (specifically, a fiber channel or the like) for data communication such as a SAN (Storage Area Network).

管理計算機１０００と、ストレージ装置２０００と、業務サーバ３０００と、ＥＴＬサーバ４０００は、管理用ネットワーク５０００を介して互いに接続される。図２に示す例では、ストレージ装置２０００と、業務サーバ３０００と、ＥＴＬサーバ４０００は、データ通信用のネットワーク６０００を介して互いに接続される。 The management computer 1000, the storage system 2000, the work server 3000, and the ETL server 4000 are connected to one another via a management network 5000. In the example shown in FIG. 2, the storage system 2000, the business server 3000, and the ETL server 4000 are mutually connected via the network 6000 for data communication.

当該接続は、ファイバチャネルを介して直接接続されるものに限定されず、１台以上のファイバチャネルスイッチ等のネットワーク機器を介して接続されても良い。また、当該接続は、データ通信用のネットワークであれば良く、ＩＰ（ＩｎｔｅｒｎｅｔＰｒｏｔｏｃｏｌ）ネットワークでも良い。また、データ通信用のネットワークを管理用ネットワーク５０００として、同じネットワークを利用しても良い。 The connection is not limited to direct connection via fiber channel, but may be via network equipment such as one or more fiber channel switches. The connection may be any network for data communication, and may be an IP (Internet Protocol) network. Also, the same network may be used as the management network 5000 as the data communication network.

管理計算機１０００は、メモリ１１００と、通信デバイス１２００と、プロセッサ１３００と、出力デバイス１４００と、入力デバイス１５００と、記憶デバイス１６００と、データＩ／Ｆ１８００を含み、これらは、内部バス１７００を介して互いに接続される。 The management computer 1000 includes a memory 1100, a communication device 1200, a processor 1300, an output device 1400, an input device 1500, a storage device 1600, and a data I / F 1800, which are mutually connected via the internal bus 1700. Connected

メモリ１１００は、構成情報収集プログラム１１１０と、メタデータ共有プログラム１１２０と、データ取得プログラム１１３０と、データレプリケーション管理テーブル１１４０と、データバックアップ管理テーブル１１５０を格納する。 The memory 1100 stores a configuration information collection program 1110, a metadata sharing program 1120, a data acquisition program 1130, a data replication management table 1140, and a data backup management table 1150.

構成情報収集プログラム１１１０は、ストレージ装置２０００と、業務サーバ３０００と、ＥＴＬサーバ４０００から、構成情報、例えば業務サーバ３０００と業務サーバで利用しているボリューム２２１０の関連情報や、図９に示すデータレプリケーション管理テーブル１１４０の情報や、図１０に示すデータバックアップ管理テーブル１１５０の情報等を収集するためのプログラムである。 The configuration information collection program 1110 includes configuration information from the storage device 2000, the business server 3000, and the ETL server 4000, for example, the related information of the volume 2210 used by the business server 3000 and the business server, and the data replication shown in FIG. It is a program for collecting information of the management table 1140, information of the data backup management table 1150 shown in FIG.

メタデータ共有プログラム１１２０は、データセンタ５上で保持する、分析用クラウド６に送信したデータに対応するメタデータ（具体的には図９または図１０に格納された情報）を分析用クラウド６と共有するためのプログラムである。 The metadata sharing program 1120 stores metadata (specifically, information stored in FIG. 9 or 10) corresponding to the data transmitted to the analysis cloud 6 held on the data center 5 with the analysis cloud 6 It is a program to share.

データ取得プログラム１１３０は、分析用クラウド６における分析ソースデータトレースプログラム９１５０から、分析用クラウド６において必要なデータの取得要求を受付けて、要求されたデータの取得を実行するプログラムである。 The data acquisition program 1130 is a program that receives an acquisition request for data required in the analysis cloud 6 from the analysis source data trace program 9150 in the analysis cloud 6 and executes acquisition of the requested data.

データレプリケーション管理テーブル１１４０には、データセンタ５から分析用クラウド６へのデータレプリケーションに関する情報を格納する。データバックアップ管理テーブル１１５０には、データセンタ５におけるデータバックアップに関する情報を格納する。 The data replication management table 1140 stores information on data replication from the data center 5 to the analysis cloud 6. The data backup management table 1150 stores information on data backup in the data center 5.

通信デバイス１２００は、管理用ネットワーク５０００に接続するためのデバイスである。プロセッサ１３００は、メモリ１１００上に展開されているプログラムを実行する。出力デバイス１４００は、管理計算機１０００が実行した処理結果を出力するデバイスで、例えばディスプレイ等である。入力デバイス１５００は、管理者が管理計算機１０００に指示を入力するためのデバイス、例えばキーボード等である。記憶デバイス１６００は、情報を格納するＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）や、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）等である。データＩ／Ｆ１８００は、データ通信用のネットワーク６０００に接続するためのインタフェースデバイスである。 The communication device 1200 is a device for connecting to the management network 5000. The processor 1300 executes a program deployed on the memory 1100. The output device 1400 is a device that outputs the processing result executed by the management computer 1000, and is, for example, a display or the like. The input device 1500 is a device, such as a keyboard, for the administrator to input an instruction to the management computer 1000. The storage device 1600 is a hard disk drive (HDD) that stores information, a solid state drive (SSD), or the like. The data I / F 1800 is an interface device for connecting to the network 6000 for data communication.

図２に示す例では、各種プログラム及びテーブルは、メモリ１１００に格納されているが、記憶デバイス１６００または他の記憶媒体（図示しない）に格納されても良い。この場合、プロセッサ１３００は、プログラム実行時にメモリ１１００上に対象のプログラムを読みだし、読みだしたプログラムを実行する。 Although various programs and tables are stored in the memory 1100 in the example shown in FIG. 2, they may be stored in the storage device 1600 or another storage medium (not shown). In this case, the processor 1300 reads the target program on the memory 1100 at the time of program execution, and executes the read program.

また、ストレージ装置２０００のメモリ２１００に、前述のプログラム及びテーブルが格納され、ストレージ装置２０００と、業務サーバ３０００と、ＥＴＬサーバ４０００が、格納されたプログラムを実行しても良い。また、他の業務サーバ３０００またはスイッチ（図示省略）等の他の装置が、前述のプログラム及びテーブルを格納し、格納したプログラムを実行しても良い。 Further, the above-described program and table may be stored in the memory 2100 of the storage device 2000, and the storage device 2000, the business server 3000 and the ETL server 4000 may execute the stored programs. In addition, another business server 3000 or another device such as a switch (not shown) may store the above program and table and execute the stored program.

管理計算機１０００は、管理用ネットワーク５０００を介して、ストレージ装置２０００、業務サーバ３０００、ＥＴＬサーバ４０００上で動作するプログラムと通信できる。 The management computer 1000 can communicate with programs operating on the storage system 2000, the business server 3000, and the ETL server 4000 via the management network 5000.

ストレージ装置２０００は、メモリ２１００と、論理ボリューム提供部２２００と、ディスクＩ／Ｆコントローラ２３００と、管理Ｉ／Ｆ２４００と、プロセッサ２５００及びデータＩ／Ｆ２６００を含み、これらは内部バス等の通信路２７００を介して接続される。 The storage device 2000 includes a memory 2100, a logical volume providing unit 2200, a disk I / F controller 2300, a management I / F 2400, a processor 2500 and a data I / F 2600, which communicate with the communication path 2700 such as an internal bus. Connected through.

メモリ２１００は、ディスクキャッシュ２１１０を有する。また、メモリ２１００は、レプリケーションプログラム２１２０及びバックアッププログラム２１３０を格納する。ディスクキャッシュ２１１０は、情報を一時格納するための記憶領域である。レプリケーションプログラム２１２０は、ストレージ装置２０００のボリューム２２１０に格納されたデータを、別のストレージ装置２０００にレプリケーションを行うためのプログラムである。 The memory 2100 has a disk cache 2110. The memory 2100 also stores a replication program 2120 and a backup program 2130. The disk cache 2110 is a storage area for temporarily storing information. The replication program 2120 is a program for performing replication of data stored in the volume 2210 of the storage system 2000 to another storage system 2000.

バックアッププログラム２１３０は、ストレージ装置２０００のボリューム２２１０に格納されたデータを、別の場所、例えば別のボリューム２２１０にバックアップを行うためのプログラムである。これらのプログラムは、定期的に実行、あるいはユーザ要求などの任意のタイミングで実行されても良い。 The backup program 2130 is a program for backing up data stored in the volume 2210 of the storage system 2000 to another location, for example, another volume 2210. These programs may be periodically executed or may be executed at any timing such as a user request.

ここで、各種プログラム及びテーブルは、メモリ２１００に格納されているが、記憶デバイス（２２００）または他の記憶媒体（図示しない）に格納されても良い。この場合、プロセッサ２５００は、プログラム実行時にメモリ２１００上に対象のプログラムを読みだし、読みだしたプログラムを実行する。 Here, the various programs and tables are stored in the memory 2100, but may be stored in a storage device (2200) or another storage medium (not shown). In this case, the processor 2500 reads the target program on the memory 2100 at the time of program execution, and executes the read program.

論理ボリューム提供部２２００は、物理領域２２３０によって構成されるディスクプール２２２０を含み、ディスクプール２２２０の記憶領域を論理的に分割し、当該論理的に分割された記憶領域をボリューム２２１０として提供する。ここで物理領域２２３０は、物理ディスクや複数の物理ディスクから構成されるパリティグループなどである。 The logical volume provision unit 2200 includes a disk pool 2220 configured by a physical area 2230, logically divides the storage area of the disk pool 2220, and provides the logically divided storage area as a volume 2210. Here, the physical area 2230 is, for example, a physical disk or a parity group configured of a plurality of physical disks.

当該ストレージ装置２０００の外部の装置からはボリューム２２１０経由で物理領域２２３０にアクセスすることが可能である。なお、物理領域２２３０には物理領域番号が付与され、ディスクプール２２２０にはディスクプール番号が付与され、ボリューム２２１０にはボリューム番号が付与される。 A device outside the storage device 2000 can access the physical area 2230 via the volume 2210. A physical area number is assigned to the physical area 2230, a disk pool number is assigned to the disk pool 2220, and a volume number is assigned to the volume 2210.

これによって、ストレージ装置２０００は、物理領域２２３０と、ディスクプール２２２０及び論理ボリューム２２１０をそれぞれ一意に識別することができる。図２に示す例では、１つの物理領域（パリティグループＰＧ１）から構成されるディスクプール２２２０（ＰＯＯＬ１）が論理的に分割され、１つのボリューム２２１０（Ｖｏｌ１）がストレージ装置２０００の外部の装置（例えば、業務サーバ３０００）に提供される。 Thus, the storage system 2000 can uniquely identify the physical area 2230, the disk pool 2220, and the logical volume 2210. In the example shown in FIG. 2, the disk pool 2220 (POOL1) configured from one physical area (parity group PG1) is logically divided, and one volume 2210 (Vol1) is a device outside the storage apparatus 2000 (for example, , Business server 3000).

ディスクＩ／Ｆコントローラ２３００は、ボリューム提供部２２００に接続するためのインタフェースデバイスである。管理Ｉ／Ｆ２４００は管理用ネットワーク５０００に接続するためのインタフェースデバイスである。プロセッサ２５００は、メモリ２１００上に展開されたプログラムを実行する。データＩ／Ｆ２６００は、データ通信用のネットワーク６０００に接続するためのインタフェースデバイスである。 The disk I / F controller 2300 is an interface device for connecting to the volume providing unit 2200. The management I / F 2400 is an interface device for connecting to the management network 5000. The processor 2500 executes a program expanded on the memory 2100. The data I / F 2600 is an interface device for connecting to the network 6000 for data communication.

また、論理ボリューム提供部２２００は、１つのディスクプール２２２０の全記憶領域を１つの論理ボリューム２２１０として作成しても良い。また、論理ボリューム提供部２２００は、物理領域２２３０としてパリティグループ以外、例えば物理ディスクそのものや、フラッシュメモリ等の記憶媒体でも良い。 Also, the logical volume provision unit 2200 may create the entire storage area of one disk pool 2220 as one logical volume 2210. Further, the logical volume providing unit 2200 may be a physical disk 2230 other than a parity group, for example, a physical disk itself or a storage medium such as a flash memory.

業務サーバ３０００は、メモリ３１００と、データＩ／Ｆ３２００と、プロセッサ３３００及び管理Ｉ／Ｆ３４００を含み、これらは内部バス等の通信路３５００を介して互いに接続される。メモリ３１００は、業務プログラム３１１０を格納する。 The work server 3000 includes a memory 3100, a data I / F 3200, a processor 3300, and a management I / F 3400, which are connected to one another via a communication path 3500 such as an internal bus. The memory 3100 stores a business program 3110.

業務プログラム３１１０は、業務サーバ３０００が提供する業務を実現するためのプログラムであり、例えば、ＤＢＭＳ（ＤａｔａＢａｓｅＭａｎａｇｅｍｅｎｔＳｙｓｔｅｍ）やファイルシステム等である。業務サーバ３０００は、ストレージ装置２０００から提供された論理ボリューム２２１０に業務データを格納し、各種業務を提供する。 The business program 3110 is a program for realizing the business provided by the business server 3000, and is, for example, a DBMS (Data Base Management System) or a file system. The business server 3000 stores business data in the logical volume 2210 provided from the storage system 2000 and provides various business.

図２に示す例では、各種プログラムはメモリ３１００上に格納されているが、他の記憶装置（図示しない）に格納されていても良い。この場合、プロセッサ３３００は、処理実行時にメモリ３１００上の対象のプログラムを読みだし、読みだしたプログラムを実行する。 Although various programs are stored in the memory 3100 in the example shown in FIG. 2, they may be stored in another storage device (not shown). In this case, the processor 3300 reads a target program on the memory 3100 at the time of processing execution, and executes the read program.

データＩ／Ｆ３２００は、データ通信用のネットワーク６０００に接続するためのインタフェースデバイスである。プロセッサ３３００は、メモリ３１００上に展開されたプログラムを実行する。管理Ｉ／Ｆ３４００は管理用ネットワーク５０００に接続するためのインタフェースデバイスである。 The data I / F 3200 is an interface device for connecting to the network 6000 for data communication. The processor 3300 executes a program developed on the memory 3100. The management I / F 3400 is an interface device for connecting to the management network 5000.

ＥＴＬサーバ４０００は、メモリ４１００と、データＩ／Ｆ４２００と、プロセッサ４３００及び管理Ｉ／Ｆ４４００を含み、これらは内部バス等の通信路４５００を介して互いに接続される。メモリ４１００は、ＥＴＬプログラム４１１０、及びＥＴＬ処理情報テーブル４１２０を格納する。 The ETL server 4000 includes a memory 4100, a data I / F 4200, a processor 4300, and a management I / F 4400, which are connected to one another via a communication path 4500 such as an internal bus. The memory 4100 stores an ETL program 4110 and an ETL processing information table 4120.

ＥＴＬプログラム４１１０は、ＥＴＬサーバ４０００が提供するＥＴＬ処理を実現するためのプログラムであり、例えば、ストレージ装置２０００等に蓄積された業務データを選択して、分析しやすい形式に加工し、加工済みのデータを分析用クラウド６に転送するためのストレージ装置に書き込みを行うという一連の処理を実施する。具体的には、データの突き合わせ、重複削除、集計、ソート、コード変換、クレンジング、テーブル作成、テーブルへの挿入などの処理を実施する。 The ETL program 4110 is a program for realizing the ETL process provided by the ETL server 4000. For example, the business data stored in the storage device 2000 or the like is selected, processed into a format easy to analyze, and processed. A series of processing is performed to write data in a storage device for transferring data to the analysis cloud 6. Specifically, processing such as data matching, duplicate deletion, aggregation, sorting, code conversion, cleansing, table creation, and insertion into a table is performed.

ＥＴＬプログラム４１１０は、例えば、ＣＳＶ、ＸＭＬ、ＪＳＯＮ、ＺＩＰ、ＨＴＭＬなど周知または公知の形式や、Ｏｒａｃｌｅ、ＤＢ２、ＭｉｃｒｏｓｏｆｔＳＱＬ、ＰｏｓｔｇｒｅＳＱＬなどの定義されたテーブルスキーマ情報に基づく形式を相互に変換するための情報等、データの選択と加工と分析及び書き込みに必要な情報を保持し、これらを利用してＥＴＬ処理を実施する。 For example, the ETL program 4110 mutually converts a format based on well-known or publicly known formats such as CSV, XML, JSON, ZIP, HTML, and defined table schema information such as Oracle, DB2, Microsoft SQL, Postgre SQL, etc. Information necessary for data selection, processing, analysis, and writing, etc., and ETL processing is performed using these.

ＥＴＬ処理情報テーブル４１２０は、ＥＴＬプログラム４１１０により実行されたＥＴＬ処理の実行履歴情報を保持する。図２に示す例では、各種プログラムはメモリ４１００上に格納されているが、他の記憶装置（図示しない）に格納されていても良い。この場合、プロセッサ４３００は、処理実行時にメモリ４１００上の対象のプログラムを読みだし、読みだしたプログラムを実行する。 The ETL process information table 4120 holds execution history information of the ETL process executed by the ETL program 4110. Although various programs are stored in the memory 4100 in the example shown in FIG. 2, they may be stored in another storage device (not shown). In this case, the processor 4300 reads a target program on the memory 4100 at the time of processing execution, and executes the read program.

データＩ／Ｆ４２００は、データ通信用のネットワーク６０００に接続するためのインタフェースデバイスである。プロセッサ４３００は、メモリ４１００上に展開されたプログラムを実行する。管理Ｉ／Ｆ４４００は管理用ネットワーク５０００に接続するためのインタフェースデバイスである。 The data I / F 4200 is an interface device for connecting to the network 6000 for data communication. The processor 4300 executes a program expanded on the memory 4100. The management I / F 4400 is an interface device for connecting to the management network 5000.

図３は実施例１にかかわる計算機システムにおける、１台以上の管理計算機９０００を含む分析用クラウド６の一例を示すブロック図である。 FIG. 3 is a block diagram showing an example of an analysis cloud 6 including one or more management computers 9000 in the computer system according to the first embodiment.

本実施例１では、１台以上の管理計算機に加え、１台以上のストレージ装置（記憶装置）２０００、及びメッセージブローカー３０を含む例を示す。本実施例１で説明する全ての機能を管理計算機９０００が保有するなどしても良く、図３の構成に限定されない。 In the first embodiment, an example including one or more storage devices (storage devices) 2000 and a message broker 30 in addition to one or more management computers is shown. The management computer 9000 may hold all the functions described in the first embodiment, and is not limited to the configuration of FIG.

管理計算機９０００と、ストレージ装置２０００は、データ通信用のネットワーク６０００、を介して互いに接続される。データ通信用のネットワーク６０００としてどのようなものが使われても良く、例えばＩＰ（ＩｎｔｅｒｎｅｔＰｒｏｔｏｃｏｌ）ネットワークが利用される。 The management computer 9000 and the storage system 2000 are connected to each other via a network 6000 for data communication. Any network may be used as the network 6000 for data communication, and for example, an IP (Internet Protocol) network is used.

管理計算機９０００は、メモリ９１００と、通信デバイス９２００と、プロセッサ９３００と、出力デバイス９４００と、入力デバイス９５００と、記憶デバイス９６００と、データＩ／Ｆ９８００を含みこれらは、内部バス９７００を介して互いに接続される。 The management computer 9000 includes a memory 9100, a communication device 9200, a processor 9300, an output device 9400, an input device 9500, a storage device 9600, and a data I / F 9800, which are mutually connected via an internal bus 9700. Be done.

メモリ９１００は、分析結果生成元テーブル９１１０と、データ所在情報テーブル９１２０と、データテーブル９１３０と、データ取得情報テーブル９１４０と、分析ソースデータトレースプログラム９１５０と、メタデータ管理プログラム９１６０と、分析プログラム９１７０を含む。 The memory 9100 includes an analysis result generation source table 9110, a data location information table 9120, a data table 9130, a data acquisition information table 9140, an analysis source data tracing program 9150, a metadata management program 9160, and an analysis program 9170. Including.

分析結果生成元テーブル９１１０には、分析用クラウド６における、分析結果のデータと、分析に利用した分析ソースデータの関連情報を格納する。データ所在情報テーブル９１２０には、各時刻のデータの所在を表す情報を格納する。データテーブル９１３０には、データの値を表す情報を格納する。データ取得情報テーブル９１４０には、データの取得に必要な処理に関する情報を格納する。分析ソースデータトレースプログラム９１５０は、分析結果から分析に利用された分析ソースデータの所在を特定し、データを取得するためのプログラムである。メタデータ管理プログラム９１６０には、データの所在情報などの、データに関するメタ情報を格納する。分析プログラム９１７０は、分析ソースデータに基づき、分析を実施するプログラムである。 The analysis result generation source table 9110 stores data of the analysis result in the analysis cloud 6 and related information of analysis source data used for analysis. The data location information table 9120 stores information indicating the location of data at each time. The data table 9130 stores information representing data values. The data acquisition information table 9140 stores information on processing required to acquire data. The analysis source data trace program 9150 is a program for specifying the location of analysis source data used for analysis from analysis results and acquiring the data. The metadata management program 9160 stores meta information on data, such as location information on data. The analysis program 9170 is a program that performs analysis based on analysis source data.

通信デバイス９２００は、管理用ネットワーク５０００に接続するためのデバイスである。プロセッサ９３００は、メモリ９１００上に展開されているプログラムを実行する。出力デバイス９４００は、管理計算機９０００が実行した処理結果を出力するデバイス、例えばディスプレイ等である。入力デバイス９５００は、管理者が管理計算機９０００に指示を入力するためのデバイス、例えばキーボード等である。 The communication device 9200 is a device for connecting to the management network 5000. The processor 9300 executes a program loaded on the memory 9100. The output device 9400 is a device that outputs the processing result executed by the management computer 9000, such as a display. The input device 9500 is a device, such as a keyboard, for the administrator to input an instruction to the management computer 9000.

記憶デバイス９６００は、情報を格納するＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）や、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）等である。データＩ／Ｆ９８００は、データ通信用のネットワーク６０００に接続するためのインタフェースデバイスである。 The storage device 9600 is a hard disk drive (HDD) that stores information, a solid state drive (SSD), or the like. The data I / F 9800 is an interface device for connecting to the network 6000 for data communication.

図３に示す例では、各種プログラム及びテーブルは、メモリ９１００に格納されているが、記憶デバイス９６００または他の記憶媒体（図示省略）に格納されても良い。この場合、プロセッサ９３００は、プログラム実行時にメモリ９１００上に対象のプログラムを読みだし、読みだしたプログラムを実行する。 Although various programs and tables are stored in the memory 9100 in the example shown in FIG. 3, they may be stored in the storage device 9600 or another storage medium (not shown). In this case, the processor 9300 reads a target program on the memory 9100 at the time of program execution, and executes the read program.

また、ストレージ装置２０００のメモリ２１００に、前述のプログラム及びテーブルが格納され、ストレージ装置２０００、格納されたプログラムを実行しても良い。また、サーバまたはスイッチ（図示しない）等の他の装置が、前述のプログラム及びテーブルを格納し、格納したプログラムを実行しても良い。管理計算機９０００は、管理用ネットワーク５０００を介して、ストレージ装置２０００上で動作するプログラムと通信できる。 Further, the above-described program and table may be stored in the memory 2100 of the storage device 2000, and the storage device 2000 may execute the stored program. Also, another device such as a server or a switch (not shown) may store the aforementioned program and table and execute the stored program. The management computer 9000 can communicate with a program operating on the storage system 2000 via the management network 5000.

ここで、データセンタ５と、分析用クラウド６間は、管理用ネットワーク５０００を介する通信のために、ゲートウェイ（図示しない）などを経由して互いに接続される。データセンタ５は、顧客の業務データを保存しておき、顧客の業務プログラムの実行が行われる施設であり、分析用クラウド６とは地理的に互いに分散していることが多い。データセンタ５と分析用クラウド６の間を繋ぐ管理用ネットワーク５０００は、ＷＡＮ（ワイドエリアネットワーク）や、ＬＡＮ（ローカルエリアネットワーク）などの任意のネットワークタイプであってよい。 Here, the data center 5 and the analysis cloud 6 are connected to each other via a gateway (not shown) or the like for communication via the management network 5000. The data center 5 is a facility for storing customer's business data and executing the customer's business program. The data center 5 and the analytic cloud 6 are often geographically dispersed with each other. The management network 5000 connecting between the data center 5 and the analysis cloud 6 may be any network type such as a wide area network (WAN) or a local area network (LAN).

データセンタ５、及び分析用クラウド６に配置されるゲートウェイはデータセンタ５内部、及び分析用クラウド６内部で利用されるネットワークプロトコルと、データセンタ５と分析用クラウド６間の通信のために利用されるネットワークプロトコルとの変換処理などを行う。 Gateways arranged in the data center 5 and the analysis cloud 6 are used for communication between the data center 5 and the analysis cloud 6 with a network protocol used in the data center 5 and in the analysis cloud 6. Conversion processing with the network protocol.

また、データセンタ５と分析用クラウド６が、データ通信用のネットワーク６０００を介するデータ通信のために互いに接続されており、データ通信用のネットワーク６０００は、ＳＡＮやＩＰなどの任意のネットワークタイプであってよく、また、管理用ネットワーク５０００と同一のネットワークを利用してもよい。 In addition, the data center 5 and the analysis cloud 6 are connected to each other for data communication via the network 6000 for data communication, and the network 6000 for data communication is any network type such as SAN or IP. The same network as the management network 5000 may be used.

また、本実施例では、ストレージ装置２０００を用意し、ストレージの保有する機能であるレプリケーションプログラム２１２０を利用し、データセンタ５と分析用クラウド６間のリモートレプリケーションを実施している。これに限定されるものではなく、例えば業務サーバ３０００上で動作するリモートコピープログラム（図示しない）を利用して、業務サーバ３０００上の記憶装置に格納したデータを分析用クラウド６に送信するなどしても良い。 Further, in the present embodiment, the storage apparatus 2000 is prepared, and remote replication between the data center 5 and the analysis cloud 6 is performed using the replication program 2120 which is a function held by the storage. For example, the data stored in the storage device on the business server 3000 is transmitted to the analysis cloud 6 using a remote copy program (not shown) operating on the business server 3000, for example. It is good.

メッセージブローカー３０は、フィールド７からのデータを受信し、テーブル定義にあわせてデータの形式変換を行うなどして、管理計算機９０００のデータテーブル９１３０に格納する役割を有する。ここでは管理計算機９０００と別にメッセージブローカー３０を用意する例を示したが、メッセージブローカー３０の機能を管理計算機９０００が提供しても良く、図示の例に限定されない。 The message broker 30 receives data from the field 7, performs format conversion of the data in accordance with the table definition, and stores the data in the data table 9130 of the management computer 9000. Although an example in which the message broker 30 is prepared separately from the management computer 9000 has been described here, the function of the message broker 30 may be provided by the management computer 9000 and is not limited to the illustrated example.

図４は実施例１にかかわる計算機システムにおける、フィールド７の一例を示すブロック図である。本実施例１では、１台以上の管理計算機９０００と、１台以上のＩｏＴ機器１０、１台以上のゲートウェイ計算機２０を含む例を示すが、本実施例１で説明する管理計算機９０００とゲートウェイ計算機２０の機能を、管理計算機またはゲートウェイ計算機のいずれかで提供しても良く、図示の例に限定されない。 FIG. 4 is a block diagram showing an example of the field 7 in the computer system according to the first embodiment. The first embodiment shows an example including one or more management computers 9000, one or more IoT devices 10, and one or more gateway computers 20. However, the management computer 9000 and the gateway computer described in the first embodiment will be described. The twenty functions may be provided by either a management computer or a gateway computer, and are not limited to the illustrated example.

また、管理計算機９０００と、ＩｏＴ機器１０と、ゲートウェイ計算機２０は、管理用ネットワーク５０００、を介して互いに接続される。管理用ネットワーク５０００としては、例えばＩＰ（ＩｎｔｅｒｎｅｔＰｒｏｔｏｃｏｌ）ネットワークを利用することができる。 The management computer 9000, the IoT device 10, and the gateway computer 20 are connected to one another via a management network 5000. As the management network 5000, for example, an IP (Internet Protocol) network can be used.

管理計算機９０００は、図３に示したものと同様であるため説明を省略する。ゲートウェイ計算機２０は、ＩｏＴ機器１０のデータ、例えばセンサデータ等を収集し、分析用クラウド６のメッセージブローカー３０に当該データを送信する手段を提供する。 The management computer 9000 is the same as that shown in FIG. The gateway computer 20 collects data of the IoT device 10, such as sensor data, and provides means for transmitting the data to the message broker 30 of the analysis cloud 6.

ゲートウェイ計算機２０は、メモリ２１と、通信デバイス２２と、プロセッサ２３と、出力デバイス２４と、入力デバイス２５と、記憶デバイス２６を含み、これらは、内部バス２７を介して互いに接続される。メモリ２１は、収集定義テーブル２１１と、ゲートウェイプログラム２１２を格納する。 The gateway computer 20 includes a memory 21, a communication device 22, a processor 23, an output device 24, an input device 25 and a storage device 26, which are connected to one another via an internal bus 27. The memory 21 stores a collection definition table 211 and a gateway program 212.

収集定義テーブル２１１は、ゲートウェイプログラム２１２によって参照され、どのＩｏＴ機器１０のセンサからセンサデータを収集し、収集されたセンサデータをどの分析用クラウド６へ送信するかを定義したテーブルである（図示省略）。 The collection definition table 211 is a table which is referred to by the gateway program 212, from which sensors of the IoT device 10 collect sensor data, and to which analysis cloud 6 to transmit the collected sensor data (not shown) ).

ゲートウェイプログラム２１２は、収集定義テーブル２１１に格納された収集定義情報に基づき、ＩｏＴ機器１０からデータを収集し、分析用クラウド６のメッセージブローカー３０に当該データを送信する。 The gateway program 212 collects data from the IoT device 10 based on the collection definition information stored in the collection definition table 211, and transmits the data to the message broker 30 of the analysis cloud 6.

通信デバイス２２と、プロセッサ２３と、出力デバイス２４と、入力デバイス２５と、記憶デバイス２６と、内部バス２７については、管理計算機９０００と同様の構成であるため説明を省略する。 The communication device 22, the processor 23, the output device 24, the input device 25, the storage device 26, and the internal bus 27 have the same configuration as the management computer 9000, and thus the description thereof is omitted.

図５は、実施例１にかかわる分析結果生成元テーブル９１１０の一例を示す図である。分析結果生成元テーブル９１１０は、分析用クラウド６上で実行された分析結果のデータと、当該分析に用いられた分析ソースデータの対応関係を示す情報を管理する。 FIG. 5 is a diagram showing an example of the analysis result generation source table 9110 according to the first embodiment. The analysis result generation source table 9110 manages information indicating correspondence between data of analysis results executed on the analysis cloud 6 and analysis source data used for the analysis.

分析結果生成元テーブル９１１０は、分析結果データ９１１１と、分析ソースデータ９１１２と、ソースデータ日時９１１３のフィールドを一つのエントリに含む。 The analysis result generation source table 9110 includes the fields of analysis result data 9111, analysis source data 9112, and source data date 9113 in one entry.

分析結果データ９１１１には、分析用クラウド６上で実行された分析結果のデータを示す識別子が格納される。分析ソースデータ９１１２には、分析結果データ９１１１を生成する際に用いられたデータを示す識別子が格納される。 The analysis result data 9111 stores an identifier indicating data of an analysis result executed on the analysis cloud 6. The analysis source data 9112 stores an identifier indicating data used when generating the analysis result data 9111.

ソースデータ日時９１１３には、分析結果データ９１１１を生成する際に利用した、分析ソースデータ９１１２の時刻情報を示す値（例えば、生成された日時）が格納される。図５に示した例では、２０１６／０６／０１−０６／１４の期間のデータＡの情報と、２０１６／０６／０１−０６／１４の期間のデータＢの情報を利用して、分析結果データＸが生成されたことを示す。 The source data date 9113 stores a value (for example, a generated date) indicating time information of the analysis source data 9112 used when generating the analysis result data 9111. In the example shown in FIG. 5, analysis result data using the information of data A of the period 2016/06 / 01-06 / 14 and the information of data B of the period 2016/06 / 01-06 / 14 Indicates that X was generated.

図６は、実施例１にかかわるデータ所在情報テーブル９１２０の一例を示す図である。データ所在情報テーブル９１２０は、現時点におけるデータの所在を表す情報を日時情報とあわせて管理する。 FIG. 6 is a diagram showing an example of the data location information table 9120 according to the first embodiment. The data location information table 9120 manages information indicating the location of data at the current time together with date and time information.

データ所在情報テーブル９１２０は、データＩＤ９１２１と、分析結果フラグ９１２２と、日時９１２３と、データ所在９１２４のフィールドを一つのエントリに含む。 The data location information table 9120 includes fields of a data ID 9121, an analysis result flag 9122, a date and time 9123, and a data location 9124 in one entry.

データＩＤ９１２１には、データを特定する識別子が格納される。分析結果フラグ９１２２には、データＩＤ９１２１で特定されるデータが分析結果データか、分析結果を生成する際に利用された分析ソースデータかを示す識別子が格納される。具体的には、分析結果の場合は、分析結果フラグとして"Ｔｒｕｅ"が、分析ソースデータの場合は、分析結果フラグとして"Ｆａｌｓｅ"が格納される。 The data ID 9121 stores an identifier for specifying data. The analysis result flag 9122 stores an identifier indicating whether the data specified by the data ID 9121 is analysis result data or analysis source data used when generating an analysis result. Specifically, in the case of an analysis result, "True" is stored as an analysis result flag, and in the case of analysis source data, "False" is stored as an analysis result flag.

日時９１２３には、データＩＤ９１２１で示されるデータの取得（または生成）日時を表す情報が格納される。データ所在９１２４には、日時９１２３で示される日時における、データＩＤ９１２１で示されるデータの所在を示す識別子が格納される。 The date and time 9123 stores information indicating the acquisition (or generation) date and time of the data indicated by the data ID 9121. The data location 9124 stores an identifier indicating the location of the data indicated by the data ID 9121 at the date and time indicated by the date and time 9123.

図６に示した例の一行目は、分析ソースデータであるデータＡの、日時２０１６／０６／１５−０６／３０の期間の値は、"Ｌｏｃａｌ"すなわち分析用クラウド６に保存され、２０１６／０１／０１−０６／３０の期間の値は、"Ｄａｔａｃｅｎｔｅｒ１"で示されるデータセンタ５に格納されていることを示している。 In the first line of the example shown in FIG. 6, the value of the period of date and time 2016/06 / 15-06 / 30 of the data A, which is analysis source data, is stored in “Local”, ie, the analysis cloud 6, 2016 / The value of the period 01 / 01-06 / 30 is stored in the data center 5 indicated by "Datacenter1".

すなわち、一行目のデータは、２０１６／０１／０１−０６／１４の期間のデータは分析用クラウド６からは削除されていることを表している。また、図６に示した例の三行目は、分析ソースデータであるデータＣの、日時２０１６／０１／０１−０６／３０の期間の値は、"Ｌｏｃａｌ"すなわち分析用クラウド６と、"Ｆｉｅｌｄ１"で示されるフィールド７、例えば工場の両方に格納されていることを示している。 That is, the data in the first row represents that the data of the period 2016/01 / 01-06 / 14 has been deleted from the analysis cloud 6. In the third line of the example shown in FIG. 6, the value of the period of date and time 2016/01 / 01-06 / 30 of the data C which is the analysis source data is "Local", that is, the analysis cloud 6, " It indicates that it is stored in both of the fields 7 indicated by Field 1 ", for example, the factory.

図７は、実施例にかかわるデータテーブル９１３０の一例を示す図である。データテーブル９１３０は、データセンタ５から分析用クラウド６に送信されるデータを管理する。 FIG. 7 is a diagram showing an example of the data table 9130 according to the embodiment. The data table 9130 manages data transmitted from the data center 5 to the analysis cloud 6.

データテーブル９１３０は、データＩＤ９１３１と、日時９１３２と、Ｖａｌｕｅ９１３３のフィールドを一つのエントリに含む。 The data table 9130 includes fields of data ID 9131, date and time 9132, and value 9133 in one entry.

データＩＤ９１３１には、データを特定する識別子が格納される。日時９１３２には、データの取得（または生成）日時を表す情報が格納される。Ｖａｌｕｅ９１３３には、データＩＤ９１３１で表されるデータの、日時９１３２における値が格納される。図７に示した例では、データＡの日時"２０１６／０６／１５１０：００"の値は１８００、日時"２０１６／０６／１５１１：００"の値は２０００、日時"２０１６／０６／１５１２：００"の値は３０００であること表している。 The data ID 9131 stores an identifier for specifying data. The date and time 9132 stores information indicating the date of acquisition (or generation) of data. The Value 9133 stores the value at the date 9132 of the data represented by the data ID 9131. In the example shown in FIG. 7, the value of the date and time "2016/06/15 10:00" of data A is 1800, the value of date and time "2016/06/15 11:00" is 2000, and date and time "2016/06/15 The value of 12:00 "represents 3000.

本実施例１では、説明を簡易にするためにＶａｌｕｅが単純な数値である例を示したが、これに限定されず、例えば数値や文字列の組み合わせからなるような値や、配列で示される値、Ｋｅｙ−Ｖａｌｕｅのセットで構成される値などどのような形式の値でも良い。 In the first embodiment, an example in which Value is a simple numerical value is shown to simplify the description, but the present invention is not limited to this. For example, the value is represented by a combination of numerical values and character strings It may be any form of value, such as a value or a value composed of a set of Key-Value.

図８は、実施例１にかかわるデータ取得情報テーブル９１４０の一例を示す図である。データ取得情報テーブル９１４０は、データセンタ５に存在するデータの取得方法を管理する。データ取得情報テーブル９１４０は、データＩＤ９１４１と、データ所在９１４２と、データ取得コマンド９１４３のフィールドを一つのエントリに含む。 FIG. 8 is a diagram showing an example of the data acquisition information table 9140 according to the first embodiment. The data acquisition information table 9140 manages an acquisition method of data existing in the data center 5. The data acquisition information table 9140 includes fields of a data ID 9141, a data location 9142, and a data acquisition command 9143 in one entry.

データＩＤ９１４１には、データを特定する識別子が格納される。データ所在９１４２には、データの所在を示す情報が格納される。データ取得コマンド９１４３には、データＩＤ９１４１で特定されるデータを取得するために必要なコマンドが格納される。 The data ID 9141 stores an identifier for specifying data. The data location 9142 stores information indicating the location of data. The data acquisition command 9143 stores commands necessary for acquiring the data specified by the data ID 9141.

図８に示した例の一行目では、データＡを"ＤａｔａＣｅｎｔｅｒ１"から取得するために、"Ｇｅｔｃｏｎｔｒａｃｔ（Ａ）"というコマンドを実行する必要がある旨を表している。 The first line of the example shown in FIG. 8 indicates that it is necessary to execute a command “Get contract (A)” in order to obtain data A from “DataCenter 1”.

また、二行目の例では、データＢを"ＤａｔａＣｅｎｔｅｒ２"から取得するために、"Ｒｅｓｔｏｒｅｂａｃｋｕｐｖｏｌｕｍｅ１２０"コマンドを時刻情報を指定して実行することで、指定した時点の識別子１２０で表されるボリュームのバックアップからデータを復元し、"Ｇｅｔｖｏｌｕｍｅ（１２０'）"というコマンドで、識別子１２０'で表されるボリュームに復元したデータを取得し、"ＥＴＬｆｒｏｍＯｒａｃｌｅｔｏＰｏｓｔｇｒｅｓ"というコマンドでボリューム１２０'のデータをＯｒａｃｌｅ形式からＰｏｓｔｇｒｅｓ形式にＥＴＬプログラムで変換し、"Ｓｅｔｒｅｍｏｔｅｒｅｐｌｉｃａｔｉｏｎ"というコマンドで"ＤａｔａＣｅｎｔｅｒ２"から分析用クラウド６へデータの転送を行う、という処理を実行する必要がある旨を表している。 Also, in the example of the second line, in order to obtain data B from "DataCenter 2", it is represented by the identifier 120 at the designated time point by executing the "Restore backup volume 120" command with designating the time information. Restore the data from the volume backup, get the data restored to the volume represented by the identifier 120 'with the command "Get volume (120')", and use the command "ETL from Oracle to Postgres" with the volume 120 ' Data from Oracle format to Postgres format by ETL program, and execute data transfer from "DataCenter 2" to Cloud for analysis 6 using "Set remote replication" command. It represents the fact that there is a necessity.

本実施例１では、データＩＤ９１４１で特定されるデータは、元々存在するデータセンタ５と、分析用クラウド６の最大二か所のみに存在する例であり、データＩＤ９１４１で特定されるデータ毎に一つのデータ所在のみを有する例を示しているが、時刻によってデータＩＤ９１４１で示されるデータの所在が変わる場合は、時刻情報カラムを追加し、時刻毎のデータ所在、及びデータ取得コマンドを示せるようにしても良い。 In the first embodiment, the data specified by the data ID 9141 is an example existing at only the maximum two places of the originally existing data center 5 and the analysis cloud 6, and one data is specified for each data specified by the data ID 9141. An example is shown in which only one data location is shown, but if the location of the data indicated by the data ID 9141 changes depending on the time, a time information column is added so that the data location for each time and data acquisition command can be indicated. Also good.

図９は、実施例１にかかわるデータレプリケーション管理テーブル１１４０の一例を示す図である。 FIG. 9 is a diagram showing an example of the data replication management table 1140 according to the first embodiment.

データレプリケーション管理テーブル１１４０は、コピー元所在ＩＤ１１４１と、コピー元ストレージＩＤ１１４２と、コピー元ボリュームＩＤ１１４３と、コピー先所在ＩＤ１１４４と、コピー先ストレージＩＤ１１４５と、コピー先ボリュームＩＤ１１４６と、最終更新時刻１１４７のフィールドを一つのエントリに含む。 The data replication management table 1140 includes fields of copy source location ID 1141, copy source storage ID 1142, copy source volume ID 1143, copy destination location ID 1144, copy destination storage ID 1145, copy destination volume ID 1146, and last update time 1147. Include in one entry.

コピー元所在ＩＤ１１４１には、データセンタ５から分析用クラウド６に、データを送信する際の、データセンタ５の識別子が格納される。コピー元ストレージＩＤ１１４２には、データセンタ５から分析用クラウド６にデータを送信する際の、データセンタ５におけるデータ格納先であるストレージの識別子が格納される。 The copy source location ID 1141 stores an identifier of the data center 5 when transmitting data from the data center 5 to the analysis cloud 6. The copy source storage ID 1142 stores an identifier of a storage that is a data storage destination in the data center 5 when data is transmitted from the data center 5 to the analysis cloud 6.

コピー元ボリュームＩＤ１１４３には、データセンタ５から分析用クラウド６にデータを送信する際の、データセンタ５におけるデータ格納先であるストレージのボリュームの識別子が格納される。コピー先所在ＩＤ１１４４には、データセンタ５から分析用クラウド６に、データを送信する際の、分析用クラウド６の識別子が格納される。 The copy source volume ID 1143 stores an identifier of a volume of a storage that is a data storage destination in the data center 5 when data is transmitted from the data center 5 to the analysis cloud 6. The copy destination location ID 1144 stores an identifier of the analysis cloud 6 when data is transmitted from the data center 5 to the analysis cloud 6.

コピー先ストレージＩＤ１１４５には、データセンタ５から分析用クラウド６にデータを送信する際の、データセンタ５におけるデータ格納先であるストレージの識別子が格納される。コピー先ボリュームＩＤ１１４６には、データセンタ５から分析用クラウド６にデータを送信する際の、分析用クラウド６におけるデータ格納先であるストレージのボリュームの識別子が格納される。最終更新時刻１１４７には、データセンタ５から分析用クラウド６にデータを転送した最終時刻が格納される。 The copy destination storage ID 1145 stores an identifier of a storage that is a data storage destination in the data center 5 when data is transmitted from the data center 5 to the analysis cloud 6. The copy destination volume ID 1146 stores an identifier of a storage volume which is a data storage destination in the analysis cloud 6 when data is transmitted from the data center 5 to the analysis cloud 6. The last update time 1147 stores the last time data was transferred from the data center 5 to the analysis cloud 6.

本実施例１では、Ｖｏｌｕｍｅ単位でのリモートレプリケーションの例を示したが、これに限定されず、ファイル単位やテーブル単位などの単位でのリモートレプリケーションでも良い。 In the first embodiment, an example of remote replication in units of Volumes has been described. However, the present invention is not limited to this, and remote replication in units of files or tables may be used.

図１０は、実施例１にかかわるデータバックアップ管理テーブル１１５０の一例を示す構成図である。データバックアップ管理テーブル１１５０は、ストレージＩＤ１１５１と、ボリュームＩＤ１１５２と、バックアップ先ストレージＩＤ１１５３と、バックアップ先ボリュームＩＤ１１５４と、バックアップ日時１１５５のフィールドを一つのエントリに含む。 FIG. 10 is a configuration diagram showing an example of the data backup management table 1150 according to the first embodiment. The data backup management table 1150 includes fields of storage ID 1151, volume ID 1152, backup destination storage ID 1153, backup destination volume ID 1154 and backup date 1155 in one entry.

ストレージＩＤ１１５１には、ストレージを一意に特定する示す識別子が格納される。ボリュームＩＤ１１５２には、ストレージＩＤ１１５１内のボリュームを一意に特定する識別子が格納される。バックアップ先ストレージＩＤ１１５３には、バックアップ先のストレージを一意に特定する識別子が格納される。バックアップ先ボリュームＩＤ１１５４には、バックアップ先のボリュームを一意に特定する識別子が格納される。バックアップ日時１１５５には、バックアップが実行された日時の情報が格納される。 The storage ID 1151 stores an identifier that uniquely identifies the storage. The volume ID 1152 stores an identifier for uniquely identifying the volume in the storage ID 1151. The backup destination storage ID 1153 stores an identifier that uniquely identifies the storage of the backup destination. The backup destination volume ID 1154 stores an identifier for uniquely identifying the backup destination volume. The backup date 1155 stores information on the date and time when the backup was performed.

図１１は、実施例１にかかわるＥＴＬ処理情報テーブル１１６０の一例の構成図である。 FIG. 11 is a block diagram of an example of the ETL processing information table 1160 according to the first embodiment.

ＥＴＬ処理情報テーブル１１６０は、ＥＴＬサーバ４０００のＥＴＬプログラム４１１０が実施した、業務サーバ３０００上の業務プログラム３１１０で利用し、ストレージ装置２０００等に蓄積されたデータを選択して、分析しやすい形式に加工し、データを分析用クラウド６に転送するためのストレージ装置に書き込みを行うという一連の処理に関する情報を管理する。 The ETL processing information table 1160 is used by the business program 3110 on the business server 3000 executed by the ETL program 4110 of the ETL server 4000 to select data stored in the storage device 2000 etc. and process it into a format easy to analyze Manage information on a series of processes of writing data in a storage device for transferring data to the analysis cloud 6.

ＥＴＬ処理情報テーブル１１６０は、実行時刻１８０１と、Ｏｐｅｒａｔｉｏｎ１８０２と、Ｓｏｕｒｃｅ１８０３と、Ｔａｒｇｅｔ１８０４のフィールドを一つのエントリに含む。 The ETL processing information table 1160 includes the fields of execution time 1801, operation 1802, source 1803, and target 1804 in one entry.

実行時刻１８０１はＥＴＬ処理が実行された時刻を示す情報が格納される。Ｏｐｅｒａｔｉｏｎ１８０２は、ＥＴＬプログラムにより実行された処理を示す情報が格納される。 The execution time 1801 stores information indicating the time when the ETL process was performed. Operation 1802 stores information indicating processing executed by the ETL program.

Ｓｏｕｒｃｅ１８０３はＯｐｅｒａｔｉｏｎ１８０２で示される処理への入力データが格納されたストレージのボリューム識別子が格納される。Ｔａｒｇｅｔ１８０４はＯｐｅｒａｔｉｏｎ１８０２で示される処理の出力データを格納するストレージのボリューム識別子が格納される。 The Source 1803 stores the volume identifier of the storage in which the input data to the process shown in Operation 1802 is stored. The Target 1804 stores the volume identifier of the storage that stores the output data of the process shown in Operation 1802.

図１１では、ＥＴＬサーバ４０００が、２０１６／０６／０１１０：００にＥＴＬ処理を開始し、Ｓｔｏｒａｇｅ１のＶｏｌｕｍｅ１０のデータを読み込み、読み込んだデータからＳＱＬのリクエストを生成し、転送用のデータ形式のスキーマ情報に基づきテーブルをＳｔｏｒａｇｅ１のボリューム３に生成し、生成されたＳＱＬリクエストを利用して、生成されたテーブルにデータを挿入する例を示している。 In FIG. 11, the ETL server 4000 starts ETL processing at 10:00 on the 1st of June, reads data of Volume 10 of Storage 1, generates SQL request from the read data, and schema of data format for transfer. An example is shown in which a table is generated on the volume 3 of Storage 1 based on the information, and data is inserted into the generated table using the generated SQL request.

次に、管理計算機９０００が実行する各処理について説明する。 Next, each process executed by the management computer 9000 will be described.

図１２は、実施例１に係わる分析結果の生成元のデータ（分析ソースデータ）をトレースする処理のフローチャート１００である。 FIG. 12 is a flowchart 100 of processing of tracing data (analysis source data) of a generation source of an analysis result according to the first embodiment.

データをトレースする処理とは、データの所在を特定し、記録する処理にあたる。本処理は、管理計算機９０００のプロセッサ９３００が、メモリ９１００上に展開された分析ソースデータトレースプログラム９１５０を実行することによって行われる。以下、本処理の具体例を示す。 The process of tracing data is a process of specifying and recording the location of data. This process is performed by the processor 9300 of the management computer 9000 executing the analysis source data trace program 9150 developed on the memory 9100. Hereafter, the specific example of this process is shown.

まず、分析ソースデータトレースプログラム９１５０は、既存の分析結果のソースとして利用されたデータの取得要求を受信する（ステップ１０１）。データの取得要求は、管理用ネットワーク５０００を介して、外部の計算機から受信するのに加え、入力デバイス９５００からの取得要求を受け付けても良い。例えば、ユーザが管理計算機９０００の入力デバイス９５００を介して、既存の分析結果に別のデータを加えたレポートを作成する要求を入力した際等に、当該要求を受信する。具体例としては、ユーザが分析結果Ｘに、データＣを追加してレポートを作成しようとした際に、管理計算機９０００は分析結果Ｘのソースとして利用されたデータ（分析ソースデータ）の取得要求を受信する。なお、取得要求には、分析結果のデータの識別子が含まれる。 First, the analysis source data trace program 9150 receives a request for acquiring data used as a source of existing analysis results (step 101). The data acquisition request may be received from the input device 9500 in addition to being received from an external computer via the management network 5000. For example, when the user inputs a request to create a report in which another data is added to the existing analysis result via the input device 9500 of the management computer 9000, the request is received. As a specific example, when the user adds data C to the analysis result X to create a report, the management computer 9000 requests acquisition of data (analysis source data) used as a source of the analysis result X To receive. The acquisition request includes the identifier of the data of the analysis result.

次に、分析ソースデータトレースプログラム９１５０は、図５に示した分析結果生成元テーブル９１１０を参照し、ステップ１０１で受信した既存分析結果のソースとして利用された分析ソースデータを特定する（ステップ１０２）。例えば、分析結果Ｘのソースとして利用されたデータが、データＡの２０１６／６／１〜２０１６／６／１４の情報と、データＢの２０１６／６／１〜２０１６／６／１４の情報であることを特定する。 Next, the analysis source data trace program 9150 refers to the analysis result generation source table 9110 shown in FIG. 5, and specifies the analysis source data used as the source of the existing analysis result received in step 101 (step 102). . For example, the data used as a source of analysis result X is the information of 2016/6/1 to 2016/6/14 of data A and the information of 2016/6/1 to 2016/6/14 of data B. Identify that.

次に、分析ソースデータトレースプログラム９１５０は、図６に示したデータ所在情報テーブル９１２０を参照し、ステップ１０２で特定したデータの格納場所を特定する（ステップ１０３）。例えば、データＡの２０１６／６／１〜２０１６／６／１４の情報と、データＢの２０１６／６／１〜２０１６／６／１４の情報の所在を特定するため、分析ソースデータトレースプログラム９１５０は、データ所在情報テーブル９１２０を参照し、データＡの該当時刻の情報は"Ｄａｔａｃｅｎｔｅｒ１"に存在し、データＢの該当時刻の情報は、分析用クラウド６及びＤａｔａｃｅｎｔｅｒ２に存在する、と特定する。 Next, the analysis source data trace program 9150 refers to the data location information table 9120 shown in FIG. 6, and identifies the storage location of the data identified in step 102 (step 103). For example, in order to specify the location of 2016/6/1 to 2016/6/14 of data A and the location of 2016/6/1 to 2016/6/14 of data B, the analysis source data trace program 9150 It refers to the data location information table 9120, and specifies that the information of the corresponding time of the data A exists in "Datacenter1" and the information of the corresponding time of the data B exists in the analysis cloud 6 and Datacenter2.

本実施例１では、ステップ１０１で取得要求を受信したソースデータと同一時刻のデータが存在する例を説明したが、分析の内容によっては必ずしも完全一致する必要がないため、取得要求の時刻前後で、ソースデータが存在する最も近い時刻の情報で代用してもよい。 In the first embodiment, an example has been described in which data having the same time as the source data for which the acquisition request has been received in step 101 exists, but it is not necessary to completely match depending on the content of analysis. Alternatively, information on the closest time at which the source data exists may be substituted.

そして、分析ソースデータトレースプログラム９１５０は、分析用クラウド６に該当のデータが存在するか否かを判定し（ステップ１０４）、分析用クラウド６に該当のデータが存在する場合、管理計算機９０００のデータテーブル９１３０から情報を取得し、処理を終了する（ステップ１０５）。 Then, the analysis source data trace program 9150 determines whether or not the corresponding data exists in the analysis cloud 6 (step 104), and if the corresponding data exists in the analysis cloud 6, the data of the management computer 9000 Information is acquired from the table 9130, and the process is ended (step 105).

分析用クラウド６に該当のデータが存在しない場合、分析ソースデータトレースプログラム９１５０は、図８に示したデータ取得情報テーブル９１４０を参照し、管理計算機１０００のデータ取得プログラム１１３０に対して、該当時刻の情報の取得を要求する（ステップ１０６）。 When there is no corresponding data in the analysis cloud 6, the analysis source data trace program 9150 refers to the data acquisition information table 9140 shown in FIG. 8 and compares the data acquisition program 1130 of the management computer 1000 with the corresponding time. It requests acquisition of information (step 106).

ここで、分析ソースデータトレースプログラム９１５０は、図８に示した例のように、具体的な操作内容、例えば、該当時刻のバックアップボリュームをリストアするデータベースの形式をＯｒａｃｌｅからＰｏｓｔｇｒｅｓに変換する、あるいは、分析用クラウド６へのリモートコピーを実施する、などを指定して該当時刻の情報取得要求を実施する。 Here, as in the example shown in FIG. 8, the analysis source data trace program 9150 converts the specific operation content, for example, the format of the database for restoring the backup volume at the corresponding time from Oracle to Postgres, or Designate a remote copy to the analysis cloud 6, and the like, and make an information acquisition request at the corresponding time.

あるいは、分析ソースデータトレースプログラム９１５０が、データ取得コマンド９１４３のみをステップ１０３で特定したデータ格納場所の管理計算機１０００のデータ取得プログラム１１３０に対して発行することで該当時刻の情報取得要求を実施し、管理計算機１０００側で具体的な操作内容を決定しても良い。 Alternatively, the analysis source data trace program 9150 issues an information acquisition request for the corresponding time by issuing only the data acquisition command 9143 to the data acquisition program 1130 of the management computer 1000 of the data storage location specified in step 103, Specific operation content may be determined on the management computer 1000 side.

本実施例１では、データのバックアップを取得し、ＥＴＬ処理を実行するケースの例を記載したが、これに限定されず、バックアップではなく処理中のデータそのものを取得するケースや、ＥＴＬ処理を実施せずに分析用クラウド６に情報を送信するケースもある。 In the first embodiment, an example of a case where data backup is obtained and ETL processing is performed is described. However, the present invention is not limited thereto, and a case where data being processed instead of backup is acquired or ETL processing is performed. There is also a case where information is sent to the analysis cloud 6 without being processed.

ここで、該当のデータが古い情報の場合、データセンタ５内で集約処理が行われ、一日単位の粒度の情報は残っておらず、一週間単位などの集約された情報しかないケースもあり、この場合、データセンタ５の管理計算機１０００は指定時刻を含む一定期間の集約情報を返すなどしても良い。また、データセンタ５から分析用クラウド６へのデータ送信については、データ通信用のネットワーク６０００を利用して実施される。 Here, if the corresponding data is old information, aggregation processing is performed in the data center 5, and there is no case where information of granularity in units of one day remains, and there is also a case where there is only aggregated information such as one week. In this case, the management computer 1000 of the data center 5 may return aggregation information for a predetermined period including the designated time. In addition, data transmission from the data center 5 to the analysis cloud 6 is performed using the network 6000 for data communication.

以上の処理によって、分析用クラウド６の管理計算機９０００は、要求されたデータの所在を特定して、データセンタ５または分析用クラウド６から該当するデータを取得することができる。 By the above process, the management computer 9000 of the analysis cloud 6 can specify the location of the requested data, and acquire the corresponding data from the data center 5 or the analysis cloud 6.

図１３は、実施例にかかわる分析用クラウド６へのデータ収集時に分析ソースデータのメタデータを生成及び格納する処理のフローチャートである。ここでメタデータとは、データをトレースする際に必要となる、データの所在情報やデータ取得方法に関する情報などにあたる。本処理のステップ２０３以降は、管理計算機９０００のプロセッサ９３００が、メモリ９１００上に展開されたメタデータ管理プログラム９１６０を実行することによって行われる。以下、本フローチャートの具体例を示す。 FIG. 13 is a flowchart of processing of generating and storing metadata of analysis source data at the time of data collection to the analysis cloud 6 according to the embodiment. Here, metadata corresponds to location information of data, information on a method of acquiring data, etc., which are necessary when tracing data. Step 203 and subsequent steps of this process are performed by the processor 9300 of the management computer 9000 executing the metadata management program 9160 developed on the memory 9100. Hereinafter, a specific example of this flowchart will be shown.

まず、データセンタ５から分析用クラウド６へのデータ送信が実施される。これは周知または公知の方法によっても良く、例えば、本実施例１では、ストレージ装置２０００のリモートコピー機能を利用し、データセンタ５のストレージ装置２０００から分析用クラウド６のストレージ装置２０００にデータを複製する機能を利用した例を示す。なお、これに限定されるものではなく、例えば、管理計算機１０００で稼働するバックアップソフトウェア等を利用しても良い。 First, data transmission from the data center 5 to the analysis cloud 6 is performed. This may be performed by a known or known method. For example, in the first embodiment, the remote copy function of the storage system 2000 is used to copy data from the storage system 2000 of the data center 5 to the storage system 2000 of the analysis cloud 6 An example of using the function to In addition, it is not limited to this, For example, you may utilize the backup software etc. which operate | move with the management computer 1000. FIG.

データセンタ５の管理計算機１０００のメタデータ共有プログラム１１２０は、データが送信されたことを検出し（ステップ２０１）、分析用クラウド６の管理計算機９０００のメタデータ管理プログラム９１６０に当該データに関する情報を送信する（ステップ２０２）。 The metadata sharing program 1120 of the management computer 1000 of the data center 5 detects that the data has been transmitted (step 201), and transmits information about the data to the metadata management program 9160 of the management computer 9000 of the analysis cloud 6 (Step 202).

具体的には、分析用クラウド６に送信したデータのデータＩＤと、送信日時と、送信元データセンタ５のＩＤおよび送信元データセンタ５内でのデータの処理内容を送信する。管理計算機１０００のメタデータ共有プログラム１１２０は、データセンタ５内のストレージ装置２０００や、業務サーバ３０００や、ＥＴＬサーバ４０００等から、分析用クラウド６へのデータの送信に関する情報や、ＥＴＬサーバ４０００でのデータ処理内容などの情報を定期的に収集して保持しているものとする。 Specifically, the data ID of the data transmitted to the analysis cloud 6, the transmission date and time, the ID of the transmission source data center 5, and the processing content of the data in the transmission source data center 5 are transmitted. The metadata sharing program 1120 of the management computer 1000 includes information about transmission of data from the storage apparatus 2000 in the data center 5, the business server 3000, the ETL server 4000, etc. to the analysis cloud 6, and the ETL server 4000. It is assumed that information such as data processing content is periodically collected and held.

次に、分析用クラウド６の管理計算機９０００のメタデータ管理プログラム９１６０は、データを受信し（ステップ２０３）、受信した情報のうち、データＩＤと、送信日時と、送信元データセンタ５のＩＤに基づき、図６に示すデータ所在情報テーブル９１２０を更新する（ステップ２０４）。 Next, the metadata management program 9160 of the management computer 9000 of the analysis cloud 6 receives the data (step 203), and among the received information, the data ID, the transmission date and time, and the ID of the transmission source data center 5 Based on this, the data location information table 9120 shown in FIG. 6 is updated (step 204).

続いて、メタデータ管理プログラム９１６０は、受信した情報のうち、送信元データセンタ５内での送信データの処理内容に基づき、図８に示すデータ取得情報テーブル９１４０を更新する（ステップ２０５）。すなわち、メタデータ管理プログラム９１６０は、受信したデータの処理内容に基づいて、当該データを取得するためのコマンドなどを取得して、データ取得情報テーブル９１４０のデータ取得コマンド９１４３に格納する。 Subsequently, the metadata management program 9160 updates the data acquisition information table 9140 shown in FIG. 8 based on the processing content of the transmission data in the transmission source data center 5 among the received information (step 205). That is, the metadata management program 9160 acquires a command or the like for acquiring the data based on the processing content of the received data, and stores it in the data acquisition command 9143 of the data acquisition information table 9140.

データ取得コマンド９１４３には、データの格納形態に応じたコマンドが格納される。例えば、当該データがバックアップデータとしてバックアップされている場合には、リストアのコマンドを格納し、データ形式を変換する場合には、データ形式の変換先を指定する。換言すれば、データ取得コマンド９１４３には、現在のデータの格納形式に応じた取得方法が格納される。 The data acquisition command 9143 stores a command according to the data storage mode. For example, when the data is backed up as backup data, a restore command is stored, and when converting the data format, the conversion destination of the data format is specified. In other words, the data acquisition command 9143 stores an acquisition method corresponding to the storage format of the current data.

本実施例１では、ステップ２０１において、データセンタ５の管理計算機１０００のメタデータ共有プログラム１１２０は、データが分析用クラウド６へ送信されたことを検出することで処理が開始されることとした。これに限定されるものではなく、例えば分析用クラウド６のメタデータ管理プログラム９１６０が、データが受信されたことを検出し、メタデータ共有プログラム１１２０に対して、受信データに関する情報の送信を要求するなどにより処理を開始しても良く、あるいは定期的にメタ情報を共有するなどしても良い。また、メタデータ共有プログラム１１２０が、ストレージ装置２０００のボリューム２２１０内にメタデータを格納し、レプリケーションプログラム２１２０の機能を用いて、データ通信用のネットワーク６０００経由で情報を送信しても良い。 In the first embodiment, in step 201, the metadata sharing program 1120 of the management computer 1000 of the data center 5 starts processing by detecting that data has been transmitted to the analysis cloud 6. For example, the metadata management program 9160 of the analysis cloud 6 detects that data has been received, and requests the metadata sharing program 1120 to transmit information on received data. Or the like, or meta information may be periodically shared. In addition, the metadata sharing program 1120 may store metadata in the volume 2210 of the storage device 2000, and transmit information via the network 6000 for data communication using the function of the replication program 2120.

上記処理によって、データセンタ５から分析用クラウド６へデータが送信されると、当該データの情報が分析用クラウド６へ送信され、管理計算機９０００は、データ所在情報テーブル９１２０とデータ取得情報テーブル９１４０を更新することができる。 When data is transmitted from the data center 5 to the analysis cloud 6 by the above processing, the information of the data is transmitted to the analysis cloud 6, and the management computer 9000 stores the data location information table 9120 and the data acquisition information table 9140. It can be updated.

図１４は、実施例１にかかわる分析用クラウド６における分析処理の実行後にメタデータを更新する処理のフローチャートである。本処理は、管理計算機９０００のプロセッサ９３００が、メモリ９１００上に展開されたメタデータ管理プログラム９１６０を実行することによって行われる。以下、本フローチャートの具体例を示す。 FIG. 14 is a flowchart of processing of updating metadata after execution of analysis processing in the analysis cloud 6 according to the first embodiment. This process is performed by the processor 9300 of the management computer 9000 executing the metadata management program 9160 developed on the memory 9100. Hereinafter, a specific example of this flowchart will be shown.

まず、データセンタ５から収集された情報を利用して分析処理が実施される。これは、周知または公知の方法によっても良く、例えば、ユーザが出力デバイス９４００に表示されるデータを参照して、どの期間にどのデータを利用して、分析を行うかを検討の上、入力デバイス９５００を介して分析の実行操作を行うことで、分析プログラム９１７０により分析処理が実行され、出力デバイス９４００を介して、分析結果をレポートなどの形式で参照可能となる。 First, analysis processing is performed using the information collected from the data center 5. This may be performed by a known or known method. For example, the user refers to the data displayed on the output device 9400 and considers which data to use during which period to analyze, and then the input device By performing the analysis execution operation via 9500, the analysis program 9170 executes the analysis processing, and the analysis result can be referenced in the form of a report or the like via the output device 9400.

メタデータ管理プログラム９１６０は、分析プログラム９１７０により分析処理が実行され、分析結果がデータテーブル９１３０に格納されたことを検出する（ステップ３０１）。メタデータ管理プログラム９１６０は、分析結果のデータと、分析に利用した分析ソースデータの対応関係を分析結果生成元テーブル９１１０に格納する（ステップ３０２）。次に、メタデータ管理プログラム９１６０は、分析結果のデータの生成時刻情報と所在情報を、データ所在情報テーブル９１２０に格納する（ステップ３０３）。 The metadata management program 9160 detects that the analysis process is executed by the analysis program 9170 and the analysis result is stored in the data table 9130 (step 301). The metadata management program 9160 stores the correspondence between the data of the analysis result and the analysis source data used for the analysis in the analysis result generation source table 9110 (step 302). Next, the metadata management program 9160 stores generation time information and location information of analysis result data in the data location information table 9120 (step 303).

上記処理によって、分析が完了する度に分析結果のデータと、分析に利用した分析ソースデータの関係を分析結果生成元テーブル９１１０に格納され、分析結果のデータの生成時刻情報と所在情報がデータ所在情報テーブル９１２０に格納されて、各データの所在を含む情報が生成される。 By the above processing, every time analysis is completed, the relationship between the data of the analysis result and the analysis source data used for analysis is stored in the analysis result generation source table 9110, and the generation time information and location information of the data of the analysis result are data locations It is stored in the information table 9120 to generate information including the location of each data.

図１５は、実施例１にかかわる、分析用クラウド６から分析ソースデータを削除した後に、メタデータを更新する処理のフローチャートである。この処理は、分析ソースデータの削除後の他に、所定のタイミング（例えば、所定の周期）で実行することができる。 FIG. 15 is a flowchart of processing of updating metadata after deleting analysis source data from the analysis cloud 6 according to the first embodiment. This process can be executed at a predetermined timing (for example, a predetermined cycle) other than after deletion of analysis source data.

分析用クラウド６上では、分析に利用した分析ソースデータ（顧客システムの情報等）は、契約上の理由や、容量上の理由等で分析終了後にクラウド上から削除されることが多く、このようなケースを想定した処理を実施する。 On the analysis cloud 6, analysis source data (such as customer system information) used for analysis is often deleted from the cloud after analysis is completed due to contract reasons, capacity reasons, etc. Implement the process assuming the following cases.

まず、分析プログラム９１７０が、分析に利用した分析ソースデータで、利用が完了したデータをデータテーブル９１３０から削除する。当該処理は、分析プログラム９１７０により定期的に実行される、分析処理の実行後に自動的に実行される、あるいはユーザが入力デバイス９５００を経由して削除要求を実行するなど、どのような方法によって削除しても良い。 First, the analysis program 9170 deletes, from the data table 9130, data whose use has been completed with analysis source data used for analysis. The process is periodically executed by the analysis program 9170, automatically executed after the execution of the analysis process, or deleted by any method such as the user executing a deletion request via the input device 9500. You may.

データテーブル９１３０からデータが削除された際に（ステップ４０１）、分析プログラム９１７０が削除したデータに対応するメタデータを削除する。具体的には、分析プログラム９１７０が、当該データの全ての時刻における情報が削除されたか否かを判定し（ステップ４０２）、削除された場合には、データ所在情報テーブル９１２０から、該当データがローカル（分析用クラウド６上）にあることを示すエントリを削除する（ステップ４０３）。 When data is deleted from the data table 9130 (step 401), the metadata corresponding to the data deleted by the analysis program 9170 is deleted. Specifically, the analysis program 9170 determines whether or not the information at all times of the data has been deleted (step 402). If the information is deleted, the corresponding data is local from the data location information table 9120. The entry indicating that it is (on the analysis cloud 6) is deleted (step 403).

当該データの一部の時刻（または期間）の情報のみが削除された場合には、分析プログラム９１７０が、データ所在情報テーブル９１２０の、該当データがローカル（分析用クラウド６上）にあることを示すエントリの日時９１２３の情報を更新する（ステップ４０４）。 When only information of a part of time (or period) of the data is deleted, the analysis program 9170 indicates that the corresponding data in the data location information table 9120 is local (on the analysis cloud 6). The information of the entry date 9123 is updated (step 404).

本処理の実行後に、分析用クラウド６上には存在しないデータであっても、そのメタデータ情報、具体的にはデータ所在情報テーブル９１２０には、データ収集元のデータセンタ５の情報などが存在する点が、本発明の特徴の一つとなっている。 Even after the execution of this process, even if the data does not exist on the analysis cloud 6, its metadata information, specifically the data location information table 9120, includes the information of the data center 5 of the data collection source, etc. Is one of the features of the present invention.

以上のように、実施例１によれば、分析ソースデータが分析用クラウド６から削除されていても、分析結果から、分析前のデータを辿れるようになる。本実施例では、例えばデータがブロックである例を示した。計算機システムを利用する顧客観点では、データの位置を意識することなく、データにアクセス可能となる。これにより、追加の分析レポートを容易かつ迅速に作成可能となる。 As described above, according to the first embodiment, even if analysis source data is deleted from the analysis cloud 6, data before analysis can be traced from the analysis result. In the present embodiment, for example, the data is a block. From the viewpoint of a customer using a computer system, data can be accessed without being aware of the position of the data. This makes it possible to create additional analysis reports easily and quickly.

本実施例２にかかわる計算機システムは、図１６に示す１台以上の管理計算機１０００と、１台以上の業務サーバ３０００と、１台以上のエッジサーバ７０００と、１台以上のファイルサーバ８０００を含む１つ以上のデータセンタ５と、１台以上の管理計算機９０００と、１台以上のエッジサーバ７０００を含む一つ以上のデータセンタとしての分析用クラウド６から構成される。 The computer system according to the second embodiment includes one or more management computers 1000 shown in FIG. 16, one or more business servers 3000, one or more edge servers 7000, and one or more file servers 8000. It comprises an analysis cloud 6 as one or more data centers including one or more data centers 5, one or more management computers 9000, and one or more edge servers 7000.

ここで、データセンタ５における管理計算機１０００と、業務サーバ３０００及び分析用クラウド６における管理計算機９０００については、前記実施例１と同様の構成であるため説明は省略する。 Here, the management computer 1000 in the data center 5, the business server 3000, and the management computer 9000 in the analysis cloud 6 have the same configurations as in the first embodiment, and thus the description thereof is omitted.

各構成要素はデータ通信用のネットワーク６０００（具体的にはＩＰ等）を介して互いに接続され、また、管理用ネットワーク５０００を介して互いに接続される。当該接続は、直接接続されるものに限定されず、１台以上のスイッチ等のネットワーク機器を介して接続されても良い。また、データ通信用のネットワークと管理用のネットワークとして、同じネットワークを利用しても良い。 The respective components are connected to one another via a data communication network 6000 (specifically, IP etc.) and are also connected to one another via a management network 5000. The connection is not limited to direct connection, and may be connected via a network device such as one or more switches. Also, the same network may be used as a data communication network and a management network.

エッジサーバ７０００のファイルサーバプログラム７１１０は、業務サーバ３０００からの入出力要求（Ｉ／Ｏ要求）に応じて、業務サーバ３０００に対してファイル共有サービスを提供するプログラム（例えばＮＦＳサーバプログラム）である。 The file server program 7110 of the edge server 7000 is a program (for example, an NFS server program) for providing a file sharing service to the business server 3000 in response to an input / output request (I / O request) from the business server 3000.

分析用クラウド６のファイル共有プログラム７１２０は、ファイルサーバ８０００のファイルシステム（図示省略）と、エッジサーバ７０００のファイルシステムとで疑似ファイルシステム（例えば、仮想ファイルシステム）を構成し、透過的にファイルを移動可能とするプログラムである。 The file sharing program 7120 of the analysis cloud 6 configures a pseudo file system (for example, a virtual file system) by the file system (not shown) of the file server 8000 and the file system of the edge server 7000 and transparently transmits the file. It is a movable program.

データセンタ５のファイルサーバ８０００のファイルサーバプログラム８１１０は、ファイル共有サービスを提供するプログラム（例えばＮＦＳサーバプログラム）である。ファイル共有プログラム８１２０は、エッジサーバ７０００のファイル共有プログラム７１２０と同様であり、ファイルサーバ８０００のファイルシステム（図示省略）と、エッジサーバ７０００のファイルシステムとで疑似ファイルシステム（例えば、仮想ファイルシステム）を構成し、透過的にファイルを移動可能とするプログラムである。なお、各エッジサーバ７０００への更新情報が、ファイルサーバ８０００に反映され、ファイルサーバ８０００への更新情報が、各エッジサーバ７０００に反映される。 The file server program 8110 of the file server 8000 of the data center 5 is a program (for example, an NFS server program) that provides a file sharing service. The file sharing program 8120 is similar to the file sharing program 7120 of the edge server 7000, and a pseudo file system (for example, a virtual file system) is configured by the file system (not shown) of the file server 8000 and the file system of the edge server 7000. It is a program that configures and allows files to be moved transparently. The update information to each edge server 7000 is reflected to the file server 8000, and the update information to the file server 8000 is reflected to each edge server 7000.

本実施例２では、データセンタ５の業務サーバ３０００によるエッジサーバ７０００への書き込み処理がファイルサーバ８０００に反映され、ファイルサーバ８０００に反映されたデータが、分析用クラウド６のエッジサーバ７０００に転送される。 In the second embodiment, the writing process to the edge server 7000 by the business server 3000 of the data center 5 is reflected on the file server 8000, and the data reflected on the file server 8000 is transferred to the edge server 7000 of the analysis cloud 6. Ru.

分析用クラウド６のエッジサーバ７０００に取得要求のデータに関する情報が存在しない場合、前記実施例１と同様に、管理計算機９０００は、分析結果生成元テーブル９１１０と、分析データ所在情報テーブル９１２０と、データ取得情報テーブル９１４０を参照し、データセンタ５の業務サーバ３０００とエッジサーバ７０００におけるデータの、所定時刻のバックアップまたはそれに相当する情報（例えば、ログ）を、ファイルサーバ８０００と同期するように設定を変更する。 If there is no information on the acquisition request data in the edge server 7000 of the analysis cloud 6, as in the first embodiment, the management computer 9000 generates an analysis result generation source table 9110, an analysis data location information table 9120, and data. Change the settings to synchronize the file server 8000 with data (for example, log) of backup at predetermined time or its equivalent of data in the business server 3000 and edge server 7000 of the data center 5 with reference to the acquired information table 9140 Do.

そして、当該バックアップの情報がファイルサーバ８０００に反映され、ファイルサーバ８０００に反映されたデータが分析用クラウド６のエッジサーバ７０００に転送されることで、分析用クラウド６において当該データの参照が可能となる。本実施例２では、実施例１におけるＥＴＬプログラム４１１０の処理等が無い例を示している。 Then, the information of the backup is reflected in the file server 8000, and the data reflected in the file server 8000 is transferred to the edge server 7000 of the analysis cloud 6, so that the data can be referred to in the analysis cloud 6 Become. In the second embodiment, an example in which there is no processing or the like of the ETL program 4110 in the first embodiment is shown.

以上の実施例２により、分析ソースデータが分析用クラウド６から削除されていても、分析結果のデータから、分析前のデータを辿れるようになる。本実施例２では例えばデータがファイルである例を示した。本実施例２の計算機システムを利用する顧客の観点では、データの位置を意識することなく、データにアクセス可能となる。これにより、追加の分析レポートを迅速に作成可能となる。 According to the above-described second embodiment, even if analysis source data is deleted from the analysis cloud 6, data before analysis can be traced from the data of the analysis result. In the second embodiment, for example, the data is a file. From the viewpoint of the customer using the computer system of the second embodiment, data can be accessed without being aware of the position of the data. This allows for the rapid creation of additional analysis reports.

なお、本発明は上記した実施例に限定されるものではなく、様々な変形例が含まれる。例えば、上記した実施例は本発明を分かりやすく説明するために詳細に記載したものであり、必ずしも説明した全ての構成を備えるものに限定されるものではない。また、ある実施例の構成の一部を他の実施例の構成に置き換えることが可能であり、また、ある実施例の構成に他の実施例の構成を加えることも可能である。また、各実施例の構成の一部について、他の構成の追加、削除、又は置換のいずれもが、単独で、又は組み合わせても適用可能である。 The present invention is not limited to the embodiments described above, but includes various modifications. For example, the embodiments described above are described in detail in order to illustrate the present invention in an easy-to-understand manner, and are not necessarily limited to those having all the configurations described. Also, part of the configuration of one embodiment can be replaced with the configuration of another embodiment, and the configuration of another embodiment can be added to the configuration of one embodiment. In addition, addition, deletion, or replacement of other configurations may be applied singly or in combination with some of the configurations of the respective embodiments.

また、上記の各構成、機能、処理部、及び処理手段等は、それらの一部又は全部を、例えば集積回路で設計する等によりハードウェアで実現してもよい。また、上記の各構成、及び機能等は、プロセッサがそれぞれの機能を実現するプログラムを解釈し、実行することによりソフトウェアで実現してもよい。各機能を実現するプログラム、テーブル、ファイル等の情報は、メモリや、ハードディスク、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）等の記録装置、または、ＩＣカード、ＳＤカード、ＤＶＤ等の記録媒体に置くことができる。 Further, each of the configurations, functions, processing units, processing means, and the like described above may be realized by hardware, for example, by designing part or all of them with an integrated circuit. In addition, each configuration, function, and the like described above may be realized by software by a processor interpreting and executing a program that realizes each function. Information such as a program, a table, and a file for realizing each function can be placed in a memory, a hard disk, a recording device such as an SSD (Solid State Drive), or a recording medium such as an IC card, an SD card, or a DVD.

また、制御線や情報線は説明上必要と考えられるものを示しており、製品上必ずしも全ての制御線や情報線を示しているとは限らない。実際には殆ど全ての構成が相互に接続されていると考えてもよい。 Further, control lines and information lines indicate what is considered to be necessary for the description, and not all control lines and information lines in the product are necessarily shown. In practice, almost all configurations may be considered to be mutually connected.

Claims

Management computer including processor and memory,
A computer system having one or more storage devices connected to the management computer;
The processor reads analysis source data stored in the storage device and stores the analysis source data in a predetermined data storage area.
The processor outputs a result of performing a predetermined analysis on analysis source data of the data storage area as analysis result data,
The processor stores the location of the read analysis source data in data location information;
The processor associates the analysis source data used when generating the analysis result data with the analysis result data, and stores the analysis source data in analysis result generation source information.
The processor updates the data location information when the location of the used analysis source data is changed or deleted.
The processor receives a data acquisition request including information of analysis result data, refers to the analysis result generation source information from the information of the analysis result data, and generates analysis source data used when generating the analysis result data To identify
The computer system, wherein the processor searches the data location information with the identified analysis source data, and identifies a location of the analysis source data.

The computer system according to claim 1, wherein
The computer system, wherein the processor reads the analysis source data from the storage device or the data storage area based on the identified location.

The computer system according to claim 1, wherein
When the analysis result data is held in the management computer, the processor holds the location of the analysis source data in the data location information even if the analysis source data is deleted from the data storage area. A computer system characterized by

The computer system according to claim 1, wherein
The data location information includes information on the location, identifier and date and time of the analysis source data,
The analysis result generation source information includes analysis source data identifier and date and time information,
Identification of analysis source data used when generating the analysis result data is
The identifier and date of analysis source data used when generating the analysis result data are specified by referring to the analysis result generation source information from the information of the analysis result data, and the identifier and date of the specified analysis source data Searching the data location information to identify the location of the analysis source data.

The computer system according to claim 2, wherein
The processor stores information on acquisition when reading the analysis source data in data acquisition information.
When the specified location is other than the data storage area, the data acquisition information is acquired, applied to the storage device, and the analysis source data is read.

The computer system according to claim 2, wherein
The storage device is connected to a management device that manages backup of the analysis source data;
When the location of the analysis source data is a backup, the processor requests the analysis source data from the management device, and the management device restores the analysis source data by restoration and sends it to the management computer. Characteristic computer system.

The computer system according to claim 2, wherein
The storage device is connected to a management device that manages consolidated data of the analysis source data;
When the location of the analysis source data is aggregated data, the processor requests the management device for the analysis source data, and the management device transmits the aggregated data as the analysis source data to the management computer. Characteristic computer system.

An analysis source data management method for managing analysis source data stored in one or more storage devices connected to a management computer including a processor and a memory, comprising:
A first step of the management computer reading analysis source data stored in the storage device and storing the analysis source data in a predetermined data storage area;
A second step of the management computer outputting, as analysis result data, a result of performing a predetermined analysis on analysis source data of the data storage area;
A third step of the management computer storing the location of the read analysis source data in data location information;
A fourth step of the management computer storing the analysis source data used when generating the analysis result data in association with the analysis result data in analysis result generation source information;
A fifth step of updating the data location information when the location of the used analysis source data is changed or deleted by the management computer;
The management computer receives a data acquisition request including information on analysis result data, refers to the analysis result generation source information from the information on the analysis result data, and generates an analysis source used when generating the analysis result data A sixth step of identifying data;
A seventh step of the management computer searching the data location information in the identified analysis source data to specify the location of the analysis source data;
An analysis source data management method characterized in that it comprises:

The analysis source data management method according to claim 8, wherein
The analysis source data management method according to claim 8, further comprising an eighth step of reading the analysis source data from the storage device or the data storage area based on the identified location.

The analysis source data management method according to claim 8, wherein
The fifth step is
When the analysis result data is held in the management computer, the location of the analysis source data is held in the data location information even if the analysis source data is deleted from the data storage area. Analysis source data management method.

The analysis source data management method according to claim 8, wherein
The data location information includes information on the location, identifier and date and time of the analysis source data,
The analysis result generation source information includes analysis source data identifier and date and time information,
The sixth step is
The identifier and date of analysis source data used when generating the analysis result data are specified by referring to the analysis result generation source information from the information of the analysis result data, and the identifier and date of the specified analysis source data A method of managing the analysis source data, wherein the data location information is searched in step (b) to identify the location of the analysis source data.

The analysis source data management method according to claim 9, wherein
The third step is
Storing information on acquisition when reading the analysis source data in data acquisition information;
The eighth step is
The analysis source data management method characterized in that, when the specified location is other than the data storage area, the data acquisition information is acquired and applied to the storage device to read the analysis source data.

The analysis source data management method according to claim 9, wherein
The eighth step is
When the location of the analysis source data is a backup, the storage device is connected to request a management device that manages backup of the analysis source data to request the analysis source data, and the management device restores the analysis source data by restoration. A method of managing an analysis source data, comprising: restoring and transmitting to the management computer.

The analysis source data management method according to claim 9, wherein
The eighth step is
When the location of the analysis source data is aggregated data, the storage device is connected to request the management source data to manage the aggregation data of the analysis source data, and the management device transmits the aggregated data. And transmitting the analysis source data to the management computer.