JP2008107896A

JP2008107896A - Physical resource control management system, physical resource control management method and physical resource control management program

Info

Publication number: JP2008107896A
Application number: JP2006287536A
Authority: JP
Inventors: Shinji Kami; 伸治加美
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2006-10-23
Filing date: 2006-10-23
Publication date: 2008-05-08

Abstract

PROBLEM TO BE SOLVED: To solve the problem that, in a virtual environment, it is difficult to implement fast fail-over while hiding failures in physical resources from applications (processes).

SOLUTION: A physical resource control management system comprises a hardware space 6100 including the physical resources in the system, and a software space 6500 including software programs. In the hardware space, at least one central processing part 6121 and other physical resources are connected by a data transfer line 6131, and some or all of the physical resources have active state check parts. The software space has a virtualization means 6550, at least one virtual resource space 6520, and a virtual device 6510 operating thereon, and the virtualization means has a resource allocation means 6551, a failure management means 6552 and a resource access means 6554. In this configuration, the failure management means controls the resource allocation means by coordinating hardware failure management and software failure management.

COPYRIGHT: (C)2008,JPO&INPIT

Description

The present invention relates to a control management method for IT systems that use CPUs, I/O devices, and the like as physical resources, and for network (NW) systems that use CPUs, line cards, and the like as physical resources, and in particular to a control management method for failure management.

In failure management for IT and network systems, a duplex (redundant) structure is often adopted to improve fault tolerance. In a duplex structure, standby resources are configured in addition to the active resources to increase redundancy, so that if a failure occurs in the active system, operation switches to the standby resources and a service outage is prevented. In general, an N+M configuration (N: number of active systems, M: number of standby systems) can be used. It is also common to perform recovery from a failure between the service and the physical resources and to hide the failure from the service, so that no special mechanism has to be built into the service.

Failure management methods fall into two categories: software-based and hardware-based.

Examples of software-based management include (1) a VMM and (2) software RAID. Examples of hardware-based management include hardware RAID (Non-Patent Document 2).

In recent years, virtual device configurations based on resource virtualization have become mainstream in computing environments. As shown in FIG. 8, for example, a virtualization layer is inserted between the physical resource 1001 and the conventional device access means 1006, and the virtualization means 1003 provides the virtual resource 1002 to a virtual device 1004 that includes a plurality of device access means 1006. This makes it possible to run a plurality of different virtual devices 1004 on one set of resources. Because the virtualization means 1003 schedules accesses to the physical resource 1001 from the plurality of virtual devices 1004, each process 1005 can access devices through the device access means 1006 in the same way as before. Through virtualization, the details of the physical resource 1001 are hidden from the virtual device 1004 (or from the guest OS installed in the virtual device 1004).

Virtualization architectures vary. As one example, FIG. 9 shows the Xen architecture described in Non-Patent Document 1. Xen has a hypervisor 2002 as the virtualization layer over the physical resource 2001, and a dedicated privileged domain 2003 that holds the device drivers and the like needed for device access and mediates access by the virtual device 2004 to the physical resource 2001.

Using this hiding structure, it is possible in principle to hide a failure: even if a physical resource fails, the corresponding virtual resource is switched over while the failure remains hidden from the guest OS. This method is described in, for example, Patent Document 1.

As shown in FIG. 10, the basic configuration of a software-based redundancy management method consists of physical resources 3001 belonging to the hardware space 3002, and device access means 3004, redundancy management means 3005, and a process 3006 belonging to the software space 3003. The redundancy management means 3005 can access the plurality of physical resources 3001 and the corresponding device access means 3004, and presents, for example, a pair of physical resources in a duplex configuration to the process 3006 as a single abstract device. Regardless of the actual state of the physical resources 3001, the process 3006 accesses devices through the interface defined with the redundancy management means 3005, so the process can be designed as if it accessed a single device even though the device is actually duplicated. If one of the duplicated physical resources fails, the redundancy management means 3005 switches its settings to the other device access means and physical resource, and the failure can thereby be hidden from the process 3006.
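
The arrangement of FIG. 10 can be illustrated with a minimal Python sketch. The class names `DeviceAccess` and `RedundancyManager`, the `read` method, and the resource labels are hypothetical stand-ins chosen for illustration; the patent does not define a programming interface. The sketch assumes that a failed resource is noticed only when an access raises an error.

```python
class DeviceFailure(Exception):
    """Raised by a device access means when its physical resource has failed."""


class DeviceAccess:
    """Hypothetical stand-in for one device access means 3004 (a driver)."""

    def __init__(self, name):
        self.name = name
        self.failed = False

    def read(self):
        if self.failed:
            raise DeviceFailure(self.name)
        return f"data via {self.name}"


class RedundancyManager:
    """Hypothetical stand-in for redundancy management means 3005.

    Presents a duplexed pair of devices to the process as one abstract device.
    """

    def __init__(self, active, standby):
        self.active = active
        self.standby = standby

    def read(self):
        try:
            return self.active.read()
        except DeviceFailure:
            # Software failover: swap roles and retry on the former standby.
            self.active, self.standby = self.standby, self.active
            return self.active.read()


dev_a = DeviceAccess("resource-A")
dev_b = DeviceAccess("resource-B")
abstract_device = RedundancyManager(dev_a, dev_b)

first = abstract_device.read()   # served by resource-A
dev_a.failed = True              # simulate a failure of the active resource
second = abstract_device.read()  # transparently served by resource-B
```

The calling code never learns which resource answered, which is the hiding property the paragraph describes. Detection here happens only on access, which hints at why a purely software path tends to be slow.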

The software processing described above is highly flexible because, among other things, it requires no dedicated hardware. However, because everything from failure detection to switchover is performed by software processes, it has the drawback of a long processing time.

Another example is a hardware-based RAID system.

FIG. 11 shows a schematic diagram of a RAID system as an example of a hardware-based redundancy management method. This system consists of a disk device 4005 and a software program 4004. The disk device 4005 has physical disks 4001, such as hard disks, and a memory 4007, and the software program 4004 has a process 4006 and device access means 4008. In this method, data 4011 stored on one physical disk is always mirrored (copied) as data 4012 on another physical disk, and the details are hidden from the user (the software operation means 4004, such as the process 4006 and the device access means 4008), so that the user appears to access a single piece of data 4013 in the memory 4007. As a result, even if the physical disk holding the data 4011 fails, the data 4012 on the other physical disk is supplied to the process, and the process is not affected by the failure. This redundancy management is performed by the controller 4002, which is dedicated hardware, so the software operation means 4004, such as the device access means 4008 and the process 4006, does not need to be aware of it. This is hiding of redundancy management at the hardware level using dedicated hardware.

As a method for speeding up failure recovery, redundantly configured hardware units can autonomously monitor each other's liveness (alive or dead state). A master (active) and a slave (standby) are designated and monitor each other, and the slave is configured so that, for example, when the master fails and the slave detects it, the slave starts operating as the master. Because the failure is detected and the switchover is performed in hardware, recovery is fast. For example, as shown in FIG. 12, for a working path 5003 and a standby path 5004 in a network, data is copied at the branch point 5001 during normal operation and is always transferred over both paths, and the branch point 5002 receives and forwards the data from the working path. The liveness of the working path and the standby path is constantly checked in hardware, for example by periodically sending test signals, and when a failure is observed on the working path, the branch point 5002 switches in hardware to forwarding the data from the standby path. This method enables fast failure recovery.
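
The master/slave monitoring described above can be sketched as a small deterministic simulation in Python. The class name `KeepaliveMonitor`, the tick-based clock, and the `missed_limit` parameter are assumptions made for illustration; real implementations perform this check in hardware rather than in software.

```python
class KeepaliveMonitor:
    """Standby-side monitor that promotes itself when keepalives stop arriving.

    Time is modeled as integer ticks so that the example is deterministic.
    """

    def __init__(self, interval, missed_limit=1):
        self.interval = interval          # expected keepalive period, in ticks
        self.missed_limit = missed_limit  # number of silent periods tolerated
        self.last_seen = 0
        self.role = "standby"

    def on_keepalive(self, now):
        """Record that the peer was alive at tick `now`."""
        self.last_seen = now

    def poll(self, now):
        """Promote the standby once the peer has been silent for too long."""
        silent_for = now - self.last_seen
        if self.role == "standby" and silent_for > self.interval * self.missed_limit:
            self.role = "active"
        return self.role


monitor = KeepaliveMonitor(interval=10)
promoted_at = None
for now in range(60):
    # The master sends keepalives at ticks 0, 10 and 20, then fails silently.
    if now <= 20 and now % 10 == 0:
        monitor.on_keepalive(now)
    if monitor.poll(now) == "active" and promoted_at is None:
        promoted_at = now
```

In this sketch the detection delay is bounded by roughly one keepalive interval, so shortening the interval shortens recovery at the cost of more frequent signalling.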

The hardware-based processing described above recovers quickly, but it has drawbacks: it requires dedicated hardware, and it lacks management flexibility, for example in setting up redundant configurations.

Patent Document 1: US Patent Application Publication No. 2005/0246718
Non-Patent Document 1: Paul Barham et al., "Xen and the Art of Virtualization", Proceedings of the Nineteenth ACM Symposium on Operating Systems Principles, pp. 164-177, Bolton Landing, NY, USA, 2003
Non-Patent Document 2: David A. Patterson et al., "A Case for Redundant Arrays of Inexpensive Disks (RAID)", Proceedings of the 1988 ACM SIGMOD International Conference on Management of Data, pp. 109-116

However, with the configurations described above, it is difficult in a virtualized environment to perform fast failover while hiding physical resource failures from applications (processes) without using a dedicated hardware controller.

The reason is that failover in current virtualized environments is software-based failure management, so recovery takes time. In addition, because hardware-based fast recovery methods and software-based recovery methods cannot be coordinated, it has been difficult to apply hardware-based fast autonomous recovery methods, such as keepalive methods, to a virtualized environment with their speed intact, unless a dedicated hardware controller is used to hide them from the software.

(Object of the Invention)
An object of the present invention is to provide a system that, without using an expensive dedicated hardware controller, performs flexible, fast failover in a virtualized environment in which hardware-based and software-based methods can cooperate, while hiding physical failures from the virtual device, from the guest OS installed in the virtual device, or from the processes.

Another object of the present invention is to provide a system that can autonomously make optimal redundancy settings so as to satisfy management policies, such as the priority of each virtual device, even in a complex management environment in which hardware with a fast failure recovery function, such as a keepalive method, is mixed with ordinary hardware that lacks one.

The physical resource control management system of the present invention comprises a plurality of physical resources, and a computer component equipped with software that functions as a virtual device on which at least one software program runs and as virtualization means that enables the virtual device to share the plurality of physical resources.
The virtualization means has resource allocation means for allocating the plurality of physical resources to the virtual device, and failure management means that, when a failure occurs in one or more of the plurality of physical resources and hardware-based switching control to another physical resource is executed, controls, in cooperation with that switching control, the software-based switching to the other physical resource by the resource allocation means.

The physical resource control management method of the present invention is a method for a physical resource control management system that comprises a plurality of physical resources, and a computer component equipped with software that functions as a virtual device on which at least one software program runs and as virtualization means that enables the virtual device to share the plurality of physical resources, the virtualization means having resource allocation means for allocating the plurality of physical resources to the virtual device. The method comprises:
a step of switching to another physical resource by hardware when a failure occurs in one or more of the plurality of physical resources; and
a step of switching to the other physical resource by software in the resource allocation means, in cooperation with the hardware switching control.

The physical resource control management program of the present invention causes a computer to function as a virtual device on which at least one software program runs and as virtualization means that enables the virtual device to share a plurality of physical resources.
The virtualization means functions as resource allocation means for allocating the plurality of physical resources to the virtual device, and as failure management means that, when a failure occurs in one or more of the plurality of physical resources and hardware-based switching control to another physical resource is executed, controls, in cooperation with that switching control, the software-based switching to the other physical resource by the resource allocation means.

According to the present invention, fast failover can be performed in a virtualized environment while hiding physical resource failures from applications and processes, without using a dedicated hardware controller. The reason is that, within the hiding structure of the virtualized environment, the hardware failure recovery method and the software failure recovery method cooperate without sacrificing recovery speed.

A representative embodiment of the present invention consists of a hardware space, which is the set of physical resources in the system, and a software space, which is the set of software programs that run on it. In the hardware space, at least one central processing unit and other physical resources are connected by a data transfer path, and some or all of the physical resources have a life/death confirmation unit. The software space has virtualization means, at least one virtual resource space, and a virtual device that operates on it. The virtualization means has resource allocation means, failure management means, and resource access means. With this configuration, the failure management means can coordinate hardware-based failure management with software-based failure management and control the resource allocation means.

Embodiments of the present invention are described below with reference to the drawings.

[First Embodiment]
First, a first embodiment of the present invention will be described in detail with reference to the drawings.

Referring to FIG. 1, the first embodiment of the present invention consists of a hardware space 6100 and a software space 6500. The hardware space 6100 includes at least a physical resource 6101 and a physical resource 6102, typified by I/O devices, and a central processing unit (central processing means) 6121, typified by a CPU. These are connected by a data transfer path (data transfer means) 6131. The data transfer path is, for example, a system bus typified by a PCI bus, but is not limited to this. The physical resource 6101 is provided with a life/death confirmation unit (life/death confirmation means) 6111, and the physical resource 6102 is provided with a life/death confirmation unit 6112.

The software space 6500 has virtualization means 6550, at least one virtual resource space 6520, and a virtual device 6510 that operates on it. The virtualization means 6550 has resource allocation means 6551, failure management means 6552, resource access means 6553, and resource access means 6554. The virtual device 6510 has resource access means 6511. The virtualization means 6550, the virtual resource space 6520, and the virtual device 6510 are programs and data stored in a semiconductor memory such as a DRAM or in a hard disk device, and are executed by the central processing unit, typified by a CPU, in the hardware space 6100.

Next, an outline of the operation of these means is given.

In the hardware space 6100, the physical resource 6101 and the physical resource 6102, typified by I/O devices, and the central processing unit 6121, typified by a CPU, are connected by the data transfer path 6131, typified by a PCI bus. The physical resource 6101, set as the active system, and the physical resource 6102, set as the standby system, check each other's liveness by having their life/death confirmation units 6111 and 6112 exchange a life/death confirmation signal 6113, typified by a keepalive signal. The physical resource 6101 and the physical resource 6102 form a redundant pair. When the life/death confirmation signal 6113 stops arriving and one resource detects a failure of its partner, it sends a signal reporting the failure to the central processing unit 6121 over the data transfer path 6131. On receiving that signal, the central processing unit 6121 suspends its current processing and sends a failure occurrence signal to the failure management means 6552 in the software space 6500.

The software space 6500 consists of software programs that run on the hardware space (physical resource space) 6100. The virtualization means 6550, which belongs to the software space 6500, provides the virtual device 6510 with a virtual resource space 6520 that contains virtual resources 6521 obtained by virtualizing the hardware space 6100. The virtual device 6510 operates on the virtual resource space 6520 as if it were running on an equivalent hardware space. The virtual device 6510 has resource access means 6511, typified by a device driver, which gives the various software processes running in the virtual device 6510 a means of accessing resources. The resource access means 6511 presents the virtual resource space 6520 to those software processes and, when they request access to a resource in the virtual resource space 6520, actually forwards the request to the resource allocation means 6551 of the virtualization means 6550.

The virtualization means 6550 can generally host a plurality of virtual devices 6510 in the same system. It provides each of them with a virtual resource space 6520 and mediates their accesses to the physical resources, which allows the plurality of virtual devices 6510 to share the hardware space 6100.

For this purpose, the virtualization means 6550 controls the resource access means 6553 and the resource access means 6554, which directly access the physical resource 6101 and the physical resource 6102, and uses the resource allocation means 6551 to control the connections between those resource access means and the resource access means 6511 in the virtual device 6510. Sharing of the hardware space (physical resource space) 6100 is realized in this way.

The resource allocation means 6551 controls physical resource accesses from the resource access means 6511 of the plurality of virtual devices 6510 by permitting, according to settings specified in advance, each virtual device 6510 to connect to the resource access means for the physical resources it is allowed to access. The resource allocation means 6551 also sets up redundant configurations. For example, suppose that, to raise the availability level of the virtual resource 6521, the physical resource 6101 and the physical resource 6102 are actually used as a duplexed pair. With the physical resource 6101 as the active system and the physical resource 6102 as the standby system, the resource allocation means 6551 normally keeps the resource access means 6511 of the virtual device 6510 connected to the resource access means 6553. When a failure occurs in the physical resource 6101, recovery is achieved by switching the connection from the resource access means 6553 to the resource access means 6554. This is the software-based failure recovery method described above: the failure is hidden from the virtual device 6510, and the virtual device 6510 needs no configuration of its own to obtain redundancy.
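
The connection control described above can be sketched in Python as follows. The class `ResourceAllocator`, its methods, and the string labels such as `access-6553` are hypothetical names that merely echo the reference numerals; the patent does not specify data structures, so the two dictionaries are an assumption.

```python
class ResourceAllocator:
    """Hypothetical stand-in for resource allocation means 6551."""

    def __init__(self):
        self.redundancy = {}  # virtual resource -> (active backend, standby backend)
        self.connection = {}  # virtual resource -> backend currently connected

    def configure(self, virtual_resource, active, standby):
        """Set up a 1+1 redundant pair and connect the active side."""
        self.redundancy[virtual_resource] = (active, standby)
        self.connection[virtual_resource] = active

    def switch(self, virtual_resource):
        """Swap the roles of the pair and reconnect to the new active backend."""
        active, standby = self.redundancy[virtual_resource]
        self.redundancy[virtual_resource] = (standby, active)
        self.connection[virtual_resource] = standby
        return standby

    def route(self, virtual_resource, request):
        """Forward a request from the virtual device to the connected backend."""
        backend = self.connection[virtual_resource]
        return f"{backend} handles {request}"


allocator = ResourceAllocator()
allocator.configure("virtual-6521", active="access-6553", standby="access-6554")

before = allocator.route("virtual-6521", "read")
allocator.switch("virtual-6521")  # performed when physical resource 6101 fails
after = allocator.route("virtual-6521", "read")
```

The virtual device keeps issuing requests against `virtual-6521` throughout; only the mapping held by the allocator changes, which is why no redundancy configuration is needed inside the virtual device.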

The virtualization means 6550 further has failure management means 6552. When it receives a signal from the central processing unit 6121, typified by an interrupt, the failure management means 6552 looks up in a table registered in advance which processing to perform, and executes it. For example, on an interrupt raised at the time of a failure, it issues a command to change the connection settings described above.
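
A table-driven handler of this kind might look like the following Python sketch. The class `FailureManager`, the signal names, and the behavior of ignoring unregistered signals are assumptions for illustration; the patent states only that the processing is looked up in a table registered in advance.

```python
class FailureManager:
    """Hypothetical stand-in for failure management means 6552."""

    def __init__(self):
        self.table = {}  # signal identifier -> action registered in advance

    def register(self, signal, action):
        """Register the processing to run when `signal` is received."""
        self.table[signal] = action

    def on_interrupt(self, signal):
        """Look up the registered processing for `signal` and execute it."""
        action = self.table.get(signal)
        if action is None:
            return "ignored"  # assumption: unregistered signals are dropped
        return action()


commands = []


def change_connection():
    # Stands in for the connection change command sent to means 6551.
    commands.append("switch 6553 -> 6554")
    return "handled"


manager = FailureManager()
manager.register("FAILOVER_IRQ", change_connection)

result = manager.on_interrupt("FAILOVER_IRQ")
unknown = manager.on_interrupt("UNREGISTERED_IRQ")
```

Keeping the mapping in a table means the response to each hardware signal can be changed by registration alone, without modifying the dispatch logic.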

Next, the operation from failure occurrence to failure recovery in this embodiment is described with reference to the flowchart shown in FIG. 2.

(Step S101)
Assume that the physical resource 6101 is set as the active resource and the physical resource 6102 as the standby resource. The two physical resources periodically check each other's liveness by exchanging the life/death confirmation signal 6113 between the life/death confirmation units 6111 and 6112. In addition, 1+1 redundancy is set for the virtual resource 6521 of the virtual device 6510, and during normal operation the resource allocation means 6551 connects the resource access means 6511 of the virtual device 6510 to the resource access means 6553, which accesses the active physical resource 6101.

(Step S102)
A failure, typified by a power failure, occurs in the physical resource 6101 set as the active system, and it can no longer continue the service.

(Step S103)
Because the life/death confirmation unit 6111 can no longer exchange the life/death confirmation signal 6113 with the life/death confirmation unit 6112 of the physical resource 6102, the life/death confirmation unit 6112 detects the absence of a response within, at the shortest, one signal transmission interval. It thereby confirms the failure of the physical resource 6101 and changes the state setting of the physical resource 6102 to active.

(Step S104)
The physical resource 6102 sends a failure detection notification signal, typified by an interrupt signal, and a notification of its state change from standby to active to the central processing unit 6121.

(Step S105)
On receiving the signal from the physical resource 6102, the central processing unit 6121 suspends the processing it is currently performing and passes the interrupt signal to the failure management means 6552.

(Step S106)
The failure management means 6552 looks up the processing that corresponds to the interrupt signal in the table registered in advance.

(Step S107)
According to the processing it found, the failure management means 6552 sends a connection change command to the resource allocation means 6551. Following the command, the resource allocation means 6551 switches the connection destination from the resource access means 6553 to the resource access means 6554, which synchronizes the software side with the operating state of the hardware space, where the physical resource 6102 has already been set as the active system.

(Step S108)
The resource access means 6511 can now access the physical resource 6102, and the service can continue.
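
Steps S101 to S108 can be traced end to end with a short Python sketch. The function `run_failover`, the dictionaries, and the signal name `failover` are hypothetical; the hardware steps, which the patent performs in the life/death confirmation units, are modeled here as plain assignments.

```python
def run_failover():
    """Trace steps S101-S108 and return the resulting state and event log."""
    trace = []

    # S101: 6101 is active and 6102 is standby; front end 6511 is connected
    # to access means 6553, which serves physical resource 6101.
    hw_active, hw_standby = "6101", "6102"
    access_for = {"6101": "6553", "6102": "6554"}
    connection = {"6511": access_for[hw_active]}
    handler_table = {"failover": "change_connection"}  # registered in advance

    # S102: the active resource fails.
    trace.append("S102 resource 6101 fails")

    # S103: the standby misses the keepalive and promotes itself (hardware).
    hw_active, hw_standby = hw_standby, hw_active
    trace.append(f"S103 resource {hw_active} becomes active")

    # S104-S105: an interrupt reaches the CPU, which forwards it to the
    # failure management means.
    signal = "failover"
    trace.append("S104-S105 interrupt delivered")

    # S106: look up the registered processing for the signal.
    action = handler_table[signal]
    trace.append(f"S106 lookup -> {action}")

    # S107: the software connection is resynchronized with the hardware state.
    if action == "change_connection":
        connection["6511"] = access_for[hw_active]
    trace.append(f"S107 6511 connected to {connection['6511']}")

    # S108: the front end reaches the new active resource.
    trace.append("S108 service continues")
    return hw_active, connection, trace


active, connection, trace = run_failover()
```

The ordering matters: the hardware role change in S103 happens before any software runs, and S107 only brings the software mapping into line with it.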

Next, the effects of this embodiment are described.

In the conventional software-based failure recovery method, when a failure occurs in a physical resource, the physical resource 6101 stops responding to the resource access means of the virtual device. The failure is detected through processing such as a timeout, the connection is changed from the active system to the standby system, the operating state of other software and hardware is changed as necessary, and the failure recovery processing ends. The processing time from failure occurrence to recovery is therefore generally long. Alternatives for shortening it are conceivable, such as constantly sending life/death confirmation signals from the resource access means of the virtualization means, but shortening the failure detection time in this way increases the CPU load correspondingly. In contrast, according to this embodiment, life/death confirmation is performed by a hardware method, which makes the most of its speed, and raising an interrupt from the hardware allows synchronization with the software-based switching by the resource allocation means 6551 without significant delay. Fast failure recovery is therefore possible with the failure hidden from the virtual device and without a special hardware structure for hiding failures.

［第２の実施形態］
次に本発明の第２の実施の形態について図３を参照して詳細に説明する。
[Second Embodiment]
Next, a second embodiment of the present invention will be described in detail with reference to FIG. 3.

本発明の第２の実施の形態は、ハードウェア空間７１００とソフトウェア空間７５００とから構成される。 The second embodiment of the present invention includes a hardware space 7100 and a software space 7500.

ハードウェア空間７１００は、データ転送路７１５０によって相互に接続されたI/Oデバイス７１０１、I/Oデバイス７１０２、およびCPU７１０３を少なくともひとつずつ以上有する。I/Oデバイス７１０１および７１０２は死活確認部７１１１および７１１２によってキープアライブ信号７２０１を通して互いに死活確認を行っている。ここで、Ｉ／Ｏデバイス７１０１および７１０２がネットワークカードであった場合、Ｉ／Ｏデバイス７１０１および７１０２との間での死活確認部部７１１１および７１１２によるネットワークの端点同士の死活確認のほかに、ネットワーク上のノードとの死活確認信号によるそれぞれのパスの死活確認部が別に搭載されていてもよい。 The hardware space 7100 includes at least one each of an I/O device 7101, an I/O device 7102, and a CPU 7103, interconnected by a data transfer path 7150. The I/O devices 7101 and 7102 check each other's liveness by exchanging a keep-alive signal 7201 through their alive confirmation units 7111 and 7112. If the I/O devices 7101 and 7102 are network cards, then in addition to this alive confirmation between the network end points by the alive confirmation units 7111 and 7112, a separate alive confirmation unit may be provided for each path, using alive confirmation signals exchanged with nodes on the network.

ソフトウェア空間７５００は、オペレーティングシステムなどに代表される仮想化手段７５８０および一つ以上の仮想装置７５３０を有する。仮想化手段７５８０は、Ｉ／Ｏデバイス７１０１へのアクセス手段であるデバイスドライバ７５０１、Ｉ／Ｏデバイス７１０２へのアクセス手段であるデバイスドライバ７５０２、障害管理手段７５５０、資源割当手段７５０３、バックエンドデバイスドライバ７５０４を有する。さらに障害管理手段７５５０は、管理者７９００にアクセス手段を提供するインターフェース手段７５５１、処理管理手段７５５４、情報格納手段７５５３を有する。また、仮想装置７５３０はプロセス７５３２およびフロントエンドデバイスドライバ７５３１を有する。 The software space 7500 includes virtualization means 7580 typified by an operating system and one or more virtual devices 7530. The virtualization unit 7580 includes a device driver 7501 that is an access unit to the I / O device 7101, a device driver 7502 that is an access unit to the I / O device 7102, a failure management unit 7550, a resource allocation unit 7503, and a back-end device driver. 7504. Further, the failure management unit 7550 includes an interface unit 7551 that provides an access unit to the administrator 7900, a process management unit 7554, and an information storage unit 7553. The virtual device 7530 includes a process 7532 and a front-end device driver 7531.

次に、これらの手段の動作について説明する。I/Oデバイス７１０１および７１０２はNIC（ネットワークインターフェースカード）やディスクに代表されるInput/Outputデバイスであるが、必ずしもこれらに限定するものではない。一般にユーザプロセスはこれらのI/Oデバイスを外部システムとの通信やディスクへのデータアクセスなどの目的に用いる。CPU７１０３はPentium（登録商標）やXeonなどに代表される中央演算処理装置であり、データ転送路７１５０はPCIバスなどに代表されるシステムバスであるが必ずしもこれらに限定されるものではない。I/Oデバイス７１０１および７１０２は冗長ペアを構成しており片方が現用系、もう一方が待機系に設定される（ここではI/Oデバイス７１０１が現用系とする）。死活確認部７１１１および７１１２はキープアライブ信号７２０１を一定間隔で交信することで互いに死活確認をしている。 Next, the operation of these means will be described. The I / O devices 7101 and 7102 are input / output devices represented by NIC (network interface card) and disk, but are not necessarily limited to these. In general, a user process uses these I / O devices for purposes such as communication with an external system and data access to a disk. The CPU 7103 is a central processing unit typified by Pentium (registered trademark) and Xeon, and the data transfer path 7150 is a system bus typified by a PCI bus, but is not necessarily limited thereto. The I / O devices 7101 and 7102 form a redundant pair, and one is set as the active system and the other is set as the standby system (here, the I / O device 7101 is set as the active system). The life and death confirmation units 7111 and 7112 mutually confirm the life and death by communicating keep-alive signals 7201 at regular intervals.

もしI/Oデバイス７１０１に障害が発生し、死活確認部７１１２がキープアライブ信号７２０１の交信に異常を検出すると、I/Oデバイス７１０１を障害と判定し、自分の運用状態を待機系から現用系に変更し、障害通知割り込み信号７２０２をCPU７１０３に送信する。CPU７１０３はこの割り込み信号７２０２を検出すると、現在の処理を一旦停止し、障害管理手段７５５０に割り込み信号７２０３を送信する。 If a failure occurs in the I/O device 7101 and the alive confirmation unit 7112 detects an abnormality in the exchange of the keep-alive signal 7201, the I/O device 7102 judges that the I/O device 7101 has failed, changes its own operating state from standby to active, and sends a failure notification interrupt signal 7202 to the CPU 7103. On detecting the interrupt signal 7202, the CPU 7103 suspends its current processing and sends an interrupt signal 7203 to the failure management unit 7550.
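
The standby-side behaviour described above can be sketched in software as follows. This is only an illustrative simulation: in the embodiment the check runs in hardware on the I/O device, and the names `KeepAliveMonitor`, `IRQ_PEER_FAILED`, the one-second interval and the three-miss threshold are all assumptions, not values from the patent.

```python
from dataclasses import dataclass
from typing import Optional

IRQ_PEER_FAILED = 7202  # hypothetical interrupt ID, named after signal 7202


@dataclass
class KeepAliveMonitor:
    """Software stand-in for the standby-side alive confirmation unit 7112."""
    interval: float = 1.0   # assumed keep-alive period in seconds
    max_missed: int = 3     # assumed number of missed signals tolerated
    role: str = "standby"
    last_seen: float = 0.0
    notified: bool = False

    def on_keepalive(self, now: float) -> None:
        """Record the arrival time of a keep-alive signal from the peer."""
        self.last_seen = now

    def poll(self, now: float) -> Optional[int]:
        """Return the interrupt ID once, when the peer is judged to have failed."""
        if self.notified:
            return None
        if now - self.last_seen > self.interval * self.max_missed:
            self.role = "active"      # the standby device takes over first...
            self.notified = True
            return IRQ_PEER_FAILED    # ...and then notifies the CPU
        return None
```

The ordering inside `poll` mirrors the text: the standby device changes its own operating state before the interrupt is raised, so the device is already active by the time software reacts.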

デバイスドライバ７５０１および７５０２はそれぞれI/Oデバイス７１０１および７１０２へのアクセス手段を提供するソフトウェアプログラムであり、デバイス固有に作成される。仮想装置７５３０内のフロントエンドデバイスドライバ７５３１は、プロセス７５３２に、仮想資源７３０１の存在を示し、プロセス７５３２が普通の物理資源にアクセスする場合と同様のインターフェースを提供している。なお、プロセス７５３２が直接フロントエンドデバイスドライバ７５３１にアクセスする場合もあり、またカーネルプロセスのような別プロセスが間に介在する場合もある。フロントエンドデバイスドライバ７５３１は仮想資源７３０１へのアクセス要求を受け、あらかじめ対応付けされているバックエンドデバイスドライバ７５０４に対してアクセス要求を転送する。この対応付けは非特許文献１に記載のXENのように、実態は共有メモリへのアクセスという形でデータ転送を行う手法などによって実現されるが、必ずしもこれに限定するものではない。 Device drivers 7501 and 7502 are software programs that provide access means to the I / O devices 7101 and 7102, respectively, and are created unique to the device. The front-end device driver 7531 in the virtual device 7530 indicates the existence of the virtual resource 7301 to the process 7532 and provides an interface similar to that used when the process 7532 accesses a normal physical resource. Note that the process 7532 may directly access the front-end device driver 7531, and another process such as a kernel process may intervene. The front-end device driver 7531 receives an access request to the virtual resource 7301 and transfers the access request to the back-end device driver 7504 associated in advance. This association is realized by a method of transferring data in the form of access to a shared memory, such as XEN described in Non-Patent Document 1, but is not necessarily limited to this.

バックエンドデバイスドライバ７５０４はフロントエンドデバイスドライバ７５３１およびデバイスドライバ７５０１，７５０２と対応付けられており、デバイスドライバ７５０１，７５０２との仲介を行う。バックエンドデバイスドライバ７５０４とデバイスドライバ７５０１，７５０２の間に資源割当手段７５０３を介することで、ソフトウェア方式による障害隠蔽構造を形成している。正常動作時はバックエンドデバイスドライバ７５０４は現用系I/Oデバイスへのアクセス手段であるデバイスドライバ７５０１に接続されている。 The back-end device driver 7504 is associated with the front-end device driver 7531 and the device drivers 7501 and 7502, and mediates between the device drivers 7501 and 7502. A software-based fault concealment structure is formed by interposing resource allocation means 7503 between the back-end device driver 7504 and the device drivers 7501 and 7502. During normal operation, the back-end device driver 7504 is connected to a device driver 7501 that is an access means to the active I / O device.
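
The indirection through the resource allocation unit can be illustrated with a minimal sketch. The class and method names below are hypothetical; the point is only that the back-end driver always talks to the allocator, so the target device driver can be swapped without the back-end (or anything above it) noticing.

```python
class DeviceDriver:
    """Stand-in for a device-specific driver such as 7501 or 7502."""

    def __init__(self, name: str) -> None:
        self.name = name

    def access(self, request: str) -> str:
        return f"{self.name}: {request}"


class ResourceAllocator:
    """Stand-in for the resource allocation unit 7503."""

    def __init__(self, active: DeviceDriver) -> None:
        self._target = active

    def switch_to(self, driver: DeviceDriver) -> None:
        # Reconnect the back-end side to a different device driver.
        self._target = driver

    def forward(self, request: str) -> str:
        return self._target.access(request)


class BackendDriver:
    """Stand-in for the back-end device driver 7504."""

    def __init__(self, allocator: ResourceAllocator) -> None:
        self._allocator = allocator

    def submit(self, request: str) -> str:
        # The back-end never holds a direct reference to a device driver.
        return self._allocator.forward(request)
```

With this arrangement a call to `switch_to` is the only action needed on the software side at failover time.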

障害管理手段７５５０は物理資源に障害が発生したときにハードウェア障害復旧方式とソフトウェア障害復旧方式を同期動作させるものである。この目的のため、管理者７９００とのインターフェース手段７５５１、情報格納手段７５５３、処理管理手段７５５４を有する。管理者７９００はあらかじめ、仮想資源７３０１の冗長構成、障害が発生した時の処理方法についての設定をインターフェース手段７５５１を通して行う。たとえば具体的には、冗長構成としてI/Oデバイス７１０１を現用系、I/Oデバイス７１０２を待機系とする１＋１冗長とし、I/Oデバイス７１０２からの障害通知の割り込み信号が検出されたら、資源割当手段７５０３でのバックエンドデバイスドライバ７５０４とデバイスドライバ７５０１との接続を、バックエンドデバイスドライバ７５０４とデバイスドライバ７５０２との接続に切り替える、という処理を登録しておく。割り込み信号はたとえば各障害や割り込み信号送信元デバイスに対して唯一の値となるＩＤとして管理される。この処理情報はたとえば各割り込み信号のＩＤに対してテーブルとして管理され、メモリやディスクに代表される情報格納手段７５５３に格納される。処理管理手段７５５４はCPU７１０３からの割り込み信号を契機に、その割り込み信号IDから該当処理を上記の情報格納手段７５５３内のテーブルを検索し、その処理を実行する（ここでは資源制御手段７５０３に接続切替命令を発行する）。ここで、該当処理が見つからない場合はエラーメッセージをインターフェース手段７５５１を通して管理者７９００に通知する、などの処理も登録しておくことも可能である。 The failure management unit 7550 makes the hardware failure recovery method and the software failure recovery method operate in synchronization when a failure occurs in a physical resource. For this purpose it has an interface unit 7551 to the administrator 7900, an information storage unit 7553, and a process management unit 7554. Through the interface unit 7551, the administrator 7900 configures in advance the redundant configuration of the virtual resource 7301 and the processing to perform when a failure occurs. As a concrete example, the redundant configuration is 1+1 redundancy with the I/O device 7101 as the active system and the I/O device 7102 as the standby system, and the following process is registered: when a failure notification interrupt signal from the I/O device 7102 is detected, the resource allocation unit 7503 switches the connection between the back-end device driver 7504 and the device driver 7501 to a connection between the back-end device driver 7504 and the device driver 7502. Interrupt signals are managed, for example, as IDs that are unique to each failure and each interrupt source device. The processing information is managed, for example, as a table keyed by interrupt signal ID and stored in the information storage unit 7553, typified by memory or disk. Triggered by an interrupt signal from the CPU 7103, the process management unit 7554 looks up the corresponding process in the table in the information storage unit 7553 by the interrupt signal ID and executes it (here, it issues a connection switching command to the resource allocation unit 7503). A process for the case where no corresponding process is found, such as notifying the administrator 7900 of an error message through the interface unit 7551, can also be registered.
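
The table lookup by interrupt ID can be sketched as a plain dictionary dispatch. The names `ProcessManager`, `register`, and `on_interrupt`, and the interrupt ID `1`, are hypothetical; the patent only states that processes are stored in a table keyed by interrupt signal ID and that a fallback such as an error notification may be registered.

```python
from typing import Callable, Dict, List, Optional


class ProcessManager:
    """Stand-in for the process management unit 7554 and its table in 7553."""

    def __init__(self, notify_admin: Callable[[str], None]) -> None:
        self._table: Dict[int, Callable[[], str]] = {}
        self._notify_admin = notify_admin

    def register(self, interrupt_id: int, action: Callable[[], str]) -> None:
        """Store the process to run for a given interrupt signal ID."""
        self._table[interrupt_id] = action

    def on_interrupt(self, interrupt_id: int) -> Optional[str]:
        """Look up and run the registered process, or report that none exists."""
        action = self._table.get(interrupt_id)
        if action is None:
            self._notify_admin(
                f"no process registered for interrupt ID {interrupt_id}"
            )
            return None
        return action()


# Usage: messages collects what would be shown to the administrator.
messages: List[str] = []
manager = ProcessManager(messages.append)
manager.register(1, lambda: "switch back-end driver 7504 to device driver 7502")
```

Keeping the lookup a constant-time dictionary access fits the goal of reacting to the hardware interrupt without adding noticeable software delay.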

死活確認部７１１１が搭載されたＩ／Ｏデバイス７１０１および死活確認部７１１１が搭載された７１０２が、たとえばネットワークカードであった場合、ネットワーク上のノードの障害によるサービス断も、Ｉ／Ｏデバイス７１０１および７１０２のハードウェアカウンタなどの値を監視し、カウンタ値の変化の異常の検出により、障害と判断することができる。そして、キープアライブ信号を通してマスタからスレーブに障害通知を行うことで同様の障害切り替えが可能である。 If the I/O device 7101 equipped with the alive confirmation unit 7111 and the I/O device 7102 equipped with the alive confirmation unit 7112 are, for example, network cards, a service interruption caused by the failure of a node on the network can also be treated as a failure: values such as the hardware counters of the I/O devices 7101 and 7102 are monitored, and an abnormal change in a counter value is judged to be a failure. The same failover is then possible by having the master notify the slave of the failure through the keep-alive signal.
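
One possible reading of "abnormal change in a counter value" is a receive counter that stops increasing. The sketch below assumes exactly that criterion; the function name `counter_stalled` and the window of three intervals are illustrative choices, and the patent does not specify which counters or thresholds are used.

```python
from typing import Sequence


def counter_stalled(samples: Sequence[int], window: int = 3) -> bool:
    """Return True if the counter did not change over the last `window` intervals.

    `samples` holds periodic readings of a hardware counter, oldest first.
    """
    if len(samples) < window + 1:
        return False  # not enough history to judge
    recent = samples[-(window + 1):]
    return all(later == earlier for earlier, later in zip(recent, recent[1:]))
```

A stalled counter on the active device would then be reported to the peer through the keep-alive exchange, after which the failover proceeds as for a device failure.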

また、ネットワークカードとネットワーク上のノードとの死活確認信号により、ネットワーク上の障害においても同様に割り込み信号７２０３を発行し、高速切り替えが可能である。 Likewise, alive confirmation signals exchanged between a network card and nodes on the network allow the interrupt signal 7203 to be issued for a failure on the network as well, enabling high-speed switching.

以下、Ｉ／Ｏデバイス７１０１での障害発生から障害復旧までの動作を図４に示すフローチャートを用いて詳細に説明する。ここで、上記のとおり、管理者７９００によって必要設定および処理登録はすでに行われているとする。 Hereinafter, the operation from the failure occurrence to the failure recovery in the I / O device 7101 will be described in detail with reference to the flowchart shown in FIG. Here, as described above, it is assumed that necessary settings and process registration have already been performed by the administrator 7900.

（ステップＳ２０１）
Ｉ／Ｏデバイス７１０１に障害が発生し、デバイスドライバ７５０１、バックエンドデバイスドライバ７５０４、フロントエンドデバイスドライバ７５３１およびプロセス７５３２はＩ／Ｏデバイス７１０１へのアクセスが不可となり、サービスが停止する。またその障害により、キープアライブ信号７２０１の交信異常が発生する。
(Step S201)
A failure occurs in the I/O device 7101. The device driver 7501, the back-end device driver 7504, the front-end device driver 7531, and the process 7532 can no longer access the I/O device 7101, and the service stops. The failure also disrupts the exchange of the keep-alive signal 7201.

（ステップＳ２０２）
Ｉ／Ｏデバイス７１０２は死活確認部７１１２のキープアライブ信号交信異常検出からＩ／Ｏデバイス７１０１の障害を検出し、自分の運用状態を待機から運用に変更する。
(Step S202)
When its alive confirmation unit 7112 detects the abnormality in the keep-alive exchange, the I/O device 7102 detects the failure of the I/O device 7101 and changes its own operating state from standby to active.

（ステップＳ２０３）
Ｉ／Ｏデバイス７１０２は、割り込み信号７２０２をＣＰＵ７１０３に送信する。
(Step S203)
The I/O device 7102 transmits an interrupt signal 7202 to the CPU 7103.

（ステップＳ２０４）
ＣＰＵ７１０３は割り込み信号７２０３を障害管理手段７５５０に送信する。
(Step S204)
The CPU 7103 transmits an interrupt signal 7203 to the failure management unit 7550.

（ステップＳ２０５）
障害管理手段７５５０における処理管理手段７５５４は割り込み信号のＩＤ(Identification)から情報格納手段７５５３にアクセスし、処理テーブルから該当する処理を検索する。
(Step S205)
The process management unit 7554 in the failure management unit 7550 uses the ID (identification) of the interrupt signal to access the information storage unit 7553 and retrieves the corresponding process from the process table.

（ステップＳ２０６）
処理管理手段７５５４は検索した処理を実行する命令を発行する（ここでは資源割当手段７５０３に対して、バックエンドデバイスドライバ７５０４をデバイスドライバ７５０２に接続変更するように命令（契機信号）を発行する）。
(Step S206)
The process management unit 7554 issues a command to execute the retrieved process (here, it issues a command (trigger signal) to the resource allocation unit 7503 to change the connection of the back-end device driver 7504 to the device driver 7502).

（ステップＳ２０７）
資源割当手段７５０３は上記の命令に従って、バックエンドデバイスドライバ７５０４とデバイスドライバ７５０１の接続をデバイスドライバ７５０２との接続に変更する。
(Step S207)
In accordance with the above command, the resource allocation unit 7503 changes the connection between the back-end device driver 7504 and the device driver 7501 to a connection with the device driver 7502.

（ステップＳ２０８）
上記の接続が確立されると、バックエンドデバイスドライバ７５０４、フロントエンドデバイスドライバ７５３１およびプロセス７５３２はデバイスドライバ７５０２およびすでに現用系として動作しているＩ／Ｏデバイス７１０２へのアクセスが可能となり、サービスが復旧する。
(Step S208)
Once the above connection is established, the back-end device driver 7504, the front-end device driver 7531, and the process 7532 can access the device driver 7502 and the I/O device 7102, which is already operating as the active system, and the service is restored.

次に本実施形態による効果について説明する。プロセス７５３２とデバイスドライバ７５０１および７５０２、さらにＩ／Ｏデバイス７１０１および７１０２との接続はフロントエンドデバイスドライバ７５３１とバックエンドデバイスドライバ７５０４の間の接続を介して実現されているため、資源割当手段７５０３による切替により、デバイスドライバおよびＩ／Ｏデバイスでの障害は完全に隠蔽される。これはソフトウェア障害復旧方式の利点である。さらに、障害検出および運用状態切替は高速なハードウェア方式を用い、処理管理手段７５５４によるハードウェア方式とソフトウェア方式の連携同期動作によって、死活確認部以外の特別な隠蔽構造をとるハードウェアなしに高速障害復旧が可能となる。 Next, the effects of this embodiment will be described. The connections between the process 7532 and the device drivers 7501 and 7502, and further the I/O devices 7101 and 7102, are realized through the connection between the front-end device driver 7531 and the back-end device driver 7504. Switching by the resource allocation unit 7503 therefore completely hides failures in the device drivers and I/O devices. This is an advantage of the software failure recovery method. In addition, failure detection and operating-state switching use a fast hardware method, and the process management unit 7554 makes the hardware and software methods operate in coordinated synchronization, so fast failure recovery is possible without hardware that has a special concealment structure other than the alive confirmation units.

さらに、上記の可用性(availability)を基準（メトリック）とした制御以外に、ハードウェア上で監視可能な項目に対する統計値をメトリックとしたハードウェアとソフトウェアの連携資源割当制御も同様の方法で実現可能である。統計値として、たとえばカウンタ値による帯域測定、試験信号などによる遅延測定、ビットエラー検査による信頼性などがあるが、必ずしもこれらに限るものではない。連携による実現機能として、たとえば帯域測定の場合は、冗長構成を組むペアの間で状況の変化に応じた動的なロードバランスなどが実現可能である。 Furthermore, besides the control described above, which uses availability as its metric, coordinated hardware and software resource allocation control that uses statistics on items monitorable in hardware as its metric can be realized in the same way. Such statistics include, for example, bandwidth measured from counter values, delay measured with test signals, and reliability measured by bit error checks, but are not limited to these. As an example of a function realized by this coordination, bandwidth measurement allows dynamic load balancing between the devices of a redundant pair according to changing conditions.
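
As one hedged illustration of bandwidth-based load balancing, the sketch below splits new traffic between the devices of a pair in proportion to their remaining headroom. The proportional-headroom rule, the function name `balance_shares`, and the example figures are assumptions for illustration only; the patent does not prescribe a balancing policy.

```python
from typing import Dict


def balance_shares(capacity: Dict[str, float], load: Dict[str, float]) -> Dict[str, float]:
    """Return the fraction of new traffic to send to each device.

    `capacity` and `load` are per-device bandwidth figures in the same unit,
    with `load` derived from hardware counter readings.
    """
    headroom = {dev: max(capacity[dev] - load[dev], 0.0) for dev in capacity}
    total = sum(headroom.values())
    if total == 0:
        # Every device is saturated: fall back to an even split.
        return {dev: 1.0 / len(capacity) for dev in capacity}
    return {dev: room / total for dev, room in headroom.items()}


shares = balance_shares(
    capacity={"7101": 1000.0, "7102": 1000.0},
    load={"7101": 800.0, "7102": 200.0},
)
```

Here the more heavily loaded device 7101 receives the smaller share, and re-running the calculation as counters change gives the dynamic behaviour the text refers to.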

［第３の実施形態］
次に本発明の第３の実施の形態について図５を参照して詳細に説明する。
[Third Embodiment]
Next, a third embodiment of the present invention will be described in detail with reference to FIG. 5.

本発明の第３の実施の形態は、ハードウェア空間８１００とソフトウェア空間８５００から構成される。 The third embodiment of the present invention includes a hardware space 8100 and a software space 8500.

ハードウェア空間８１００は通信手段８１５０によって相互に接続されたＩ／Ｏデバイス８１０１、Ｉ／Ｏデバイス８１０２、Ｉ／Ｏデバイス８１０３およびＣＰＵ８１０４を少なくともひとつずつ以上有する。I/Oデバイス８１０２および８１０３は死活確認部によって上記の第２の実施の形態と同様にキープアライブ信号を通して互いに死活確認を行っている。Ｉ／Ｏデバイス８１０１は死活確認部を有しない通常のＩ／Ｏデバイスである。Ｉ／Ｏデバイス８１０１、Ｉ／Ｏデバイス８１０２、Ｉ／Ｏデバイス８１０３は物理資源を構成し、Ｉ／Ｏデバイス８１０１は死活確認部を有せず、異なる性能を有する。 The hardware space 8100 includes at least one each of an I/O device 8101, an I/O device 8102, an I/O device 8103, and a CPU 8104, interconnected by a communication unit 8150. As in the second embodiment, the I/O devices 8102 and 8103 check each other's liveness through keep-alive signals exchanged by their alive confirmation units. The I/O device 8101 is an ordinary I/O device without an alive confirmation unit. The I/O devices 8101, 8102, and 8103 constitute the physical resources; the I/O device 8101, having no alive confirmation unit, differs from the others in performance.

ソフトウェア空間８５００は、オペレーティングシステムなどに代表される仮想化手段８５８０および少なくとも二つ以上の仮想装置８５３０および８５４０を有する。仮想化手段８５８０は、Ｉ／Ｏデバイス８１０１〜８１０３それぞれへのアクセス手段であるデバイスドライバ８５０１〜８５０３、資源割当手段８５０６、バックエンドデバイスドライバ８５０４および８５０５、障害管理手段８５５０を有する。さらに障害管理手段８５５０は、管理者８９００にアクセス手段を提供するインターフェース手段８５５１、設定管理手段８５５２、情報格納手段８５５３、処理管理手段８５５４、および物理資源管理手段８５５５を有する。また、仮想装置８５３０および８５４０は、それぞれプロセス８５３２および８５４２と、フロントエンドデバイスドライバ８５３１および８５４１を有する。 The software space 8500 includes a virtualization unit 8580, typified by an operating system, and at least two virtual devices 8530 and 8540. The virtualization unit 8580 includes device drivers 8501 to 8503, which are the access units for the I/O devices 8101 to 8103 respectively, a resource allocation unit 8506, back-end device drivers 8504 and 8505, and a failure management unit 8550. The failure management unit 8550 in turn includes an interface unit 8551 that provides access to the administrator 8900, a setting management unit 8552, an information storage unit 8553, a process management unit 8554, and a physical resource management unit 8555. The virtual devices 8530 and 8540 have processes 8532 and 8542 and front-end device drivers 8531 and 8541, respectively.

また管理者８９００は、障害管理手段８５５０に対して、管理情報として、構成設定情報８７０１、管理ポリシー情報８７０２を入力することができる。 The administrator 8900 can input configuration setting information 8701 and management policy information 8702 as management information to the failure management unit 8550.

次に、これらの手段の動作について説明する。上記の第２の実施の形態と同様に、I/Oデバイス８１０１〜８１０３はそれぞれNIC（ネットワークインターフェースカード）やディスクに代表されるInput/Outputデバイスであり、ＣＰＵ８１０４はＰｅｎｔｉｕｍ（登録商標）やＸｅｏｎに代表される中央演算処理装置であるが、必ずしもこれらに限定するものではない。 Next, the operation of these units will be described. As in the second embodiment, the I/O devices 8101 to 8103 are input/output devices typified by NICs (network interface cards) and disks, and the CPU 8104 is a central processing unit typified by Pentium (registered trademark) and Xeon, but they are not necessarily limited to these.

I/Oデバイス８１０２および８１０３は冗長ペアを構成しており片方が現用系、もう一方が待機系に設定される（ここではI/Oデバイス８１０２が現用系とする）。そして、I/Oデバイス８１０２および８１０３は、第２の実施の形態同様に死活確認部によるキープアライブ信号の一定間隔での交信によって互いに死活確認をしている。Ｉ／Ｏデバイス８１０１はこのような死活確認部を持たない通常のハードウェア資源とする。 The I / O devices 8102 and 8103 form a redundant pair, and one is set as the active system and the other is set as the standby system (here, the I / O device 8102 is set as the active system). Then, the I / O devices 8102 and 8103 confirm each other's life and death by communicating at regular intervals of keep alive signals by the life and death confirmation unit, as in the second embodiment. The I / O device 8101 is a normal hardware resource that does not have such a life / death confirmation unit.

このため、もしI/Oデバイス８１０１に障害が発生した場合は、プロセスやデバイスドライバによるタイムアウトなどの応答異常検出によって障害が検出され、従来のソフトウェア障害復旧方式のみによって障害復旧されるため、障害復旧は一般に時間のかかるものとなる。一方でＩ／Ｏデバイス８１０２および８１０３における障害は死活確認部によって検出されるため、第２の実施の形態に記載と同様の処理により高速検出および運用状態変更が可能である。 For this reason, if a failure occurs in the I / O device 8101, the failure is detected by detecting a response abnormality such as a timeout by a process or device driver, and the failure is recovered only by the conventional software failure recovery method. Is generally time consuming. On the other hand, since the failure in the I / O devices 8102 and 8103 is detected by the life and death confirmation unit, high-speed detection and operation state change can be performed by the same processing as described in the second embodiment.

デバイスドライバ８５０１〜８５０３はそれぞれI/Oデバイス８１０１〜８１０３へのアクセス手段を提供するソフトウェアプログラムであり、デバイス固有に作成される。第２の実施の形態と同様に、仮想装置８５３０および８５４０内のフロントエンドデバイスドライバ８５３１および８５４１は、プロセス８５３２および８５４２に、仮想資源８３０１および８３０２の存在を示し、プロセス８５３２および８５４２が普通の物理資源にアクセスする場合と同様のインターフェースを提供している。フロントエンドデバイスドライバ８５３１および８５４１は、それぞれバックエンドデバイスドライバ８５０４および８５０５に対応付けられており、仮想資源８３０１および８３０２のアクセス要求を転送する。 The device drivers 8501 to 8503 are software programs that provide access to the I/O devices 8101 to 8103, respectively, and are written specifically for each device. As in the second embodiment, the front-end device drivers 8531 and 8541 in the virtual devices 8530 and 8540 present the virtual resources 8301 and 8302 to the processes 8532 and 8542, and provide the same interface as when the processes 8532 and 8542 access ordinary physical resources. The front-end device drivers 8531 and 8541 are associated with the back-end device drivers 8504 and 8505, respectively, and forward access requests for the virtual resources 8301 and 8302 to them.

バックエンドデバイスドライバ８５０４および８５０５は、資源割当手段８５０６を通して、デバイスドライバ８５０１〜８５０３と接続制御され、ソフトウェア方式による障害隠蔽構造を形成しており、冗長構成をとっている場合はその構成に指定の現用系デバイスドライバに接続を許可する。 The connections between the back-end device drivers 8504 and 8505 and the device drivers 8501 to 8503 are controlled through the resource allocation unit 8506, forming a software-based failure concealment structure. When a redundant configuration is used, connection is permitted to the active device driver designated in that configuration.

障害管理手段８５５０は第２の実施の形態に加えて、設定管理手段８５５２と物理資源管理手段８５５５を有する。 In addition to the components of the second embodiment, the failure management unit 8550 includes a setting management unit 8552 and a physical resource management unit 8555.

物理資源管理手段８５５５は、現在システム内にある物理資源の情報を管理し、また必要に応じて物理資源に設定を行う。物理資源の情報はたとえば物理資源ＩＤ（物理資源識別情報）や種類、性能に加え、上記の死活確認部の有無といった障害復旧性能に関する情報を有する。たとえば、Ｉ／Ｏデバイス８１０１は通常デバイスであり、Ｉ／Ｏデバイス８１０２および８１０３は死活確認部を有し、ハードウェア方式による冗長構成を組むことが可能である、などの情報である。さらに、Ｉ／Ｏデバイス８１０２と８１０３で冗長ペアを組むことを決めた時、両者の間でキープアライブ信号に代表される死活確認信号の交信の設定を行い、運用系および待機系の運用状態設定を行う。 The physical resource management unit 8555 manages information on the physical resources currently in the system and configures the physical resources as necessary. The physical resource information includes, for example, the physical resource ID (physical resource identification information), type, and performance, together with information on failure recovery performance such as the presence or absence of the alive confirmation unit described above. For example, it records that the I/O device 8101 is an ordinary device, while the I/O devices 8102 and 8103 have alive confirmation units and can form a hardware-based redundant configuration. Furthermore, when it is decided that the I/O devices 8102 and 8103 will form a redundant pair, the unit sets up the exchange of alive confirmation signals, typified by keep-alive signals, between the two and sets their operating states as active and standby.

設定管理手段８５５２はシステム内の仮想装置に割り当てられた仮想資源構成、プライオリティ、冗長構成や障害復旧速度などの可用性レベルといった仮想装置毎に定められた管理情報を有する。仮想装置の資源割当構成は、仮想装置を生成時に取得・保存し、可用性レベルやプライオリティの情報は、それぞれ構成設定情報８７０１および管理ポリシー８７０２によって定められる。構成設定情報８７０１および管理ポリシー８７０２はインターフェース手段８５５１を通して管理者８９００から入力される。 The setting management unit 8552 has management information determined for each virtual device such as a virtual resource configuration assigned to a virtual device in the system, a priority, an availability level such as a redundant configuration and a failure recovery speed. The resource allocation configuration of the virtual device is acquired and stored when the virtual device is generated, and the availability level and priority information are determined by the configuration setting information 8701 and the management policy 8702, respectively. The configuration setting information 8701 and the management policy 8702 are input from the administrator 8900 through the interface unit 8551.

構成設定情報８７０１はシステム内の仮想装置の有する仮想資源の冗長構成や障害復旧速度などの可用性レベル情報を有する。管理ポリシー情報８７０２はシステム内の仮想装置の優先度を記載したプライオリティ情報を有する。 The configuration setting information 8701 includes availability level information such as a redundant configuration of a virtual resource possessed by a virtual device in the system and a failure recovery speed. The management policy information 8702 has priority information describing the priority of the virtual device in the system.

設定管理手段８５５２は構成設定情報８７０１および管理ポリシー情報８７０２と、仮想装置に割り当てられている仮想資源構成情報と、物理資源管理手段の有する物理資源情報から、可能な資源割当方法の組み合わせを計算し、最適構成を探索し、その設定反映命令を資源割当手段８５０６に発行し、その設定情報を保持する。また、同時に、決定した冗長構成情報から、各種割り込み信号ＩＤに対して障害発生時の処理を情報格納手段８５５３のテーブルに格納する。 The setting management unit 8552 calculates a combination of possible resource allocation methods from the configuration setting information 8701 and the management policy information 8702, the virtual resource configuration information allocated to the virtual device, and the physical resource information of the physical resource management unit. The optimum configuration is searched, the setting reflection command is issued to the resource allocation unit 8506, and the setting information is held. At the same time, from the determined redundant configuration information, the processing at the time of occurrence of failure for each interrupt signal ID is stored in the table of the information storage unit 8553.

以下、構成設定情報８７０１および管理ポリシー情報８７０２の入力から、冗長構成の決定および設定処理について図６に示すフローチャートを用いて詳細に説明する。 The redundant configuration determination and setting processing from the input of the configuration setting information 8701 and the management policy information 8702 will be described in detail below with reference to the flowchart shown in FIG.

（ステップＳ３０１）
構成設定情報８７０１および管理ポリシー８７０２を入力する。
(Step S301)
The configuration setting information 8701 and the management policy information 8702 are input.

（ステップＳ３０２）
設定管理手段８５５２は物理資源管理手段８５５５から物理資源情報を取得する。また、現在の仮想装置の仮想資源に対して、入力された構成設定情報８７０１に記載の可用性レベルを達成する物理資源割当構成を計算し、上記の取得した物理資源情報と比較する。
(Step S302)
The setting management unit 8552 acquires the physical resource information from the physical resource management unit 8555. For the virtual resources of the current virtual devices, it also calculates the physical resource allocation configurations that achieve the availability level given in the input configuration setting information 8701 and compares them with the acquired physical resource information.

（ステップＳ３０３）
要求の可用性レベルを満たす物理資源割当が可能であれば（ステップＳ３０４）に進む。不可能なら、エラーメッセージを、インターフェース手段８５５１を通して管理者８９００に出力して終了する。
(Step S303)
If a physical resource allocation that satisfies the requested availability level is possible, the process proceeds to step S304. If not, an error message is output to the administrator 8900 through the interface unit 8551 and the process ends.

（ステップＳ３０４）
管理ポリシー情報８７０２から仮想装置のプライオリティ情報を取得し、要求の可用性レベルを満たす物理資源割当組み合わせのうち、プライオリティに応じて物理資源割当組み合わせをソートし、一番可用性レベルが高い仮想装置の物理資源割当方法を決定する。
(Step S304)
The priority information of the virtual devices is acquired from the management policy information 8702. The physical resource allocation combinations that satisfy the requested availability level are sorted according to priority, and the physical resource allocation method that gives the virtual devices the highest availability level is selected.

（ステップＳ３０５）
決定した組み合わせを設定反映する命令を資源割当手段８５０６に発行し、資源割当を決定し、その設定を保存する。
(Step S305)
A command to apply the selected combination is issued to the resource allocation unit 8506, the resource allocation is fixed, and the setting is saved.

（ステップＳ３０６）
設定した資源割当構成に対して、各物理資源での障害に対する処理を作成し、情報格納手段８５５３のテーブルに登録し、処理管理手段８５５４が障害発生通知の割り込み信号取得時に処理を検索できるようにする。
(Step S306)
For the resource allocation configuration that has been set, a process for handling a failure in each physical resource is created and registered in the table of the information storage unit 8553, so that the process management unit 8554 can look it up when it receives a failure notification interrupt signal.

たとえば、図５に記載の例で、仮想装置８５３０より仮想装置８５４０の方がプライオリティが高いとし、両者の仮想デバイスとも１＋１の冗長構成（ただし待機系は共有可）であったとする。Ｉ／Ｏデバイス８１０１で障害が発生した場合は、ハードウェアによる死活確認部がなく、一般に障害復旧処理はタイムアウトなどのソフトウェア処理になるため、処理時間が長くなるため可用性レベルは低い。一方Ｉ／Ｏデバイス８１０２で障害が発生した場合は、第２の実施の形態に記載のように、障害復旧が高速で行えるため、可用性レベルは高い。 For example, in the example shown in FIG. 5, suppose that the virtual device 8540 has a higher priority than the virtual device 8530 and that both have a 1+1 redundant configuration (the standby system may be shared). If a failure occurs in the I/O device 8101, which has no hardware alive confirmation unit, recovery generally relies on software processing such as timeouts, so the processing time is long and the availability level is low. If a failure occurs in the I/O device 8102, on the other hand, recovery is fast, as described in the second embodiment, so the availability level is high.

そこで、仮想装置８５４０の仮想デバイス８３０２の現用系にはＩ／Ｏデバイス８１０２が、待機系にはＩ／Ｏデバイス８１０３が設定され、仮想装置８５３０の仮想デバイス８３０１の現用系にはＩ／Ｏデバイス８１０１が、待機系にはＩ／Ｏデバイス８１０３が設定されることになる。Ｉ／Ｏデバイス８１０３を現用系にし、Ｉ／Ｏデバイス８１０２を待機系にする組み合わせも同様の可用性レベルで実現可能であるが両者は区別がないため、どちらか一方を選ぶこととする。 Accordingly, for the virtual resource 8302 of the virtual device 8540, the I/O device 8102 is set as the active system and the I/O device 8103 as the standby system; for the virtual resource 8301 of the virtual device 8530, the I/O device 8101 is set as the active system and the I/O device 8103 as the standby system. The combination with the I/O device 8103 as active and the I/O device 8102 as standby achieves the same availability level, but since the two are indistinguishable, either one is chosen.
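
The outcome of this example can be reproduced with a small greedy sketch. It is a simplification, not the procedure of steps S302 to S304, which enumerates and sorts candidate combinations: here one shared standby is chosen first, and active devices are then handed out in priority order, fast devices first. The function name `allocate`, the tie-breaking by device ID, and the shared-standby rule are all assumptions.

```python
from typing import Dict, Tuple


def allocate(devices: Dict[int, bool], priorities: Dict[int, int]) -> Dict[int, Tuple[int, int]]:
    """Map each virtual device to an (active, standby) pair of I/O devices.

    devices:    I/O device ID -> True if it has an alive confirmation unit
    priorities: virtual device ID -> priority (larger means more important)
    """
    if len(devices) < len(priorities) + 1:
        # Corresponds to the error exit of step S303.
        raise ValueError("not enough physical resources for 1+1 redundancy")

    fast = sorted(d for d, has_check in devices.items() if has_check)
    slow = sorted(d for d, has_check in devices.items() if not has_check)

    # A hardware-checked pair needs two devices with alive confirmation units,
    # so reserve one of them as the shared standby when at least two exist.
    if len(fast) >= 2:
        standby = fast.pop()
    elif slow:
        standby = slow.pop()
    else:
        standby = fast.pop()

    pool = fast + slow  # remaining active candidates, fastest recovery first
    plan: Dict[int, Tuple[int, int]] = {}
    for vdev in sorted(priorities, key=priorities.get, reverse=True):
        plan[vdev] = (pool.pop(0), standby)
    return plan


plan = allocate(
    devices={8101: False, 8102: True, 8103: True},
    priorities={8530: 1, 8540: 2},
)
```

With the inputs of the FIG. 5 example, the higher-priority virtual device 8540 gets the hardware-checked pair 8102/8103, and 8530 gets 8101 as active with the shared standby 8103.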

次に本実施形態による効果について説明する。障害復旧性能、ひいては可用性レベルの異なる物理資源を複数有するハードウェア空間（物理資源空間）をシステム内に有するシステムにおいて、本実施形態によれば、第２の実施の形態による効果は引き継いだまま仮想装置の優先度に応じた最適障害復旧構成を自動的に選ぶことが可能になる。 Next, the effects of this embodiment will be described. In a system whose hardware space (physical resource space) contains multiple physical resources that differ in failure recovery performance, and hence in availability level, this embodiment retains the effects of the second embodiment while automatically selecting the optimum failure recovery configuration according to the priority of each virtual device.

そして、キープアライブ方式などの高速障害復旧機能を持ったハードウェアとそうでない通常ハードウェアが混在するような多様な障害回復性能が存在する複雑な管理環境で、管理ポリシーを満足するよう自立的に最適な冗長化設定を行う管理柔軟性を提供することができる。その理由は、設定情報、管理ポリシーや資源情報などを管理しながら、状態に合わせた最適な設定を自動的に選択し、構成設定・障害復旧を行うためである。 Moreover, in a complex management environment with diverse failure recovery performance, where hardware with a fast failure recovery function such as the keep-alive method coexists with ordinary hardware that lacks one, the embodiment provides the management flexibility to make the optimum redundancy settings autonomously so as to satisfy the management policy. This is because it automatically selects the optimum settings for the current state, and performs configuration and failure recovery, while managing the setting information, management policy, and resource information.

［第４の実施形態］
次に本発明の第４の実施の形態について図７を参照して詳細に説明する。
[Fourth Embodiment]
Next, a fourth embodiment of the present invention will be described in detail with reference to FIG. 7.

本発明の第４の実施の形態は、第２および第３の実施の形態のハードウェア空間の生成の方法およびその方法とソフトウェア空間の連携に関する。 The fourth embodiment of the present invention relates to a method for generating the hardware space of the second and third embodiments, and to the cooperation between that method and the software space.

図７を参照すると、第４の実施の形態は、物理資源を収容するシャーシ９１０１および９２０１と、スイッチ９４００と、ハードウェア空間９０００と、ソフトウェア空間９５００から構成される。ハードウェア空間９０００とソフトウェア空間９５００は例えばパーソナルコンピュータで構成され、ソフトウェア空間９５００はＤＲＡＭ等の半導体メモリやハードディスク装置に記憶されたプログラムやデータで構成され、ハードウェア空間６１００のCPUなどに代表される中央演算部により処理が実行される。 Referring to FIG. 7, the fourth embodiment comprises chassis 9101 and 9201 that house physical resources, a switch 9400, a hardware space 9000, and a software space 9500. The hardware space 9000 and the software space 9500 are implemented, for example, on a personal computer. The software space 9500 consists of programs and data stored in semiconductor memory such as DRAM or on a hard disk device, and its processing is executed by a central processing unit, typified by the CPU of the hardware space 6100.

The chassis 9101 houses I/O device 9111, I/O device 9112, CPU 9113, and a power supply 9121 that supplies power to each device. Likewise, the chassis 9201 houses I/O device 9211, I/O device 9212, CPU 9213, and a power supply 9221 that supplies power to each device. The switch 9400 interconnects the devices in chassis 9101 and 9201 and in the hardware space 9000. The hardware space 9000 contains I/O device 9001, I/O device 9002, I/O device 9003, and CPU 9004, which are resources logically partitioned and grouped by the switch 9400. The software space 9500 contains physical resource management means 9501 and at least one virtual device 9502.

The number of chassis, and the number and types of physical resources such as I/O devices housed in each chassis, are not limited to the configuration shown in FIG. 7.

The physical resources selected for grouping by the switch are likewise only an example, and the invention is not limited to this configuration.

Grouping here means a process by which the physical resources in the grouped hardware space 9000 can communicate with one another through a logical partitioning function of the switch, typified by the VLAN function of an Ethernet (registered trademark) switch, while connections to other groups are, in principle, isolated.
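The grouping behavior described above can be pictured with a small model. The following is only a sketch: the class `PartitioningSwitch`, its methods, and the group labels are hypothetical names introduced for illustration, and the model ignores everything a real VLAN-capable switch does beyond group membership.

```python
class PartitioningSwitch:
    """Toy model of a switch that logically partitions attached resources."""

    def __init__(self):
        # resource name -> group id (plays the role of a VLAN ID)
        self._group_of = {}

    def assign(self, resource, group_id):
        """Place a physical resource into a group."""
        self._group_of[resource] = group_id

    def members(self, group_id):
        """Return the resources that belong to a group, sorted by name."""
        return sorted(r for r, g in self._group_of.items() if g == group_id)

    def can_communicate(self, a, b):
        """Resources can talk only if both are assigned to the same group."""
        group_a = self._group_of.get(a)
        group_b = self._group_of.get(b)
        return group_a is not None and group_a == group_b


# Grouping that mirrors FIG. 7: devices from both chassis form hardware space 9000.
switch = PartitioningSwitch()
for name in ("9111", "9112", "9113", "9211"):
    switch.assign(name, "hardware_space_9000")
for name in ("9212", "9213"):
    switch.assign(name, "other_group")  # hypothetical second group

print(switch.members("hardware_space_9000"))
print(switch.can_communicate("9111", "9211"))
print(switch.can_communicate("9111", "9212"))
```

Keeping membership in a single mapping makes it clear that a resource belongs to at most one group in this model, which matches the isolation between groups described in the text.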

The switch 9400 is a network device typified by an Ethernet (registered trademark) switch, but its protocol and physical configuration are not limited to this.

The software space 9500 has software-based failure concealment means and failure management means that rely on virtualization, as typified by the software space described in the third embodiment.

Physical resources, typified by I/O devices, may or may not have a hardware-based failure recovery function such as an alive check unit.

The physical resource management means 9501 manages information on the physical resources belonging to the hardware space 9000, such as performance, availability, physical location, and grouping configuration.
Next, the operation of these means is described. As shown in FIG. 7, the physical resources belonging to the hardware space 9000 that is provided to the software space 9500 may, depending on the settings, be selected and grouped from chassis 9101, from chassis 9201, or from both.

In the configuration shown in FIG. 7, CPU 9004, I/O device 9001, and I/O device 9002 are connected to CPU 9113, I/O device 9111, and I/O device 9112 of chassis 9101, respectively, and function as one CPU and two mutually independent I/O devices. I/O device 9003 is connected to I/O device 9211 of chassis 9201 and functions as one I/O device. As in the third embodiment, suppose that a 1+1 configuration of I/O devices (one active and one standby I/O device) is to be set as the redundant configuration for the virtual resources of two virtual devices with different priorities in the software space 9500.
Assume that all I/O devices, chassis, and power supplies have identical characteristics such as performance and availability, and omit options that produce equivalent results. I/O device 9001 and I/O device 9002 are then completely equivalent, so configurations that merely swap the two are omitted. The active device could be an I/O device of chassis 9101 or of chassis 9201, but because complete equivalence is assumed, it is sufficient from the standpoint of availability to consider only one of these cases.

Therefore, in the hardware space 9000 shown in FIG. 7, there are two possible configurations for 1+1 redundancy:
(Option 1) Active: I/O device 9001, standby: I/O device 9002
(Option 2) Active: I/O device 9001, standby: I/O device 9003
The difference is whether the redundant configuration lies within a single chassis or spans two chassis.

From the standpoint of power supply failure, Option 1 shares failure risk, because its devices are connected to I/O device 9111 and I/O device 9112 in the same chassis, which is driven by a single power supply. Option 2 is driven by different power supplies and therefore has higher availability. Accordingly, as described in the third embodiment, when resource allocation control is determined for the redundant configuration and priority of the virtual resources of the virtual devices specified by the administrator, the physical resource information is obtained from the physical resource management means 9501, the availability differences that arise from such grouping differences are taken into account, and the more highly available configuration is assigned to the higher-priority virtual device.
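As a rough illustration of why Option 2 is more available, the comparison can be worked out under assumptions that the source does not state: each power supply fails with probability p_s, each I/O device fails with probability p_d, all failures are independent, and every other component (chassis, switch, CPU) is ignored. Writing P_1 and P_2 for the probability that both the active and the standby device are unavailable under Option 1 and Option 2:

```latex
\begin{align*}
P_1 &= 1 - (1 - p_s)\,(1 - p_d^{2})
      && \text{(Option 1: shared power supply)} \\
P_2 &= \bigl(1 - (1 - p_s)(1 - p_d)\bigr)^{2}
      && \text{(Option 2: separate power supplies)} \\
P_1 - P_2 &= p_s\,(1 - p_s)\,(1 - p_d)^{2} \;\ge\; 0
\end{align*}
```

With the purely illustrative values p_s = p_d = 0.01, this gives P_1 ≈ 0.0101 and P_2 ≈ 0.0004, so under these assumptions the cross-chassis pair is the less likely of the two to be lost entirely.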

Note that this example considers only availability and leaves out other constraints, such as performance (for example, differences in data transfer speed caused by physical location or network performance). Other settings are also possible: in addition to availability, the constraints to be considered can be added to the management policy and configuration setting information and included when the optimum setting is selected. The invention is not limited to this example.

Next, the effects of the fourth embodiment are described. According to the fourth embodiment, while the effects of the second and third embodiments are retained, the configuration that best matches the administrator's intent, as expressed in the management policy and configuration setting information, can be selected automatically. The selection takes into account not only the performance of the physical resources themselves and availability information such as the presence or absence of an alive check unit, but also other information such as physical location information and shared risk information.

Furthermore, an optimum configuration can be obtained even when the failure risk of the physical resources is not uniform. This is because the search for the optimum configuration can take the failure risk of each physical resource into account.
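A search that takes per-resource failure risk into account might look like the following. This is a minimal sketch, not the method of the embodiment: the `IODevice` record, both functions, the independence assumption, and all probability values are hypothetical and chosen only to illustrate ranking the two options and giving the lower-risk pair to the higher-priority virtual device.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IODevice:
    name: str          # reference numeral, e.g. "9001"
    power_supply: str  # power supply of the backing chassis, e.g. "9121"
    p_fail: float      # assumed failure probability of the device itself


def pair_failure_probability(active, standby, p_supply):
    """Probability that active and standby are both unavailable.

    Assumes devices and power supplies fail independently; p_supply maps a
    power supply name to its assumed failure probability.
    """
    if active.power_supply == standby.power_supply:
        # Shared supply: the pair is lost if the supply fails or both devices fail.
        p_ok = (1 - p_supply[active.power_supply]) * (1 - active.p_fail * standby.p_fail)
        return 1 - p_ok
    # Separate supplies: each side is lost if its supply or its device fails.
    down_active = 1 - (1 - p_supply[active.power_supply]) * (1 - active.p_fail)
    down_standby = 1 - (1 - p_supply[standby.power_supply]) * (1 - standby.p_fail)
    return down_active * down_standby


def assign_by_priority(priorities, candidate_pairs, p_supply):
    """Give the lowest-risk pair to the highest-priority virtual device.

    priorities maps a virtual device name to a number (larger = more important).
    """
    ranked_pairs = sorted(
        candidate_pairs, key=lambda pair: pair_failure_probability(*pair, p_supply)
    )
    ranked_devices = sorted(priorities, key=priorities.get, reverse=True)
    return dict(zip(ranked_devices, ranked_pairs))


# Illustrative values only: device 9003 is assumed to be less reliable than the others.
io_9001 = IODevice("9001", "9121", 0.01)
io_9002 = IODevice("9002", "9121", 0.01)
io_9003 = IODevice("9003", "9221", 0.02)
p_supply = {"9121": 0.01, "9221": 0.01}

options = [(io_9001, io_9002), (io_9001, io_9003)]  # Option 1, Option 2
result = assign_by_priority({"vd_high": 2, "vd_low": 1}, options, p_supply)
for virtual_device, (active, standby) in result.items():
    print(virtual_device, active.name, standby.name)
```

In this sketch the cross-chassis pair still ranks first even though its standby device is assumed to be less reliable, because the shared power supply dominates the risk of the same-chassis pair. Other constraints, such as performance, could be folded into the sort key in the same way.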
Representative embodiments of the present invention have been described above. These embodiments can be modified in various ways, and substitutions and changes are possible without departing from the spirit and scope of the present invention as defined by the claims of the present application.

The present invention is applicable to systems comprising a plurality of physical resources and a computer component carrying software for sharing those resources, for example IT systems that use CPUs and I/O devices as physical resources, and network (NW) systems that use CPUs and line cards as physical resources.

A diagram showing the best mode of the present invention.
An operation flow diagram for the best mode of the present invention.
A diagram showing the hardware method and the software method cooperating to perform fast failure recovery when a failure occurs.
An operation flow diagram for the case where the hardware method and the software method cooperate to perform fast failure recovery when a failure occurs.
A diagram showing selection of optimum resources and setting of a redundant configuration for multiple virtual devices with different priorities.
An operation flow diagram for selecting optimum resources and setting a redundant configuration for multiple virtual devices with different priorities.
A configuration diagram of an example of the present invention in which non-uniform failure risk of physical resources is reflected in the settings.
A diagram of virtualization by a VMM.
A diagram of the virtualization architecture in XEN.
A diagram of a software-based failure concealment method.
A diagram of a hardware-based failure concealment method using RAID.
A diagram of fast switching on failure by an alive check device.

Explanation of symbols

6100 Hardware space
6101, 6102 Physical resource
6111, 6112 Alive check unit
6113 Alive check signal
6121 Central processing unit
6131 Data transfer path (bus)
6500 Software space
6510 Virtual device
6511 Resource access means
6520 Virtual resource space
6521 Virtual resource
6550 Virtualization means
6551 Resource allocation means
6552 Failure management means
6553, 6554 Resource access means
7100 Hardware space
7500 Software space
7101, 7102 I/O device
7103 CPU
7111, 7112 Alive check unit
7150 Data transfer path
7201 Keep-alive signal
7202 Failure notification interrupt signal
7203 Interrupt signal
7301 Virtual resource
7501, 7502 Device driver
7503 Resource allocation control means
7504 Back-end device driver
7530 Virtual device
7531 Front-end device driver
7532 Process
7550 Failure management means
7551 Interface means
7553 Information storage means
7554 Processing management means
7580 Virtualization means
7900 Administrator
8100 Hardware space
8150 Communication means
8101, 8102, 8103 I/O device
8104 CPU
8500 Software space
8501, 8502, 8503 Device driver
8504, 8505 Back-end device driver
8506 Resource allocation means
8531, 8541 Front-end device driver
8532, 8542 Process
8580 Virtualization means
8530, 8540 Virtual device
8550 Failure management means
8551 Interface means
8552 Setting management means
8553 Information storage means
8554 Processing management means
8555 Physical resource management means
8701 Configuration setting information
8702 Management policy information
8301, 8302 Virtual resource
8900 Administrator
9000 Hardware space
9001, 9002, 9003 I/O device
9004 CPU
9101, 9201 Chassis
9111, 9112 I/O device
9113 CPU
9211, 9212 I/O device
9213 CPU
9121, 9221 Power supply
9400 Switch
9301 Redundant pair
9302 Redundant pair
9500 Software space
9501 Physical resource management means
9502 Virtual device

Claims

1. A physical resource control management system comprising:
a plurality of physical resources; and
a computer component equipped with software that functions as a virtual device on which at least one software program runs and as virtualization means that enables the virtual device to share the plurality of physical resources,
wherein the virtualization means comprises resource allocation means for allocating the plurality of physical resources to the virtual device, and failure management means that, when a failure occurs in one or more of the plurality of physical resources and switching control to another physical resource is executed by hardware, controls software switching to the other physical resource in the resource allocation means in cooperation with that switching control.

2. The physical resource control management system according to claim 1, wherein two or more, or all, of the plurality of physical resources have an alive check unit, and
a physical resource having the alive check unit, upon detecting a failure of another physical resource by means of the alive check unit, changes state by a preset operation and notifies the failure management means.

3. The physical resource control management system according to claim 1, wherein the failure management means comprises information storage means for storing information on operations to be performed when a failure occurs, and processing management means that, in response to a trigger signal for switching to another physical resource, controls the resource allocation means based on the information in the information storage means and performs failure recovery processing.

4. The physical resource control management system according to claim 3, wherein the information storage means has a list table that associates identification information determined from the trigger signal with the operation to be performed when a failure occurs.

5. The physical resource control management system according to claim 3, wherein two or more, or all, of the plurality of physical resources have an alive check unit,
a physical resource having the alive check unit changes state by a preset operation upon detecting a failure of another physical resource by means of the alive check unit, and the trigger signal is a signal issued as a notification by the physical resource that has changed state.

6. The physical resource control management system according to claim 1, wherein at least one of the plurality of physical resources differs from the other physical resources in any one of: performance, including reliability, bandwidth, and delay of the physical resource; physical resource type; presence or absence of an alive check function; and failure risk.

7. The physical resource control management system according to claim 1, wherein the failure management means comprises physical resource management means for managing physical resource information in the system, the information being at least one of: performance, including reliability, bandwidth, and delay of the physical resources; physical resource identification information; physical resource type; presence or absence of an alive check function; and failure risk.

8. The physical resource control management system according to claim 7, wherein the failure management means comprises setting management means that computes the allocation of physical resources to the resources of the virtual devices and performs setting control of the resource allocation means, based on the physical resource information acquired from the physical resource management means, configuration setting information input by an administrator that includes at least one item of information on the redundant configuration or availability level of the resources set for each virtual device, and a management policy concerning priority information for the virtual devices.

9. The physical resource control management system according to claim 8, wherein the physical resource management means holds a failure probability as failure risk information, and the setting management means performs the setting control taking the failure probability into account.

10. The physical resource control management system according to claim 1, wherein the failure management means performs setting control of the resource allocation means based on statistical values measured by hardware monitoring means mounted on the physical resources.

11. The physical resource control management system according to claim 10, wherein the statistical value is a dynamic performance measurement of at least one of bandwidth, delay, and bit error rate.

12. The physical resource control management system according to claim 7, wherein each of the plurality of physical resources is connected to physical resources grouped via a network, and the physical resource management means manages the grouping configuration.

13. A physical resource control management method for a physical resource control management system comprising a plurality of physical resources and a computer component equipped with software that functions as a virtual device on which at least one software program runs and as virtualization means that enables the virtual device to share the plurality of physical resources,
the virtualization means comprising resource allocation means for allocating the plurality of physical resources to the virtual device, the method comprising:
a step of switching to another physical resource by hardware when a failure occurs in one or more of the plurality of physical resources; and
a step of switching to the other physical resource by software in the resource allocation means in cooperation with that switching control.

14. A physical resource control management program that causes a computer to function as a virtual device on which at least one software program runs and as virtualization means that enables the virtual device to share a plurality of physical resources,
wherein the program causes the virtualization means to function as resource allocation means for allocating the plurality of physical resources to the virtual device, and as failure management means that, when a failure occurs in one or more of the plurality of physical resources and switching control to another physical resource is executed by hardware, controls software switching to the other physical resource in the resource allocation means in cooperation with that switching control.