WO2019138999A1

WO2019138999A1 - Disk array controller, storage apparatus, recovery method for storage apparatus, and non-transitory storage medium storing recovery program for disk array controller

Info

Publication number: WO2019138999A1
Application number: PCT/JP2019/000206
Authority: WO
Inventors: 小林　健介
Original assignee: NEC Platforms Ltd
Current assignee: NEC Platforms Ltd
Priority date: 2018-01-10
Filing date: 2019-01-08
Publication date: 2019-07-18
Anticipated expiration: 2020-07-10
Also published as: JP2019121279A; JP6734305B2

Abstract

[Problem] To shorten the time required to recover RAID redundancy after a storage device failure when a plurality of storage devices constitute a plurality of types of RAID. [Solution] The present invention comprises: a partial mirroring means that mirrors a prescribed block of each storage device constituting the plurality of types of RAID to a spare storage device; and a reconstruction means that, when any of the storage devices fails, reconstructs in the spare storage device the blocks of the failed storage device that the spare storage device does not already hold, and recovers redundancy by replacing the failed storage device with the spare storage device.

Description

Disk array controller, storage apparatus, recovery method for storage apparatus, and non-transitory storage medium storing recovery program for disk array controller

The present invention relates to a technique for recovering redundancy in a RAID (Redundant Arrays of Inexpensive Disks) when a storage device constituting the RAID fails.

When some of the disks (physical disks) constituting a redundant logical disk (RAID) such as RAID 1, RAID 5, or RAID 6 fail, the RAID becomes degraded (its redundancy is lost or reduced). In this case, the same data as that held by the failed disk may be reconstructed (rebuilt) on a hot spare disk installed in advance, restoring the RAID to a non-degraded state.

In recent years, the time required for rebuilding has grown as disk capacities have increased. As a result, the lengthening of the period during which a RAID remains degraded, and of the period of reduced performance during rebuilding, has become a problem.

Patent Document 1 discloses an example of a technique for shortening the time required for rebuilding. The storage system of Patent Document 1 includes a storage control device, a plurality of storage devices constituting one RLU (RAID Logical Unit), and a spare storage device. The storage control device includes a rebuild control unit and an access processing unit. The rebuild control unit divides the rebuild process into partial processes and instructs the access processing unit to execute them. The rebuild process generates the same data as that recorded in one storage device of the RLU, based on data read from the remaining storage devices of the RLU, and writes the generated data to the spare storage device. Each partial process combines a process of reading data from one of the divided ranges, obtained by dividing the read target range into fixed-size pieces, with a process of writing data to another storage device based on the data read from that divided range. The access processing unit executes the instructed partial processes P1 and P2 in parallel in response to the execution instructions received from the rebuild control unit. With this configuration, the storage system of Patent Document 1 speeds up the rebuild process.

In the technique of Patent Document 1, the rebuild process starts only after a storage device failure has been detected. The technique of Patent Document 1 therefore has the problem that rebuilding starts late.

Patent Document 2 discloses an example of a technique for starting the rebuild process before a storage device failure is detected. The storage system of Patent Document 2 includes a plurality of HDDs (Hard Disk Drives), a spare HDD, and a control unit. The plurality of HDDs constitute a RAID according to a single RAID scheme that uses parity calculation. The spare HDD stores second data having the same content as first data that is stored in one of the HDDs and belongs to the data whose redundancy is secured by the RAID. When one of the HDDs is replaced with the spare HDD, the control unit rebuilds, on the spare HDD, the data that was stored in the replaced HDD, based on the data stored in the other HDDs and in the spare HDD. As a result, at the time an HDD is replaced, the spare HDD in the storage system of Patent Document 2 already stores second data having the same content as the first data stored in the replaced HDD.

Patent Document 1: JP 2013-054407 A
Patent Document 2: JP 2012-185575 A

In the storage system of Patent Document 2, the plurality of HDDs constitute a RAID according to a single RAID scheme (either RAID 5 or RAID 6).

In a typical storage system, one of a plurality of RAID schemes (for example, RAID 1 and RAID 5, or RAID 1 and RAID 6) may be applied individually to each group of HDDs, depending on, for example, the type of data to be held.

However, when a plurality of HDDs constitute RAIDs according to a plurality of RAID schemes, the storage system of Patent Document 2 cannot restore redundancy for every type of RAID scheme (for example, RAID 1).

The present invention has been made in view of the above problem. Its main object is to shorten the time required from the occurrence of a storage device failure to the restoration of RAID redundancy when a plurality of storage devices constitute a plurality of types of RAID.

In one aspect of the present invention, a disk array controller includes: partial mirroring means for mirroring a predetermined block in each of all storage devices constituting a plurality of types of RAID to a spare storage device; and reconfiguration means for, when any of the storage devices fails, reconstructing in the spare storage device the blocks of the failed storage device that the spare storage device does not hold, and restoring redundancy by replacing the failed storage device with the spare storage device.

In one aspect of the present invention, a storage apparatus includes: a plurality of storage devices constituting a plurality of types of RAID; a spare storage device; and a disk array controller. The disk array controller includes partial mirroring means for mirroring a predetermined block in each of all the storage devices to the spare storage device, and reconfiguration means for, when any of the storage devices fails, reconstructing in the spare storage device the blocks of the failed storage device that the spare storage device does not hold, and restoring redundancy by replacing the failed storage device with the spare storage device.

In one aspect of the present invention, a recovery method for a storage apparatus is a recovery method for a storage apparatus that includes a plurality of storage devices constituting a plurality of types of RAID and a spare storage device. The method mirrors a predetermined block in each of all the storage devices to the spare storage device and, when any of the storage devices fails, reconstructs in the spare storage device the blocks of the failed storage device that the spare storage device does not hold, and restores redundancy by replacing the failed storage device with the spare storage device.

In one aspect of the present invention, a recovery program for a disk array controller, or a non-transitory storage medium storing the recovery program, causes a computer provided in a disk array controller connected to a plurality of storage devices constituting a plurality of types of RAID and to a spare storage device to execute: partial mirroring processing that mirrors a predetermined block in each of all the storage devices constituting the plurality of types of RAID to the spare storage device; and reconfiguration processing that, when any of the storage devices fails, reconstructs in the spare storage device the blocks of the failed storage device that the spare storage device does not hold, and restores redundancy by replacing the failed storage device with the spare storage device.

According to the present invention, when a plurality of storage devices constitute a plurality of types of RAID, the time required from the occurrence of a storage device failure to the restoration of RAID redundancy can be shortened.

FIG. 1 is a block diagram showing an example of the configuration of a storage apparatus according to a first embodiment of the present invention.
FIG. 2 is a flowchart showing the operation of the storage apparatus according to the first embodiment of the present invention.
FIG. 3 is a diagram illustrating an operation example of the storage apparatus according to the first embodiment of the present invention.
FIG. 4 is a diagram illustrating another operation example of the storage apparatus according to the first embodiment of the present invention.
FIG. 5 is a block diagram showing an example of a hardware configuration capable of realizing the storage apparatus according to each embodiment of the present invention.

Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. In all the drawings, equivalent components are denoted by the same reference numerals, and their description is omitted as appropriate.
(First Embodiment)
The configuration of the present embodiment will be described.

FIG. 1 is a block diagram showing an example of the configuration of a storage apparatus according to the first embodiment of the present invention. FIG. 1 shows an example in which storage devices D1 and D2 constitute a RAID according to the RAID 1 scheme, and storage devices D3, D4, and D5 constitute a RAID according to the RAID 5 scheme. Here, A1, A2, A3, ... denote the blocks (or the data in them) that make up one series of data, and B1, B2, B3, ... denote the blocks (or the data in them) that make up another series of data. PB12 is a block containing parity data calculated from block B1 and block B2. That is, the remaining block can be calculated from any two of block B1, block B2, and block PB12. The same applies to blocks PB34, PB56, and so on. The RAID schemes and the number of storage devices constituting each RAID in the present embodiment are not limited to this example.
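The parity relationship among B1, B2, and PB12 can be illustrated with a minimal Python sketch. It assumes that the parity block is the bytewise XOR of the data blocks, as is usual for RAID 5; the publication itself does not state the parity function, and the helper name `xor_blocks` and the sample byte values are illustrative assumptions.

```python
def xor_blocks(x: bytes, y: bytes) -> bytes:
    """Return the bytewise XOR of two equally sized blocks."""
    if len(x) != len(y):
        raise ValueError("blocks must have the same size")
    return bytes(a ^ b for a, b in zip(x, y))


# Illustrative block contents (assumed values, not taken from the publication).
b1 = bytes([0x12, 0x34, 0x56, 0x78])
b2 = bytes([0x9A, 0xBC, 0xDE, 0xF0])

# Parity block PB12 calculated from blocks B1 and B2 (XOR parity is an assumption).
pb12 = xor_blocks(b1, b2)

# Any two of B1, B2, and PB12 yield the remaining block.
recovered_b1 = xor_blocks(b2, pb12)
recovered_b2 = xor_blocks(b1, pb12)
```

Under this assumption, the same function serves both to compute parity and to recover a lost block, because XOR is its own inverse.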

The storage apparatus 100 in the present embodiment includes a plurality of storage devices 140, a spare storage device 150, and a disk array controller 110.

The plurality of storage devices 140 constitute a plurality of types of RAID. Each storage device 140 is, for example, an HDD, an SSD (Solid State Drive), or a non-volatile memory.

The spare storage device 150 mirrors a predetermined subset of the blocks of each storage device 140. The spare storage device 150 is, for example, an HDD, an SSD, or a non-volatile memory.

The disk array controller 110 controls each storage device 140 according to one of a plurality of RAID schemes. The disk array controller 110 includes a partial mirroring unit 120 and a reconfiguration unit 130. Here, the plurality of RAID schemes are, for example, RAID 1 and RAID 5, or RAID 1 and RAID 6. When RAID 6 is used instead of RAID 5, a storage device 140 constituting the RAID may be added.

The partial mirroring unit 120 mirrors a predetermined block in each of all the storage devices 140 to the spare storage device 150. In FIG. 1, the spare storage device 150 holds mirrored copies of block A1 of storage device D1, block A2 of storage device D2, block PB56 of storage device D3, block B8 of storage device D4, and block B0 of storage device D5.

When any of the storage devices 140 fails, the reconfiguration unit 130 reconstructs, in the spare storage device 150, the blocks of the failed storage device 140 that the spare storage device 150 does not hold. The reconfiguration unit 130 then restores redundancy by replacing the failed storage device 140 with the spare storage device 150.

The operation of the present embodiment will be described.

FIG. 2 is a flowchart showing the operation of the storage apparatus according to the first embodiment of the present invention.

First, the storage apparatus 100 mirrors a predetermined block in each of all the storage devices 140 constituting the plurality of types of RAID to the spare storage device 150 (step S110). At least one predetermined block is selected in each storage device 140 constituting the plurality of types of RAID. The total size of the blocks selected in each storage device 140 is preferably set so that the time required to reconstruct a failed storage device 140 (step S130, described later) is equalized.

Next, the storage apparatus 100 detects whether any of the storage devices 140 has failed (step S120). A failure of a storage device 140 is detected using, for example, S.M.A.R.T. (Self-Monitoring Analysis and Reporting Technology).

If none of the storage devices 140 has failed (step S120: No), the storage apparatus 100 returns to step S110.

If any of the storage devices 140 has failed (step S120: Yes), the storage apparatus 100 reconstructs, in the spare storage device 150, the blocks of the failed storage device 140 that the spare storage device 150 does not hold (step S130).

The storage apparatus 100 then restores redundancy by replacing the failed storage device 140 with the spare storage device 150 (step S140).
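The control flow of steps S110 to S140 can be sketched as a simple polling loop. The following Python sketch is only an illustration: the function `run_recovery_cycle`, its callback parameters, and the `max_cycles` bound are hypothetical names introduced here (the flowchart in the publication loops without such a bound), and the callbacks stand in for the actual mirroring, failure detection, and rebuild logic.

```python
def run_recovery_cycle(mirror_step, find_failed, rebuild_missing, swap_in_spare, max_cycles):
    """Repeat S110 and S120 until a failure is found, then run S130 and S140.

    Returns the failed device, or None if no failure occurred within
    max_cycles iterations (the bound is an assumption that keeps the
    sketch terminating).
    """
    for _ in range(max_cycles):
        mirror_step()                # S110: mirror predetermined blocks to the spare
        failed = find_failed()       # S120: check whether any device has failed
        if failed is None:
            continue                 # S120: No, so return to S110
        rebuild_missing(failed)      # S130: rebuild only blocks the spare lacks
        swap_in_spare(failed)        # S140: replace the failed device with the spare
        return failed
    return None


# Usage with stand-in callbacks: the third poll reports that "D5" has failed.
log = []
polls = iter([None, None, "D5"])
result = run_recovery_cycle(
    mirror_step=lambda: log.append("S110"),
    find_failed=lambda: next(polls),
    rebuild_missing=lambda dev: log.append(f"S130:{dev}"),
    swap_in_spare=lambda dev: log.append(f"S140:{dev}"),
    max_cycles=10,
)
```

Passing the steps as callbacks keeps the loop independent of how failures are detected or how blocks are rebuilt, which the publication leaves open.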

An operation example of the present embodiment will be described.

FIG. 3 is a diagram illustrating an operation example of the storage apparatus according to the first embodiment of the present invention.

Before a failure occurs, blocks A1, A2, PB56, B8, and B0 of storage devices D1, D2, D3, D4, and D5, respectively, are mirrored in the spare storage device 150 (FIG. 1). Suppose that storage device D5 fails at some point.

The storage apparatus 100 then starts reconstructing (rebuilding) storage device D5 in the spare storage device 150. Because block B0 has already been mirrored, the storage apparatus 100 rebuilds only the data of PB12, B4, B6, PB78, ... of storage device D5, based on the data held by storage devices D3 and D4. As a result, the rebuild time is shortened by the time that would have been required to rebuild block B0, which was already mirrored in the spare storage device 150.
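The rebuild of storage device D5 in this example can be sketched in Python as follows. The sketch assumes XOR parity, so that each block of D5 equals the XOR of the blocks of D3 and D4 in the same stripe; the block contents, the block size, and the stripe-by-stripe list layout are illustrative assumptions, and only the block names of D5 come from the description.

```python
def xor_blocks(x: bytes, y: bytes) -> bytes:
    """Return the bytewise XOR of two equally sized blocks."""
    return bytes(a ^ b for a, b in zip(x, y))


BLOCK_SIZE = 8  # assumed block size in bytes
d5_names = ["B0", "PB12", "B4", "B6", "PB78"]  # blocks held by D5, one per stripe

# Assumed stripe contents: D3 and D4 hold arbitrary blocks, and D5 holds their
# XOR, so the three blocks of each stripe always XOR to zero.
d3 = [bytes([i + 1] * BLOCK_SIZE) for i in range(len(d5_names))]
d4 = [bytes([16 * (i + 1)] * BLOCK_SIZE) for i in range(len(d5_names))]
d5_original = [xor_blocks(x, y) for x, y in zip(d3, d4)]

# Before the failure, the spare already holds a mirrored copy of B0.
spare = {"B0": d5_original[0]}

# After D5 fails, rebuild only the blocks that the spare does not hold.
rebuilt_names = []
for i, name in enumerate(d5_names):
    if name in spare:
        continue  # already mirrored, so no rebuild is needed
    spare[name] = xor_blocks(d3[i], d4[i])
    rebuilt_names.append(name)
```

In this sketch four of the five blocks are rebuilt from D3 and D4, and B0 is skipped because the spare already holds it.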

FIG. 4 is a diagram illustrating another operation example of the storage apparatus according to the first embodiment of the present invention.

Before a failure occurs, blocks A1, A2, PB56, B8, and B0 of storage devices D1, D2, D3, D4, and D5, respectively, are mirrored in the spare storage device 150 (FIG. 1). Suppose that storage device D2 fails at some point.

The storage apparatus 100 then starts rebuilding storage device D2 in the spare storage device 150. Because blocks A1 and A2 have already been mirrored, the storage apparatus 100 rebuilds only blocks A3, A4, A5, ... of storage device D2, based on the data held by storage device D1. As a result, the rebuild time is shortened by the time that would have been required to rebuild blocks A1 and A2, which were already mirrored in the spare storage device 150.
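For the RAID 1 case, the rebuild reduces to copying the missing blocks from the surviving device. A minimal Python sketch follows; the five-block device size and the block contents are illustrative assumptions, and the blocks mirrored from D3 to D5 are omitted because they play no part in rebuilding D2.

```python
# D1 and D2 hold identical data under RAID 1, so D1 alone can supply D2's blocks.
d1 = {f"A{i}": f"data-{i}".encode() for i in range(1, 6)}  # assumed contents

# Before the failure, the spare holds A1 (mirrored from D1) and A2 (mirrored from D2).
spare = {"A1": d1["A1"], "A2": d1["A2"]}

# After D2 fails, copy from D1 only the blocks that the spare does not hold.
copied = []
for name, data in d1.items():
    if name not in spare:
        spare[name] = data
        copied.append(name)
```

Because both RAID 1 devices hold the same blocks, a block mirrored from either device counts toward the rebuild of the other, which is why two blocks are skipped here.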

As described above, the storage apparatus 100 in the present embodiment mirrors, for all the storage devices 140 constituting the plurality of types of RAID, a predetermined subset of the blocks of each storage device 140 to the spare storage device 150. When any of the storage devices 140 fails, the storage apparatus 100 reconstructs, in the spare storage device 150, the blocks of the failed storage device 140 that the spare storage device 150 does not hold. The storage apparatus 100 then restores redundancy by replacing the failed storage device 140 with the spare storage device 150. Therefore, when a plurality of storage devices constitute a plurality of types of RAID, the storage apparatus 100 in the present embodiment can shorten the time required from the occurrence of a storage device failure to the restoration of RAID redundancy.

In particular, when the sum of the total sizes of the predetermined blocks mirrored from storage devices D1 and D2 to the spare storage device 150 and the total size of the predetermined blocks mirrored from each of storage devices D3, D4, D5, ... to the spare storage device 150 are equalized, the time required from the occurrence of a failure in a storage device 140 to the restoration of RAID redundancy is equalized across the storage devices.
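The equalization condition can be checked with simple arithmetic. The Python sketch below rests on several assumptions that are not in the publication: every device holds the same number of equally sized blocks, rebuild time is proportional to the number of blocks still to be rebuilt, and D1 and D2 mirror different blocks. All numbers are illustrative.

```python
def blocks_to_rebuild(total_blocks: int, usable_mirrored: int) -> int:
    """Blocks that must still be rebuilt after a failure."""
    return total_blocks - usable_mirrored


TOTAL_BLOCKS = 100  # assumed number of blocks per storage device

# Assumed number of blocks mirrored to the spare from each device.
raid1_mirrored = {"D1": 5, "D2": 5}
raid5_mirrored = {"D3": 10, "D4": 10, "D5": 10}

# For RAID 1, blocks mirrored from either device are usable for the other,
# so the sum over D1 and D2 counts toward a rebuild of either one.
raid1_usable = sum(raid1_mirrored.values())

remaining = {dev: blocks_to_rebuild(TOTAL_BLOCKS, raid1_usable) for dev in raid1_mirrored}
remaining.update(
    {dev: blocks_to_rebuild(TOTAL_BLOCKS, count) for dev, count in raid5_mirrored.items()}
)

# The condition holds when every device leaves the same amount of work.
equalized = len(set(remaining.values())) == 1
```

With these numbers the RAID 1 sum (5 + 5) matches the per-device RAID 5 total (10), so a failure of any device leaves 90 blocks to rebuild. The model ignores that a RAID 5 rebuild reads two devices and computes parity while a RAID 1 rebuild only copies, so real timings may differ.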

FIG. 5 is a block diagram showing an example of a hardware configuration capable of realizing the storage apparatus according to each embodiment of the present invention.

The storage apparatus 907 includes a storage device 902, a CPU (Central Processing Unit) 903, a keyboard 904, a monitor 905, and an I/O (Input/Output) device 908, which are connected by an internal bus 906. The storage device 902 stores the operation programs of the CPU 903, such as those of the partial mirroring unit 120 and the reconfiguration unit 130. The CPU 903 controls the storage apparatus 907 as a whole, executes the operation programs stored in the storage device 902, and, through the I/O device 908, executes the programs of the partial mirroring unit 120, the reconfiguration unit 130, and the like, and transmits and receives data. The internal configuration of the storage apparatus 907 described above is an example. The storage apparatus 907 may be configured so that the keyboard 904 and the monitor 905 are connected as needed.

The storage apparatus 907 in each embodiment of the present invention described above may be realized by a dedicated apparatus, but, apart from the operation of the hardware by which the I/O device 908 communicates with the outside, it can also be realized by a computer (information processing apparatus). In each embodiment of the present invention, the I/O device 908 is, for example, an input/output unit for the storage devices 140, the spare storage device 150, and the like. In this case, the computer reads the software program stored in the storage device 902 into the CPU 903 and executes it on the CPU 903. In each of the embodiments described above, it suffices that the software program contains a description capable of realizing the functions of the storage apparatus 907, or of each unit of the storage apparatus 907, shown in FIG. 1. These units may, however, also include hardware as appropriate. In such a case, the software program (computer program) can be regarded as constituting the present invention. Furthermore, a computer-readable storage medium storing the software program can also be regarded as constituting the present invention.

The present invention has been described above by way of example with reference to the embodiments and their modifications. However, the technical scope of the present invention is not limited to the scope described in the embodiments and their modifications. It is apparent to those skilled in the art that various changes or improvements can be made to the embodiments. New embodiments incorporating such changes or improvements can also be included in the technical scope of the present invention. This is clear from the matters recited in the claims.

This application claims priority based on Japanese Patent Application No. 2018-002031 filed on January 10, 2018, the entire disclosure of which is incorporated herein.

The present invention can be used in applications that shorten the recovery time when a storage device constituting a RAID fails.

100 Storage apparatus
110 Disk array controller
120 Partial mirroring unit
130 Reconfiguration unit
140 Storage device
150 Spare storage device
902 Storage device
903 CPU
904 Keyboard
905 Monitor
906 Internal bus
907 Storage apparatus
908 I/O device

Claims

1. A disk array controller comprising:
partial mirroring means for mirroring a predetermined block in each of all storage devices constituting a plurality of types of RAID to a spare storage device; and
reconfiguration means for, when any one of the storage devices fails, reconstructing in the spare storage device the blocks of the failed storage device that the spare storage device does not hold, and recovering redundancy by replacing the failed storage device with the spare storage device.

2. The disk array controller according to claim 1, wherein the sum of the total sizes of the predetermined blocks in all the storage devices constituting RAID 1 and the total size of the predetermined blocks in each of the storage devices constituting either RAID 5 or RAID 6 are equalized.

3. A storage apparatus comprising:
the disk array controller according to claim 1 or 2;
the storage devices; and
the spare storage device.

4. A recovery method for a storage apparatus that comprises a plurality of storage devices constituting a plurality of types of RAID and a spare storage device, the method comprising:
mirroring a predetermined block in each of all the storage devices to the spare storage device; and
when any one of the storage devices fails, reconstructing in the spare storage device the blocks of the failed storage device that the spare storage device does not hold, and recovering redundancy by replacing the failed storage device with the spare storage device.

5. The recovery method for a storage apparatus according to claim 4, wherein the sum of the total sizes of the predetermined blocks in all the storage devices constituting RAID 1 and the total size of the predetermined blocks in each of the storage devices constituting either RAID 5 or RAID 6 are equalized.

6. A non-transitory storage medium storing a recovery program for a disk array controller, the recovery program causing a computer provided in a disk array controller connected to a plurality of storage devices constituting a plurality of types of RAID and to a spare storage device to execute:
partial mirroring processing of mirroring a predetermined block in each of all the storage devices constituting the plurality of types of RAID to the spare storage device; and
reconfiguration processing of, when any one of the storage devices fails, reconstructing in the spare storage device the blocks of the failed storage device that the spare storage device does not hold, and recovering redundancy by replacing the failed storage device with the spare storage device.

7. The non-transitory storage medium storing the recovery program for a disk array controller according to claim 6, wherein the sum of the total sizes of the predetermined blocks in all the storage devices constituting RAID 1 and the total size of the predetermined blocks in each of the storage devices constituting either RAID 5 or RAID 6 are equalized.