JP4469822B2

JP4469822B2 - Disk array device and data management method for disk array device

Info

Publication number: JP4469822B2
Application number: JP2006251721A
Authority: JP
Inventors: 義光上山
Original assignee: Toshiba Corp; Toshiba Solutions Corp
Current assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Priority date: 2006-09-15
Filing date: 2006-09-15
Publication date: 2010-06-02
Anticipated expiration: 2026-09-15
Also published as: JP2008071297A

Description

本発明は、例えばＲＡＩＤ（redundant array of inexpensive disks）６を構成するディスクアレイ装置に適用して好適なデータ管理技術に関する。 The present invention relates to a data management technique suitable for application to, for example, a disk array device constituting a redundant array of inexpensive disks (RAID) 6.

近年、電子商取引の普及等に伴い、高応答性や耐障害性など、データ記憶装置に対する要求が非常に高まっている。この要求に応える技術として、複数のディスク装置を並列に接続して、これらをあたかも1台のディスク装置のように連携して動作させるＲＡＩＤ機能が存在する。 In recent years, with the spread of electronic commerce and the like, demands for data storage devices such as high responsiveness and fault tolerance have been greatly increased. As a technology that meets this requirement, there is a RAID function in which a plurality of disk devices are connected in parallel, and operate as if they were a single disk device.

特に、最新のＲＡＩＤ−６では、パリティデータなどと称される冗長データを２種類使用することにより、整合性確認の強化が図られている。また、このＲＡＩＤ−６を構成するディスクアレイ装置においては、データ不正箇所を特定するための手法なども種々提案されている（例えば特許文献１等参照）。
特開２００５−２９３２６３号公報 In particular, in the latest RAID-6, the consistency check is strengthened by using two types of redundant data called parity data. Further, in the disk array device constituting RAID-6, various methods for specifying an illegal data location have been proposed (see, for example, Patent Document 1).
JP 2005-293263 A

この特許文献１に記載のディスクアレイ装置では、２種類の冗長データが存在することに着目し、被疑データを未知数とする連立方程式を順次解いていくことにより、誤りデータを特定する。 In the disk array device described in Patent Document 1, attention is given to the presence of two types of redundant data, and error data is identified by sequentially solving simultaneous equations with suspicious data as unknowns.

しかしながら、この手法は、ディスクアレイ装置が備える複数のディスク装置すべてが正常稼働している場合、より厳密には、ストライプグループ内に、故障状態にあるディスク装置内の区域が配置されていない場合にのみ有効である。換言すれば、１台のディスク装置が故障状態にある場合、この故障状態にあるディスク装置内の区域が配置されたストライプグループでは、整合性確認で不整合が検出できたとしても、データ不正箇所を特定することはできないことになる。 However, this method is used when all of the plurality of disk devices included in the disk array device are operating normally. More strictly, when the area in the disk device in the failed state is not arranged in the stripe group. Only valid. In other words, if one disk unit is in a failed state, even if an inconsistency can be detected by the consistency check in the stripe group in which an area in the disk unit in the failed state is arranged, the data illegal location Cannot be specified.

この発明は、このような事情を考慮したものであり、例えばＲＡＩＤ−６を構成する複数のディスク装置の中の１台のディスク装置が故障状態にあっても、整合性確認で不整合が検出された場合には、データ不正箇所を特定すること等を可能とするディスクアレイ装置およびディスクアレイ装置のデータ管理方法を提供することを目的とする。 The present invention takes such circumstances into consideration, and for example, even if one of the plurality of disk devices constituting RAID-6 is in a failure state, inconsistency is detected by the consistency check. In such a case, it is an object of the present invention to provide a disk array device and a data management method for the disk array device that make it possible to specify an illegal portion of data.

前述した目的を達成するために、この発明は、複数のディスク装置にデータおよび前記データの冗長データを分散して記録するディスクアレイ装置において、前記データおよび前記冗長データの実際の記録単位よりも大きいデータ長で前記複数のディスク装置それぞれの記録領域を区域分けして、前記データまたは前記冗長データの書き込みに関するアクセス履歴情報を記録するための冗長領域を前記複数のディスク装置それぞれの各区域ごとに確保する冗長領域確保手段と、データの書き込み時、同一の冗長データを用いるストライプグループごとに、当該データが記録される１または２以上の区域の冗長領域と当該データの冗長データが記録される区域の冗長領域とに、そのストライプグループを構成する全区域の中で前記データまたは前記冗長データが同期的に記録される区域を示すビットマップデータを含む同一のアクセス履歴情報を記録するアクセス履歴情報記録手段と、前記複数のディスク装置の中の１台のディスク装置が故障状態にあるとき、この故障状態にあるディスク装置内の区域が配置されたストライプグループにおいて前記冗長データを用いた整合性確認で不整合が検出された場合に、前記冗長データが記録される区域の冗長領域に記録されたアクセス履歴情報および当該アクセス履歴情報に含まれるビットマップデータによって前記冗長データと同期的に前記データが記録された旨が示される区域の冗長領域に記録されたアクセス履歴情報を読み出して比較することにより、データ不正箇所を特定するデータ不正箇所特定手段と、を具備することを特徴とする。 In order to achieve the above-described object, the present invention provides a disk array device that records data and redundant data of the data in a plurality of disk devices in a distributed manner and is larger than the actual recording unit of the data and redundant data. The recording area of each of the plurality of disk devices is divided into sections by data length, and a redundant area for recording access history information related to writing of the data or the redundant data is secured for each of the plurality of disk devices. A redundant area securing means for the data, and at the time of data writing, for each stripe group using the same redundant data, the redundant area of one or more areas where the data is recorded and the area where the redundant data of the data is recorded in the redundant area, the data or in the entire area constituting the stripe group And the access history information recording means for serial redundant data records with the same access history information including bitmap data indicating the area to be synchronously recorded, the fault condition one disk apparatus in said plurality of disk devices When a mismatch is detected by the consistency check using the redundant data in the stripe group in which the area in the disk device in the failed state is arranged, the redundant area of the area in which the redundant data is recorded The access history information recorded in the redundant area of the area where the fact that the data has been recorded synchronously with the redundant data is read out by the access history information recorded in the bitmap and the bitmap data included in the access history information. by comparison, the characterized by comprising a data invalid portion identifying means for identifying the data invalid location, the That.

また、この発明は、複数のディスク装置にデータおよび前記データの冗長データを分散して記録するディスクアレイ装置のデータ管理方法であって、前記データおよび前記冗長データの実際の記録単位よりも大きいデータ長で前記複数のディスク装置それぞれの記録領域を区域分けして、前記データまたは前記冗長データの書き込みに関するアクセス履歴情報を記録するための冗長領域を前記複数のディスク装置それぞれの各区域ごとに確保し、データの書き込み時、同一の冗長データを用いるストライプグループごとに、当該データが記録される１または２以上の区域の冗長領域と当該データの冗長データが記録される区域の冗長領域とに、そのストライプグループを構成する全区域の中で前記データまたは前記冗長データが同期的に記録される区域を示すビットマップデータを含む同一のアクセス履歴情報を記録し、前記複数のディスク装置の中の１台のディスク装置が故障状態にあるとき、この故障状態にあるディスク装置内の区域が配置されたストライプグループにおいて前記冗長データを用いた整合性確認で不整合が検出された場合に、前記冗長データが記録される区域の冗長領域に記録されたアクセス履歴情報および当該アクセス履歴情報に含まれるビットマップデータによって前記冗長データと同期的に前記データが記録された旨が示される区域の冗長領域に記録されたアクセス履歴情報を読み出して比較することにより、データ不正箇所を特定する、ことを特徴とする。 Further, the present invention provides a data management method for a disk array device for distributing and recording data and redundant data of the data on a plurality of disk devices, wherein the data is larger than the actual recording unit of the data and redundant data. The recording area of each of the plurality of disk devices is divided into sections by length, and a redundant area for recording access history information related to the writing of the data or the redundant data is secured for each section of the plurality of disk devices. , when writing data, each stripe group using the same redundant data, to the redundant area of zone 1 or 2 or more sections of the redundant area and the data redundancy data the data is recorded is recorded, the The data or the redundant data is recorded synchronously in all areas constituting the stripe group The same access history information including a bitmap data indicating the area to record that, when said plurality of one disk device in the disk device is in failure state, arranged areas in the disk device in this fault state Included in the access history information and the access history information recorded in the redundant area of the area where the redundant data is recorded when inconsistency is detected in the consistency check using the redundant data in the stripe group An illegal data location is identified by reading out and comparing access history information recorded in a redundant area of an area where it is indicated that the data is recorded synchronously with the redundant data by bitmap data. And

この発明によれば、例えばＲＡＩＤ−６を構成する複数のディスク装置の中の１台のディスク装置が故障状態にあっても、整合性確認で不整合が検出された場合には、データ不正箇所を特定すること等を可能とする。 According to the present invention, for example, if an inconsistency is detected in the consistency check even if one of the plurality of disk devices constituting RAID-6 is in a failure state, an illegal data location is detected. Can be specified.

以下、図面を参照して本発明の実施形態を説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

図１には、本実施形態に係るディスクアレイ装置の構成例が示されている。このディスクアレイ装置は、ＲＡＩＤコントローラ２と複数のＨＤＤ（hard disk drive）３とで構成され、あたかも１台のディスク装置のごとく振る舞ってホスト装置１からのアクセス要求を受け付ける。そして、このＲＡＩＤコントローラ２は、複数のＨＤＤ３を使ってＲＡＩＤ−６形式のＲＡＩＤ４を構成する。つまり、ＲＡＩＤコントローラ２は、並列接続された複数のＨＤＤ３でストライピングを行い、各ストライプグループ４ＡごとにＰパリティ／Ｑパリティと称される２種類のパリティデータ（冗長データ）を生成して、この２種類のパリティデータを複数のＨＤＤ３に巡回的に分散させて記録する。 FIG. 1 shows a configuration example of a disk array device according to the present embodiment. This disk array device is composed of a RAID controller 2 and a plurality of hard disk drives (HDDs) 3 and behaves as if it is one disk device and accepts an access request from the host device 1. The RAID controller 2 configures RAID-4 in RAID-6 format using a plurality of HDDs 3. That is, the RAID controller 2 performs striping with a plurality of HDDs 3 connected in parallel to generate two types of parity data (redundant data) called P parity / Q parity for each stripe group 4A. Kinds of parity data are cyclically distributed and recorded in a plurality of HDDs 3.

ここで、本実施形態のディスクアレイ装置についての理解を助けるために、ＲＡＩＤ−６の基本原理について説明する。図２に、ＲＡＩＤ−６のデータ配置例を示す。 Here, the basic principle of RAID-6 will be described in order to facilitate understanding of the disk array device of the present embodiment. FIG. 2 shows a data arrangement example of RAID-6.

（１）ＲＡＩＤ−６のパリティ生成
（ａ）Ｐパリティデータ生成時の計算式
ＲＡＩＤ−６のＰパリティデータは、ＲＡＩＤ−５のパリティデータと同様に、対象となるストライプグループ全てのデータストライプのビット単位の排他的論理和（ＸＯＲ）演算にて算出する。例えば、ストライプグループ１のＰパリティデータは、次の（１）式で求められる。

(1) Parity generation of RAID-6 (a) Calculation formula when generating P parity data RAID-6 P parity data, like RAID-5 parity data, is a bit of data stripes of all target stripe groups Calculated by exclusive OR (XOR) operation of units. For example, the P parity data of the stripe group 1 is obtained by the following equation (1).

（ｂ）Ｑパリティデータは、対象となるストライプグループの各々のデータストライプに、そのストライプグループに唯一のデータとなる係数（重み）を用いてガロア体の乗算を行い、それぞれの積をＸＯＲ演算することで算出する。例えば、ストライプグループ１のＱパリティデータは、次の（２）式で求められる。

(B) Q parity data is obtained by multiplying each data stripe of a target stripe group by a Galois field using a coefficient (weight) that is the only data in the stripe group, and XORing each product. To calculate. For example, the Q parity data of the stripe group 1 is obtained by the following equation (2).

（２）ＲＡＩＤ−６のパリティ更新
ＲＡＩＤ−６のＰパリティデータとＱパリティデータは、データストライプのデータが新しいデータで更新される場合には、整合性を維持するためにそれぞれ更新しなければならない。 (2) RAID-6 parity update RAID-6 P parity data and Q parity data must be updated to maintain consistency when data stripe data is updated with new data. .

（ａ）Ｐパリティデータ更新時の計算式
Ｐパリティデータ更新の計算には、ｎｅｗデータを用いてパリティ生成時と同じ計算を行う方法と、ライト対象のｏｌｄデータおよびｎｅｗデータとＰパリティのｏｌｄデータとから計算を行う方法がある。ストライプグループ１のデータ１がライトされた場合の後者の方法による計算式を以下に示す（（３）式）。

(A) Calculation formula for updating P parity data For the calculation of updating P parity data, the same calculation method as that used when generating parity using new data, the old data to be written, the new data, and the old data of P parity are used. There is a method to calculate from. A calculation formula by the latter method when data 1 of the stripe group 1 is written is shown below (formula (3)).

（ｂ）Ｑパリティデータ更新時の計算式
Ｑパリティデータ更新にも２種類の計算方法があり、以下に、ストライプグループ１のデータ２がライトされた場合に、ライト対象のｏｌｄデータおよびｎｅｗデータとＱパリティのｏｌｄデータとから計算を行う計算式を示す（（４）式）。

(B) Calculation Formula for Updating Q Parity Data There are two types of calculation methods for Q parity data updating. When data 2 of stripe group 1 is written, the old data and new data to be written are A calculation formula for calculating from the Q parity old data is shown (Formula (4)).

（３）ＲＡＩＤ−６の整合性確認
例えば、ＨＤＤに格納したデータとパリティデータを読み出し、読み出したデータをもとにパリティデータを再計算して、先に読み出したパリティデータと一致するか否かを確認する。 (3) Consistency check of RAID-6 For example, the data and parity data stored in the HDD are read, the parity data is recalculated based on the read data, and whether or not it matches the previously read parity data Confirm.

（ａ）ＲＡＩＤが正常状態の場合
ＲＡＩＤが正常状態の場合は、すべてのＨＤＤからデータを読み出してパリティを再計算し、読み出したパリティと一致するかの確認を行う。なお、不整合を検出した場合、改めてパリティを再生する等、システムの方針に基づいてリカバリする。 (A) When the RAID is in a normal state When the RAID is in a normal state, the data is read from all the HDDs, the parity is recalculated, and it is confirmed whether or not it matches the read parity. When inconsistency is detected, recovery is performed based on the policy of the system, such as regenerating parity again.

（ア）Ｐパリティデータを使用した整合性の確認方法
次の（５）式が成立することを確認する。

(A) Consistency confirmation method using P parity data Confirm that the following equation (5) holds.

（イ）Ｑパリティデータを使用した整合性の確認方法
次の（６）式が成立することを確認する。

(A) Consistency confirmation method using Q parity data It is confirmed that the following equation (6) holds.

（ｂ）ＲＡＩＤが縮退状態（ＨＤＤが１台故障）の場合
ＲＡＩＤが縮退状態で、かつ、故障ＨＤＤがデータストライプの場合は、まずＰパリティとその他の正常ＨＤＤのデータから故障したＨＤＤのデータを復元し、その復元データと正常ＨＤＤのデータからＱパリティを再計算し、読み出したＱパリティと一致するかの確認を行う。 (B) When the RAID is in a degraded state (one HDD has failed) When the RAID is in a degraded state and the failed HDD is a data stripe, first, the failed HDD data is determined from the P parity and other normal HDD data. Then, the Q parity is recalculated from the restored data and the normal HDD data, and it is confirmed whether or not it matches the read Q parity.

例えば、図３に示すように、ストライプグループ１のデータ２が記録されたＨＤＤが故障した場合を想定すると、この場合の整合性確認は、以下の（７）式，（８）式の順に計算して、（８）式が成立することを確認する。

For example, as shown in FIG. 3, assuming that the HDD in which the data 2 of the stripe group 1 is recorded is broken, the consistency check in this case is calculated in the order of the following formulas (7) and (8). Then, it is confirmed that the formula (8) is established.

以上のように、ＲＡＩＤ−６では、ＨＤＤが１台故障の縮減状態の場合にも、データ整合性確認を行うことができる。しかしながら、このデータ整合性確認で万一不整合が検出された場合は、データ誤りのある不正箇所を特定することができない。そこで、本実施形態のディスクアレイ装置は、ＨＤＤが１台故障の縮減状態の場合にあっても、データ整合性確認で不整合が検出された場合には、データ不正箇所を特定すること等を可能としたものであり、以下、この点について詳述する。 As described above, in RAID-6, data consistency can be confirmed even when one HDD is in a reduced state of failure. However, if an inconsistency is detected in this data consistency check, an illegal location with a data error cannot be specified. Therefore, the disk array device according to the present embodiment specifies an illegal data location when an inconsistency is detected in the data consistency check even when one HDD is in a reduced state of failure. Hereinafter, this point will be described in detail.

図１に示すように、ＲＡＩＤコントローラ２は、制御部２１、整合性確認部２２、セクタ冗長領域確保部２３、アクセス履歴記録部２４、データ不正箇所特定部２５および簡易整合性確認部２６を有している。ＲＡＩＤコントローラ２は、制御部２１によって全体的な動作制御が司られており、整合性確認部２２、セクタ冗長領域確保部２３、アクセス履歴記録部２４、データ不正箇所特定部２５および簡易整合性確認部２６の各部は、この制御部２１の制御下で動作する。 As shown in FIG. 1, the RAID controller 2 includes a control unit 21, a consistency confirmation unit 22, a sector redundant area securing unit 23, an access history recording unit 24, an illegal data location specifying unit 25, and a simple consistency confirmation unit 26. is doing. The overall operation control of the RAID controller 2 is governed by the control unit 21, and includes a consistency check unit 22, a sector redundant area securing unit 23, an access history recording unit 24, an illegal data location specifying unit 25, and a simple consistency check. Each unit of the unit 26 operates under the control of the control unit 21.

整合性確認部２２は、例えばホスト装置１からのデータアクセスが無い空き時間などを利用して、前述したデータ整合性確認を実施する。即ち、ＨＤＤ３に格納されたデータとパリティデータを読み出し、読み出したデータをもとにパリティデータを再計算して、先に読み出したパリティデータと一致するか否かを確認する処理を実行する。 The consistency check unit 22 performs the above-described data consistency check using, for example, a free time when there is no data access from the host device 1. That is, the data and parity data stored in the HDD 3 are read, the parity data is recalculated based on the read data, and a process of confirming whether or not it matches the previously read parity data is executed.

セクタ冗長領域確保部２３は、ＨＤＤ３のセクタあたりの記憶容量（Byte par Sector）を、実際に有効なデータを格納する容量に対して大きくなるように設定して、未使用部分となる領域を確保する。この未使用部分となって任意に使用可能とした領域を、ここでは、セクタ冗長領域と称する。例えば、ＨＤＤ３の１セクタあたりの記録容量を５２０バイトフォーマットとし、見かけ上のフォーマットは１セクタあたり５１２バイトとすることで、セクタ冗長領域確保部２３は、８バイトのセクタ冗長領域を確保する。 The sector redundant area reservation unit 23 sets the storage capacity (Byte par Sector) per sector of the HDD 3 so as to be larger than the capacity for actually storing valid data, and secures an area to be an unused part. To do. The area that becomes an unused part and can be arbitrarily used is referred to herein as a sector redundant area. For example, by setting the recording capacity per sector of the HDD 3 to a 520-byte format and an apparent format to be 512 bytes per sector, the sector redundant area securing unit 23 secures an 8-byte sector redundant area.

また、アクセス履歴記録部２４は、セクタ冗長領域確保部２３によって確保されたセクタ冗長領域に、アクセス履歴情報およびデータ保護情報（ＣＲＣ：cyclic redundancy check）を記録する。アクセス履歴情報は、次の２種類の内容を含んでいる。 The access history recording unit 24 records access history information and data protection information (CRC: cyclic redundancy check) in the sector redundancy area secured by the sector redundancy area securing unit 23. The access history information includes the following two types of contents.

（１）タイムスタンプ
ライト要求に伴い、対象ストライプグループの同時ライトするＨＤＤ３すべてに同じタイムスタンプ（ＨＤＤ３にデータをライトする時刻）を記録する。 (1) Time stamp In accordance with the write request, the same time stamp (time for writing data to the HDD 3) is recorded in all the HDDs 3 to which the target stripe group is simultaneously written.

（２）ビットマップデータ
対象ストライプグループの同時ライトするＨＤＤ３すべてのＲＡＩＤ内の位置を示すビットマップデータを記録する。ライト先ＨＤＤ３が故障のためにデータをライトできない場合でも、ライト対象のＨＤＤ３であることを示すようにビットマップに反映する（後でＨＤＤ３が交換され、データ復元の際、セクタ冗長領域のアクセス履歴情報をコピーするときの制御で使用するためである）。 (2) Bitmap data Bitmap data indicating the positions in the RAID of all HDDs 3 to which the target stripe group is simultaneously written is recorded. Even when data cannot be written because the write destination HDD 3 is out of order, it is reflected in the bitmap so as to indicate that it is the write target HDD 3 (the HDD 3 is replaced later, and when the data is restored, the access history of the sector redundancy area) Because it is used for control when copying information).

図４は、セクタ冗長領域確保部２３によって確保されたセクタ冗長領域に、アクセス履歴記録部２４によってアクセス履歴情報およびデータ保護情報が記録される様子を示す概念図である。なお、ＲＡＩＤの初期化処理は、データストライプからパリティを生成してＨＤＤにライトする処理であるが、この初期化処理時には、データストライプおよびパリティストライプのすべてのＨＤＤのセクタ冗長領域に、初期化処理によってライトする時刻をタイムスタンプとして、ストライプグループすべてのＨＤＤがライト対象となるデータをビットマップデータとしてそれぞれ記録する。また、初期化処理中に初期化が完了していない論理アドレスに対するライト要求があった場合には、通常のライト時と同じアクセス履歴情報を記録しておく。初期化処理が進み、いずれ、そのストライプグループも初期化によって上記のアクセス履歴情報に上書きされることになる。 FIG. 4 is a conceptual diagram showing how access history information and data protection information are recorded by the access history recording unit 24 in the sector redundant area secured by the sector redundancy area securing unit 23. The RAID initialization process is a process of generating parity from the data stripe and writing it to the HDD. During this initialization process, the initialization process is performed on the sector redundancy areas of all HDDs of the data stripe and the parity stripe. As a time stamp, the HDDs of all stripe groups record data to be written as bitmap data. Also, when there is a write request for a logical address that has not been initialized during the initialization process, the same access history information as in normal writing is recorded. As the initialization process progresses, the access history information is overwritten by the initialization of the stripe group.

そして、データ不正箇所特定部２５は、このセクタ冗長領域に記録されたアクセス履歴情報に基づいて、これまでは不可能であった、ＨＤＤが１台故障の縮減状態時に整合性確認部２２が不整合を検出した場合のデータ不正箇所の特定を実現する。 Based on the access history information recorded in the sector redundancy area, the data illegal location specifying unit 25 determines that the consistency checking unit 22 is not possible when one HDD has been reduced. Realize the location of illegal data when consistency is detected.

データの更新（ライト）に伴い、その対象ストライプグループのデータとパリティ間の整合性を保つために、ＰパリティデータおよびＱパリティデータも同時に更新しなければならない。基本的にＲＡＩＤが正常な状態におけるデータライトでは、少なくともデータストライプ，Ｐパリティストライプ，Ｑパリティストライプの３つが同時に更新されるはずである。つまり、これらのＨＤＤ３にライトされたセクタ冗長領域には、同じ情報が格納されているはずであるので、この点に着目して、データ不正箇所特定部２５は、データ不正箇所を特定する。 As data is updated (written), P parity data and Q parity data must be updated simultaneously in order to maintain consistency between the data and parity of the target stripe group. Basically, in a data write in a normal RAID state, at least three data stripes, P parity stripes, and Q parity stripes should be updated simultaneously. That is, since the same information should be stored in the sector redundancy areas written in these HDDs 3, the data illegal location specifying unit 25 specifies the data illegal location by paying attention to this point.

次に、図５乃至図７を参照して、データライトに伴うセクタ冗長領域へのアクセス履歴情報の記録例（ケース１〜ケース３）を説明する。 Next, an example of recording access history information (case 1 to case 3) to the sector redundant area associated with data writing will be described with reference to FIGS.

（ケース１：図５）
ホスト装置１からのデータライト要求を受けると（図５（１））、ＲＡＩＤコントローラ２の制御部２１は、ライトすべきＨＤＤ３を判断する。ここでは、ＨＤＤ（１）にライトデータがライトされ、これに伴い、ＨＤＤ（４）のＰパリティデータ、ＨＤＤ（５）のＱパリティデータが更新（ライト）されるものとする。 (Case 1: Fig. 5)
When receiving a data write request from the host device 1 (FIG. 5 (1)), the control unit 21 of the RAID controller 2 determines the HDD 3 to be written. Here, it is assumed that write data is written to the HDD (1), and accordingly, the P parity data of the HDD (4) and the Q parity data of the HDD (5) are updated (written).

この場合、アクセス履歴記録部２４は、ＨＤＤ（１），ＨＤＤ（４），ＨＤＤ（５）がライト対象ＨＤＤであると判断されたことから、”１００１１”というビットマップデータを生成する。また、アクセス履歴記録部２４は、例えば内蔵する時計モジュール等のハードウェア資源より時刻を取得し、タイムスタンプを準備する。 In this case, since the access history recording unit 24 determines that the HDD (1), the HDD (4), and the HDD (5) are write target HDDs, the access history recording unit 24 generates bitmap data “10011”. Further, the access history recording unit 24 acquires time from hardware resources such as a built-in clock module, and prepares a time stamp.

これと並行して、ＲＡＩＤコントローラ２では、Ｐパリティデータ／Ｑパリティデータを更新するためのｏｌｄデータのリード（図５（２））と、Ｐパリティデータ／Ｑパリティデータの再計算（図５（３），（４））とが行われる。図５の例では、前述した２つの計算方法のうち、ライト対象のｏｌｄデータとｎｅｗデータとを使った計算方法が使用されている。 In parallel with this, the RAID controller 2 reads the old data for updating the P parity data / Q parity data (FIG. 5 (2)) and recalculates the P parity data / Q parity data (FIG. 5 ( 3) and (4)) are performed. In the example of FIG. 5, the calculation method using the write target old data and new data among the two calculation methods described above is used.

そして、ライトデータおよびＰパリティデータ／ＱパリティデータのＨＤＤへのライトが行われる際（図５（５））、アクセス履歴記録部２４は、セクタ冗長領域へのタイムスタンプおよびビットマップデータのライトを同時に行う。その結果、ライトデータおよびＰパリティデータ／ＱパリティデータがライトされたＨＤＤ（１）、ＨＤＤ（４）、ＨＤＤ（５）の各セクタのセクタ冗長領域には、図５（Ｂ）に示すような、同一のアクセス履歴情報が記録されることになる。また、このセクタ冗長領域へのタイムスタンプおよびビットマップデータのライト時、アクセス履歴記録部２４は、ライトデータおよびＰパリティデータ／ＱパリティデータのＣＲＣを計算して、それぞれのセクタ冗長領域へライトすることも併せて行っている（以下、同じ）。 When the write data and P parity data / Q parity data are written to the HDD (FIG. 5 (5)), the access history recording unit 24 writes the time stamp and bitmap data to the sector redundant area. Do it at the same time. As a result, the sector redundancy area of each sector of HDD (1), HDD (4), and HDD (5) to which the write data and P parity data / Q parity data are written is as shown in FIG. The same access history information is recorded. When writing the time stamp and bitmap data to the sector redundancy area, the access history recording unit 24 calculates the CRC of the write data and P parity data / Q parity data and writes the CRC to each sector redundancy area. (The same applies hereinafter).

（ケース２：図６）
このケースでは、ＨＤＤ（１）、ＨＤＤ（２）にライトデータがライトされ、これに伴って、ＨＤＤ（４）のＰパリティデータ、ＨＤＤ（５）のＱパリティデータが更新（ライト）される。そこで、アクセス履歴記録部２４は、”１１０１１”というビットマップデータを生成する。 (Case 2: Fig. 6)
In this case, write data is written to the HDD (1) and HDD (2), and accordingly, the P parity data of the HDD (4) and the Q parity data of the HDD (5) are updated (written). Therefore, the access history recording unit 24 generates bitmap data “11011”.

また、図６の例では、Ｐパリティデータ／Ｑパリティデータが、ｎｅｗデータを用いてパリティ生成時と同じ計算を行う方法で再計算されている。そのために、ＨＤＤ（３）からのデータリードが行われるが、ＨＤＤ（３）へのデータライトは行われないため、アクセス履歴情報は記録されない。よって、このＨＤＤ（３）を除いた、ライトデータおよびＰパリティデータ／ＱパリティデータがライトされたＨＤＤ（１）、ＨＤＤ（２）、ＨＤＤ（４）、ＨＤＤ（５）の各セクタのセクタ冗長領域に、図６（Ｂ）に示すような、同一のアクセス履歴情報が記録されることになる。 In the example of FIG. 6, P parity data / Q parity data is recalculated by using the same calculation method as that for parity generation using new data. For this reason, data reading from the HDD (3) is performed, but data writing to the HDD (3) is not performed, so that access history information is not recorded. Therefore, sector redundancy of each sector of HDD (1), HDD (2), HDD (4), and HDD (5) to which write data and P parity data / Q parity data are written, excluding this HDD (3). The same access history information as shown in FIG. 6B is recorded in the area.

（ケース３：図７）
このケースは、図５に示したケース１と同様、ＨＤＤ（１）にライトデータがライトされ、これに伴って、ＨＤＤ（４）のＰパリティデータ、ＨＤＤ（５）のＱパリティデータが更新（ライト）されるというものである。しかしながら、ＨＤＤ（１）が故障状態にあるために、ＨＤＤ（１）へのライトが行えない状況にある。 (Case 3: Fig. 7)
In this case, as in the case 1 shown in FIG. 5, the write data is written to the HDD (1), and accordingly, the P parity data of the HDD (4) and the Q parity data of the HDD (5) are updated ( Is written). However, since the HDD (1) is in a failure state, the HDD (1) cannot be written.

そのために、ここでは、図７に示すように、ライト対象のｏｌｄデータとｎｅｗデータとを使った計算方法ではなく、ｎｅｗデータを用いてパリティ生成時と同じ計算を行う方法でＰパリティデータ／Ｑパリティデータを再計算し、このＰパリティデータ／ＱパリティデータのみのライトをＨＤＤ（４）、ＨＤＤ（５）に対して行っている。従って、ＨＤＤ（１）へのライトは実際には行っていない。 For this purpose, as shown in FIG. 7, the P parity data / Q is not calculated by using the old data and new data to be written, but by the same calculation as that at the time of parity generation using new data. Parity data is recalculated, and writing of only the P parity data / Q parity data is performed on the HDD (4) and the HDD (5). Therefore, the writing to the HDD (1) is not actually performed.

この場合でも、アクセス履歴記録部２４は、ＨＤＤ（１）はライトが行われるべきであったとして、”１００１１”というビットマップデータを作成する。ＨＤＤ（１）は故障状態にあるので、図７（Ｂ）に示すような、同一のアクセス履歴情報が、ＨＤＤ（４）、ＨＤＤ（５）に記録されることになる。 Even in this case, the access history recording unit 24 creates bitmap data “10011” on the assumption that the HDD (1) should be written. Since the HDD (1) is in a failure state, the same access history information as shown in FIG. 7B is recorded in the HDD (4) and the HDD (5).

次に、図８を参照して、このようにアクセス履歴情報が記録される本実施形態のディスクアレイ装置において、データ不正箇所特定部２５が、ＨＤＤが１台故障の縮減状態時に整合性確認部２２が不整合を検出した場合のデータ不正箇所の特定をどのように実現しているのかについて説明する。 Next, referring to FIG. 8, in the disk array device of this embodiment in which the access history information is recorded in this way, the data illegal location specifying unit 25 is a consistency checking unit when one HDD is reduced. A description will be given of how data illegal portions are identified when 22 detects inconsistencies.

データ不正箇所特定部２５は、まず、ＰパリティとＱパリティのアクセス履歴情報を読み出し（ステップＡ１）、これらが一致しているかを調べる（ステップＡ２）。もし、一致していれば（ステップＡ２のＹｅｓ）、データ不正箇所特定部２５は、このパリティストライプのビットマップを参照し、対象となる同時ライトしたＨＤＤ３からアクセス履歴情報をリードする（ステップＡ３）。そして、データ不正箇所特定部２５は、データＨＤＤのアクセス履歴情報とパリティＨＤＤのアクセス履歴情報を比較し（ステップＡ４）、情報が一致しない矛盾があるＨＤＤ３を見つけてＲＡＩＤ機能にてデータ訂正する（ステップＡ５）。 The illegal data location specifying unit 25 first reads the access history information of the P parity and the Q parity (step A1) and checks whether they match (step A2). If they match (Yes in step A2), the illegal data location specifying unit 25 refers to the parity stripe bitmap and reads the access history information from the target simultaneously written HDD 3 (step A3). . Then, the illegal data location specifying unit 25 compares the access history information of the data HDD and the access history information of the parity HDD (step A4), finds the HDD 3 having a contradiction where the information does not match, and corrects the data using the RAID function ( Step A5).

一方、ＰパリティとＱパリティのアクセス履歴情報が一致していなければ（ステップＡ２のＮｏ）、データ不正箇所特定部２５は、第１に、Ｐパリティのアクセス履歴情報のビットマップデータをもとに対象となる同時ライトしたＨＤＤ３からアクセス履歴情報をリードして比較する（ステップＡ６）。ここで、データ不正箇所特定部２５は、”比較対象ＨＤＤなし”であれば比較結果Ａ、”全て一致”であれば比較結果Ｂ、”不一致”であれば比較結果Ｃ、とする分類を行う（ステップＡ７）。 On the other hand, if the access history information of the P parity and the Q parity do not match (No in step A2), the data illegal location specifying unit 25 firstly based on the bitmap data of the access history information of the P parity. Access history information is read from the simultaneously written HDD 3 to be compared and compared (step A6). Here, the illegal data location specifying unit 25 classifies the comparison result A if “no comparison target HDD”, the comparison result B if “all match”, and the comparison result C if “no match”. (Step A7).

また、データ不正箇所特定部２５は、第２に、Ｑパリティのアクセス履歴情報のビットマップデータをもとに対象となる同時ライトしたＨＤＤ３からアクセス履歴情報をリードして比較する（ステップＡ８）。ここでも、データ不正箇所特定部２５は、”比較対象ＨＤＤなし”であれば比較結果Ｄ、”全て一致”であれば比較結果Ｅ、”不一致”であれば比較結果Ｆ、とする分類を行う（ステップＡ９）。 Secondly, the illegal data location specifying unit 25 reads and compares the access history information from the simultaneously written HDD 3 based on the bitmap data of the Q parity access history information (step A8). Here again, the data illegal location specifying unit 25 classifies the comparison result D if “no comparison target HDD”, comparison result E if “all match”, and comparison result F if “no match”. (Step A9).

そして、データ不正箇所特定部２５は、この第１の比較結果の分類と第２の比較結果の分類との組み合わせに基づき、図９に示すように、データ不正箇所の判定および訂正を実行する（ステップＡ１０）。 Then, based on the combination of the classification of the first comparison result and the classification of the second comparison result, the data illegal location specifying unit 25 determines and corrects the data illegal location as shown in FIG. 9 ( Step A10).

このように、本実施形態のディスクアレイ装置は、セクタ冗長領域確保部２３によってセクタ冗長領域を確保し、（この確保したセクタ冗長領域に）アクセス履歴記録部２４によってアクセス履歴情報を記録するので、データ不正箇所特定部２５による、これまでは不可能であった、ＨＤＤが１台故障の縮減状態時に整合性確認部２２が不整合を検出した場合のデータ不正箇所の特定を実現する。 As described above, the disk array device according to the present embodiment secures the sector redundancy area by the sector redundancy area securing unit 23 and records the access history information by the access history recording unit 24 (in this secured sector redundancy area). The data illegal location specifying unit 25 realizes specification of the data illegal location when the consistency checking unit 22 detects inconsistency when the HDD is in a reduced state of failure, which was impossible until now.

ところで、以上では、このアクセス履歴情報の用途として、整合性確認部２２によりデータ不整合が検出された場合のデータ不正箇所の特定に利用する例を説明した。しかしながら、このアクセス履歴情報は、これだけに限定されず、例えばデータライト時の検証にも利用することができる。そのために、本実施形態のディスクアレイ装置では、簡易整合性確認部２６を備える。 By the way, as described above, as an application of the access history information, an example has been described in which it is used for specifying an illegal data portion when a data inconsistency is detected by the consistency check unit 22. However, the access history information is not limited to this, and can be used for verification at the time of data write, for example. For this purpose, the disk array device of this embodiment includes a simple consistency confirmation unit 26.

簡易整合性確認部２６は、ライト処理の過程において動作し、ＰパリティやＱパリティを使った高負荷の計算を実行することなく、当該ライト処理が正常に行われたかどうかを検証するものである。図１０を参照して、この簡易整合性確認部２６が実行する簡易整合性確認の手順を説明する。 The simple consistency check unit 26 operates in the process of write processing, and verifies whether or not the write processing has been performed normally without executing high load calculation using P parity or Q parity. . With reference to FIG. 10, the procedure of the simple consistency confirmation which this simple consistency confirmation part 26 performs is demonstrated.

簡易整合性確認部２６は、まず、両パリティストライプをリードし、アクセス履歴情報が一致しているか確認する（ステップＢ１のＹｅｓ，ステップＢ２）。もし、一致していなければ（ステップＢ３のＮｏ）、この時点で、不整合が生じていると判定する（ステップＢ８）。 The simple consistency confirmation unit 26 first reads both parity stripes and confirms whether the access history information matches (Yes in Step B1, Step B2). If they do not match (No in step B3), it is determined that a mismatch has occurred at this point (step B8).

一方、アクセス履歴情報が一致していれば（ステップＢ３のＹｅｓ）、簡易整合性確認部２６は、ビットマップデータを参照し、同時ライト対象のＨＤＤからアクセス履歴情報をリードしてパリティストライプの情報と一致しているか確認する（ステップＢ４，ステップＢ５）。そして、一致していなければ（ステップＢ５のＮｏ）、不整合が生じていると判定する（ステップＢ８）。 On the other hand, if the access history information matches (Yes in step B3), the simple consistency check unit 26 refers to the bitmap data, reads the access history information from the HDD to be simultaneously written, and information on the parity stripe. (Step B4, step B5). If they do not match (No in step B5), it is determined that a mismatch has occurred (step B8).

また、一致していれば（ステップＢ５のＹｅｓ）、簡易整合性確認部２６は、データ保護情報（ＣＲＣ）を使ってデータ自体のチェックを行い（ステップＢ６）、エラーが検出されたら（ステップＢ７のＮｏ）、不整合が生じていると判定する（ステップＢ８）。 If they match (Yes in step B5), the simple consistency check unit 26 checks the data itself using the data protection information (CRC) (step B6), and if an error is detected (step B7). No), it is determined that a mismatch has occurred (step B8).

従って、本実施形態のディスクアレイ装置が実行するデータライトの手順は、図１１に示すような流れとなる。 Therefore, the data write procedure executed by the disk array device of this embodiment is as shown in FIG.

ホスト装置１からデータライト要求を受け付けると（ステップＣ１）、どのＨＤＤのどのアドレス（セクタ）へライトすべきか判断し、対象ストライプグループのライトすべき全てのＨＤＤの位置を示すビットマップを生成すると共に（ステップＣ２）、データ書き込み時刻（タイムスタンプ）を内蔵のハードウェアなどから得る（ステップＣ３）。 When a data write request is received from the host device 1 (step C1), it is determined which address (sector) of which HDD should be written, and a bitmap indicating the positions of all HDDs to be written in the target stripe group is generated. (Step C2) The data writing time (time stamp) is obtained from the built-in hardware or the like (Step C3).

次に、必要により各々のＨＤＤからｏｌｄデータをリードし（ステップＣ４）、パリティ更新データを計算すると共に（ステップＣ５）、ライトデータおよびパリティ更新データのＣＲＣを計算する（ステップＣ６）。そして、対象セクタのデータ領域にホスト要求ライトデータ、冗長領域にタイムスタンプ、ビットマップデータおよびＣＲＣが格納されるように、バッファ領域にデータをセットして（ステップＣ７）、ＨＤＤへライト要求を発行する（ステップＣ８）。 Next, if necessary, old data is read from each HDD (step C4), parity update data is calculated (step C5), and CRC of write data and parity update data is calculated (step C6). Then, data is set in the buffer area so that the host request write data is stored in the data area of the target sector, and the time stamp, bitmap data, and CRC are stored in the redundant area (step C7), and a write request is issued to the HDD. (Step C8).

すべてのライト対象ＨＤＤへのデータライトが完了したら（ステップＣ９のＹＥＳ）、図１０に詳細な手順を示した、アクセス履歴情報を使った簡易整合性確認を実施する（ステップＣ１０）。もし、不整合と判定したら（ステップＣ１１のＹＥＳ）、ライト処理を再実行し（ステップＣ１２）、整合と判定したら（ステップＣ１１のＮＯ）、ホスト装置１に対してデータライト完了を応答する（ステップＣ１３）。 When data writing to all write target HDDs is completed (YES in step C9), a simple consistency check using the access history information shown in the detailed procedure in FIG. 10 is performed (step C10). If it is determined that it is inconsistent (YES in step C11), the write process is re-executed (step C12). If it is determined that it is consistent (NO in step C11), a data write completion response is returned to the host device 1 (step S11). C13).

データライト時に簡易整合性確認で不整合が検出できれば、ＲＡＩＤコントローラ２中にライトすべきデータが存在しているので、リカバリが短時間で可能である。また、簡易整合性確認自体も、パリティ計算を必要としないので、応答性の大幅な低下を招くといったことがない。さらに、このアクセス履歴情報の一致確認をハードウェアで行うようにすれば、高速に処理することが可能である。 If inconsistency can be detected by simple consistency check at the time of data writing, the data to be written exists in the RAID controller 2, so that recovery is possible in a short time. In addition, the simple consistency check itself does not require a parity calculation, so that the responsiveness is not greatly reduced. Furthermore, if this access history information match confirmation is performed by hardware, high-speed processing is possible.

また、次に、ＨＤＤが１台故障の縮減状態から正常状態に復帰した場合のアクセス履歴情報のコピー手順について説明する。 Next, a description will be given of a procedure for copying access history information when a single HDD is restored from a reduced state of failure to a normal state.

ＨＤＤ故障時、ＨＤＤの交換やホットスペアＨＤＤを使用することなどによって、故障したＨＤＤのデータをＲＡＩＤ機能により復元し、交換したＨＤＤにデータを格納することで、故障状態から回復させることができる。そこで、本実施形態のディスクアレイ装置においては、このデータ復元の際、セクタ冗長領域のアクセス履歴情報もコピーする。図１２は、このコピー手順を示すフローチャートである。 When the HDD fails, it is possible to recover from the failed state by restoring the failed HDD data by the RAID function by replacing the HDD or using a hot spare HDD and storing the data in the replaced HDD. Therefore, in the disk array device of this embodiment, the access history information of the sector redundant area is also copied at the time of this data restoration. FIG. 12 is a flowchart showing this copy procedure.

まず、Ｐ，Ｑどちらかアクセス可能な方のパリティストライプをリードし、アクセス履歴情報のビットマップデータをコピーする（ステップＤ１のＹｅｓ，ステップＤ２）。データ復元先のＨＤＤが、そのビットマップ情報でライト先ＨＤＤの対象となっている場合は（ステップＤ３のＹｅｓ）、タイムスタンプもパリティストライプのタイムスタンプをコピーする（ステップＤ４）。一方、ライト先ＨＤＤの対象になっていない場合は（ステップＤ３のＮｏ）、タイムスタンプをゼロとして格納する（ステップＤ５）。 First, the P or Q accessible parity stripe is read, and the bitmap data of the access history information is copied (Yes in step D1, step D2). If the data restoration destination HDD is the target of the write destination HDD in the bitmap information (Yes in step D3), the time stamp of the parity stripe is also copied (step D4). On the other hand, if it is not the target of the write destination HDD (No in step D3), the time stamp is stored as zero (step D5).

故障などで両パリティストライプがリードできない場合は（ステップＤ１のＮｏ）、データストライプのアクセス履歴情報のタイムスタンプが最新のものを見つけて、その情報をコピーする（ステップＤ６）。そして、データ復元先のＨＤＤが、そのビットマップ情報でライト先ＨＤＤの対象となっている場合は（ステップＤ７のＹｅｓ）、タイムスタンプもデータストライプのタイムスタンプをコピーし（ステップＤ８）、一方、ライト先ＨＤＤの対象になっていない場合は（ステップＤ７のＮｏ）、タイムスタンプをゼロとして格納する（ステップＤ９）。 If both parity stripes cannot be read due to a failure or the like (No in step D1), the latest data stamp access history information time stamp is found and the information is copied (step D6). If the data restoration destination HDD is the target of the write destination HDD in the bitmap information (Yes in step D7), the time stamp also copies the time stamp of the data stripe (step D8), If it is not the target of the write destination HDD (No in step D7), the time stamp is stored as zero (step D9).

これにより、有効データの復元時に、セクタ冗長領域のアクセス履歴情報も復元されることになる。なお、データ保護情報（ＣＲＣ）は、復元した有効データそれぞれから再計算されることになる。 As a result, the access history information of the sector redundancy area is also restored at the time of restoring the valid data. The data protection information (CRC) is recalculated from each restored valid data.

以上のように、本実施形態のディスクアレイ装置によれば、データライト抜けによるデータ不正をセクタ冗長領域に記録されたタイムスタンプおよびビットマップデータを使って検出できるので、ＲＡＩＤ−６を構成する複数のディスク装置の中の１台のディスク装置が故障状態にあっても、整合性確認で不整合が検出された場合には、データ不正箇所を特定すること等を可能とする。さらに、ビット化けによるデータ不正もセクタ冗長領域に記録されたＣＲＣを使って検出することを可能とする。 As described above, according to the disk array device of the present embodiment, data fraud due to data write omission can be detected using the time stamp and bitmap data recorded in the sector redundancy area. Even if one of the disk devices is in a failed state, if an inconsistency is detected by the consistency check, it is possible to specify an illegal data portion. Furthermore, it is possible to detect data fraud due to garbled bits using the CRC recorded in the sector redundancy area.

なお、以上では、ＲＡＩＤ−６形式のＲＡＩＤについて説明したが、セクタ冗長領域を確保してアクセス履歴情報を記録するという手法は、ＲＡＩＤ−６形式以外のＲＡＩＤについても適用することが可能である。 In the above, RAID-6 format RAID has been described. However, the technique of securing sector redundancy areas and recording access history information can also be applied to RAIDs other than RAID-6 format.

また、例えばホスト装置１からのデータアクセスが無い空き時間などを利用して、データ整合性確認を周期的に実施する整合性確認部２２は、前述のように、読み出したデータからパリティデータを再計算して比較することによって整合性を確認するのが一般的であるが、本実施形態のディスクアレイ装置では、簡易整合性確認部２６と同様に、セクタ冗長領域に記録されたタイムスタンプ、ビットマップデータおよびＣＲＣを使って効率的に整合性を確認することも可能である（つまり、整合性確認部２２を設けず、簡易整合性確認部２６に周期的な整合性確認も行わせることも可能となる）。特に、ビットマップデータは、データライト時の検証よりも、この周期的な整合性確認の際に有用である。どのＨＤＤ３にライトを行ったかは、データライト直後であればＲＡＩＤコントローラ２のメモリ上に記憶されているので、これを利用して判断可能であるが、（データライトが既に過去の事となっている）周期的な整合性確認の際には、当該ビットマップデータが存在するからこそ判断可能となるからである。 In addition, the consistency check unit 22 that periodically performs data consistency check using, for example, free time when there is no data access from the host device 1 re-creates parity data from the read data as described above. In general, the consistency is confirmed by calculating and comparing, but in the disk array device of the present embodiment, the time stamp and the bit recorded in the sector redundancy area as in the simple consistency confirmation unit 26. It is also possible to efficiently check the consistency by using the map data and the CRC (that is, the consistency checker 22 is not provided, and the simple consistency checker 26 can also periodically check the consistency. Possible). In particular, the bitmap data is more useful for checking the periodic consistency than the verification at the time of data writing. Which HDD 3 has been written is stored in the memory of the RAID controller 2 immediately after the data write, and can be determined using this, but (the data write has already occurred in the past. This is because it can be determined because the bitmap data exists in the periodic consistency check.

このように、本発明は、上記実施形態そのままに限定されるものではなく、実施段階ではその要旨を逸脱しない範囲で構成要素を変形して具体化できる。また、上記実施形態に開示されている複数の構成要素の適宜な組み合わせにより種々の発明を形成できる。例えば、実施形態に示される全構成要素から幾つかの構成要素を削除してもよい。更に、異なる実施形態に構成要素を適宜組み合わせてもよい。 As described above, the present invention is not limited to the above-described embodiment as it is, and can be embodied by modifying the constituent elements without departing from the scope of the invention in the implementation stage. In addition, various inventions can be formed by appropriately combining a plurality of components disclosed in the embodiment. For example, some components may be deleted from all the components shown in the embodiment. Furthermore, you may combine a component suitably in different embodiment.

本発明の実施形態に係るディスクアレイ装置の構成例を示す図1 is a diagram showing a configuration example of a disk array device according to an embodiment of the present invention. ＲＡＩＤ−６のデータ配置例を示す図The figure which shows the data arrangement example of RAID-6 ＲＡＩＤが縮退状態（ＨＤＤが１台故障）の場合のデータ整合性確認を説明するための図The figure for demonstrating the data consistency check in case a RAID is in a degenerate state (one HDD has failed) 同実施形態のディスクアレイ装置において、セクタ冗長領域確保部によって確保されたセクタ冗長領域に、アクセス履歴記録部によってアクセス履歴情報が記録される様子を示す概念図The conceptual diagram which shows a mode that access history information is recorded by the access history recording part in the sector redundant area ensured by the sector redundant area securing part in the disk array apparatus of the embodiment. 同実施形態のディスクアレイ装置が実行するデータライトに伴うセクタ冗長領域へのアクセス履歴情報の記録例（ケース１）を示す図A diagram showing a recording example (case 1) of access history information to a sector redundant area accompanying a data write executed by the disk array device of the same embodiment 同実施形態のディスクアレイ装置が実行するデータライトに伴うセクタ冗長領域へのアクセス履歴情報の記録例（ケース２）を示す図A diagram showing a recording example (case 2) of access history information to a sector redundant area accompanying a data write executed by the disk array device of the same embodiment 同実施形態のディスクアレイ装置が実行するデータライトに伴うセクタ冗長領域へのアクセス履歴情報の記録例（ケース３）を示す図A diagram showing a recording example (case 3) of access history information to a sector redundant area accompanying a data write executed by the disk array device of the same embodiment 同実施形態のディスクアレイ装置が実行する、ＨＤＤが１台故障の縮減状態時に不整合が検出された場合のデータ不正箇所の特定手順を示すフローチャートA flowchart executed by the disk array device according to the embodiment, showing a procedure for identifying an illegal data portion when an inconsistency is detected when one HDD is in a reduced state of failure. 同実施形態のディスクアレイ装置によるデータ不正箇所の判定および訂正を説明するための図The figure for demonstrating the determination and correction of a data illegal location by the disk array apparatus of the embodiment 同実施形態のディスクアレイ装置が実行する簡易整合性確認の手順を示すフローチャートA flowchart showing a procedure of simple consistency check executed by the disk array device of the same embodiment 本実施形態のディスクアレイ装置が実行するデータライトの手順を示すフローチャートA flowchart showing a data write procedure executed by the disk array device of this embodiment 同実施形態のディスクアレイ装置が実行する、データ復元の際のセクタ冗長領域のアクセス履歴情報のコピーの手順を示すフローチャートA flowchart showing a procedure of copying access history information of a sector redundant area at the time of data restoration executed by the disk array device of the same embodiment

Explanation of symbols

１…ホスト装置、２…ＲＡＩＤコントローラ、３…ＨＤＤ、４…ＲＡＩＤ、２１…制御部、２２…整合性確認部、２３…セクタ冗長領域確保部、２４…アクセス履歴記録部、２５…データ不正箇所特定部、２６…簡易整合性確認部。 DESCRIPTION OF SYMBOLS 1 ... Host device, 2 ... RAID controller, 3 ... HDD, 4 ... RAID, 21 ... Control part, 22 ... Consistency confirmation part, 23 ... Sector redundant area reservation part, 24 ... Access history recording part, 25 ... Data illegal location Specific part, 26... Simple consistency confirmation part.

Claims

In a disk array device that records data and redundant data of the data in a plurality of disk devices in a distributed manner,
Redundancy for recording access history information related to writing of the data or the redundant data by dividing the recording area of each of the plurality of disk devices with a data length larger than the actual recording unit of the data and the redundant data Redundant area securing means for securing an area for each area of each of the plurality of disk devices;
When writing data, each stripe group using the same redundant data, to the redundant area of zone 1 or 2 or more sections of the redundant area and the data redundancy data the data is recorded is recorded, the stripe Access history information recording means for recording the same access history information including bitmap data indicating an area in which the data or the redundant data is recorded synchronously among all the areas constituting the group ;
When one of the plurality of disk devices is in a failed state, inconsistency is detected in the consistency check using the redundant data in the stripe group in which the area in the failed disk device is arranged. When detected, the fact that the data was recorded synchronously with the redundant data by the access history information recorded in the redundant area of the area where the redundant data is recorded and the bitmap data included in the access history information By reading and comparing the access history information recorded in the redundant area of the area indicated by, the data illegal location specifying means for specifying the data illegal location,
A disk array device comprising:

2. The disk array device according to claim 1, wherein the access history information recording means records a time stamp as access history information in a redundant area of each section.

2. The disk array device according to claim 1, wherein the access history information recording means further records data protection information for detecting an error in the data or the redundant data in a redundant area of each section.

4. The disk array device according to claim 3, wherein the access history information recording means records CRC (cyclic redundancy check) as data protection information in a redundant area of each section.

RAID (redundant array of inexpensive disks) disk array device according to claim 1, wherein that it is an shall make up the 6.

Immediately after the data is written, it comprises simple consistency confirmation means for confirming whether or not all the access history information of the redundant area of the area where the data and the redundant data of the data are recorded is the same. The disk array device according to claim 1.

7. The disk array device according to claim 6 , further comprising a control unit that re-executes data writing when a mismatch in access history information is detected by the simple consistency checking unit.

A data management method for a disk array device that records data and redundant data of the data in a distributed manner on a plurality of disk devices,
Redundancy for recording access history information related to writing of the data or the redundant data by dividing the recording area of each of the plurality of disk devices with a data length larger than the actual recording unit of the data and the redundant data An area is secured for each area of each of the plurality of disk devices,
When writing data, each stripe group using the same redundant data, to the redundant area of zone 1 or 2 or more sections of the redundant area and the data redundancy data the data is recorded is recorded, the stripe Record the same access history information including bitmap data indicating areas where the data or the redundant data is recorded synchronously among all the areas constituting the group ,
When one of the plurality of disk devices is in a failed state, inconsistency is detected in the consistency check using the redundant data in the stripe group in which the area in the failed disk device is arranged. When detected, the fact that the data was recorded synchronously with the redundant data by the access history information recorded in the redundant area of the area where the redundant data is recorded and the bitmap data included in the access history information By identifying and comparing the access history information recorded in the redundant area in the area indicated by
A data management method for a disk array device.

9. The data management method for a disk array device according to claim 8 , wherein data protection information for detecting an error in the data or the redundant data is further recorded in a redundant area of each section.

Said disk array device, a data management method for a disk array device according to claim 8, wherein a constitutes a RAID6.

Immediately after the data is written, it is checked whether all access history information in the redundant area of the area where the data and the redundant data of the data are recorded matches, and if a mismatch of the access history information is detected, the data To re-write
9. A data management method for a disk array device according to claim 8, wherein: