posted on 1972-01-01, 00:00authored byMark Holland, Garth Gibson, Daniel P Siewiorek
The performance of traditional RAID Level 5 arrays is, for many applications, unacceptably poor
while one of its constituent disks is non-functional. This paper describes and evaluates mechanisms by
which this disk array failure-recovery performance can be improved. The two key issues addressed are
the data layout, the mapping by which data and parity blocks are assigned to physical disk blocks in an
array, and the reconstruction algorithm, which is the technique used to recover data that is lost when a
component disk fails.
The data layout techniques this paper investigates are variations on the declustered parity organization,
a derivative of RAID Level 5 that allows a system to trade some of its data capacity for improved
failure-recovery performance. Parity declustering improves the failure-mode performance of an array
significantly, and a parity-declustered architecture is preferable to an equivalent-size multiple-group
RAID Level 5 organization in environments where failure-recovery performance is important. The presented
analyses also include comparisons to a RAID Level 1 (mirrored disks) approach.
With respect to reconstruction algorithms, this paper describes and briefly evaluates two alternatives
termed stripe-oriented reconstruction and disk-oriented reconstruction, and establishes that the latter
is preferable as it provides faster reconstruction. The paper then revisits a set of previously-proposed
reconstruction optimizations, evaluating their efficacy when used in conjunction with the disk-oriented
algorithm. The paper concludes with a section on the reliability vs. capacity trade-off that must be
addressed when designing large arrays.