posted on 2003-06-01, 00:00authored byBrent Welsh, Marc Unangst, Zainul Abbasi, Garth Gibson, Brian Mueller, Jason Small, Jim Zelenka, Bin Zhou
The Panasas file system uses parallel and redundant
access to object storage devices (OSDs), per-file RAID,
distributed metadata management, consistent client
caching, file locking services, and internal cluster
management to provide a scalable, fault tolerant, high
performance distributed file system. The clustered
design of the storage system and the use of clientdriven
RAID provide scalable performance to many
concurrent file system clients through parallel access to
file data that is striped across OSD storage nodes.
RAID recovery is performed in parallel by the cluster
of metadata managers, and declustered data placement
yields scalable RAID rebuild rates as the storage system
grows larger. This paper presents performance
measures of I/O, metadata, and recovery operations for
storage clusters that range in size from 10 to 120
storage nodes, 1 to 12 metadata nodes, and with file
system client counts ranging from 1 to 100 compute
nodes. Production installations are as large as 500
storage nodes, 50 metadata managers, and 5000 clients.