posted on 2010-02-01, 00:00authored byHrishikesh Amur, James Cipar, Varun Gupta, Gregory GangerGregory Ganger, Michael A Kozuch, Karsten Schwan
Power-proportional cluster-based storage is an important component of an overall cloud computing infrastructure. With it, substantial subsets of nodes in the storage cluster can be turned off to save power during periods of low utilization. Rabbit is a distributed
file system that arranges its data-layout to provide ideal power-proportionality down to very low minimum number of powered-up
nodes (enough to store a primary replica of available datasets). Rabbit addresses the node failure rates of large-scale clusters with
data layouts that minimize the number of nodes that must be powered-up if a primary fails. Rabbit also allows different datasets to
use different subsets of nodes as a building block for interference avoidance when the infrastructure is shared by multiple tenants.
Experiments with a Rabbit prototype demonstrate its power-proportionality, and simulation experiments demonstrate its properties
at scale.