A case for scaling HPC metadata performance through de-specialization

Patil, Swapnil; Ren, Kai; Gibson, Garth

doi:10.1184/R1/6587219.v1

journal contribution

posted on 1988-01-01, 00:00 authored by Swapnil Patil, Kai Ren, Garth Gibson

The lack of a highly scalable, parallel metadata service is the Achilles heel of many cluster file system deployments in both the HPC world and the Internet services world. This is because most cluster file systems have focused on scaling the data path, i.e., providing high-bandwidth parallel I/O to files that are gigabytes in size. But with the proliferation of massively parallel applications that produce metadata-intensive workloads, such as large numbers of simultaneous file creates and large-scale storage management, cluster file systems also need to scale metadata performance. To that end, this paper makes a case for a scalable metadata service middleware that layers on existing cluster file system deployments and distributes file system metadata, including the namespace tree, small directories, and large directories, across many servers. Our key idea is to combine a concurrent indexing technique for distributing metadata with a tabular, on-disk representation of all file system metadata.

Date

1988-01-01

Keywords

computer sciences

Licence

In Copyright
