posted on 2018-06-30, 13:30authored byMichael P. Kasick, Keith A Bare, Eugene E Marinelli, Jiaqi Tan, Rajeev Ghandi, Priya Narasimhan
We present a syscall-based approach to automatically
diagnose performance problems, server-to-client
propagated errors, and server crash/hang problems
in PVFS. Our approach compares the statistical and
semantic attributes of syscalls across PVFS servers
in order to diagnose the culprit server, under these
problems, for different file-system benchmarks—dd,
PostMark and IOzone—in a PVFS cluster.