Wednesday, February 6, 2013

GPFS: A Shared-Disk File System for Large Computing Clusters


by F. Schmuck et al., FAST 2002.

Abstract:
GPFS is IBM’s parallel, shared-disk file system for cluster computers, available on the RS/6000 SP parallel supercomputer and on Linux clusters. GPFS is used on many of the largest supercomputers in the world. GPFS was built on many of the ideas that were developed in the academic community over the last several years, particularly distributed locking and recovery technology. To date it has been a matter of conjecture how well these ideas scale. We have had the opportunity to test those limits in the context of a product that runs on the largest systems in existence. While in many cases existing ideas scaled well, new approaches were necessary in many key areas. This paper describes GPFS, and discusses how distributed locking and recovery techniques were extended to scale to large clusters.

Link to the full paper:
http://www.cse.buffalo.edu/faculty/tkosar/cse710_spring13/papers/gpfs.pdf

13 comments:

  1. With byte-range tokens, multiple parallel writes to the same file are possible, so how does the metadata server handle these concurrent requests? Are there any issues associated with this?

    Replies
    1. Assume node1, node2, ... arrive in that order and start writing to the same file at offsets c1, c2, ... (with c1 < c2 < c3 < ...). Then:
      T1: node1 holds the token for the range (0, infinity)
      T2: node1: (0, c2), node2: (c2, infinity)
      T3: node1: (0, c2), node2: (c2, c3), node3: (c3, infinity)
      and so on. A Python sketch of this range-splitting logic is given at the end of this reply.

      According to the measurements in the paper (taken on a 32-node IBM system with 480 disks), write throughput leveled off after 17 nodes due to a problem in the switch adapter microcode.
      The main point of the measurements, however, was that writing to a single file from multiple nodes was just as fast as each node writing to a different file, which demonstrates the effectiveness of the byte-range token protocol.
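
      Below is a minimal Python sketch of the range-splitting walkthrough above. The class, method, and data layout are illustrative assumptions on my part, not the actual GPFS token manager; it only handles exclusive write tokens that extend to infinity, as in the example.

      INF = float("inf")

      class ByteRangeTokenManager:
          def __init__(self):
              self.tokens = []  # list of (start, end, owner); end is exclusive

          def request_write(self, node, offset):
              """Grant `node` a write token for (offset, INF), shrinking or
              revoking any existing token that overlaps that range."""
              kept = []
              for start, end, owner in self.tokens:
                  if end <= offset:
                      kept.append((start, end, owner))     # no conflict
                  elif start < offset:
                      kept.append((start, offset, owner))  # keep the part below offset
                  # tokens lying entirely above offset are revoked
              kept.append((offset, INF, node))
              self.tokens = sorted(kept)
              return (offset, INF)

      mgr = ByteRangeTokenManager()
      mgr.request_write("node1", 0)      # T1: node1 (0, inf)
      mgr.request_write("node2", 1024)   # T2: node1 (0, 1024), node2 (1024, inf)
      mgr.request_write("node3", 4096)   # T3: node1 (0, 1024), node2 (1024, 4096), node3 (4096, inf)
      print(mgr.tokens)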

  2. This comment has been removed by the author.

  3. The author says that "File blocks are assigned to nodes in a round-robin fashion, so that each data block will be read or written only by one particular node. GPFS forwards read and write operations originating from other nodes to the node responsible for a particular data block". If the number of these forwarded operations increases, won't it be a potential bottleneck in the system?

    Replies
    1. Well, that depends on the block size and on how fine-grained the sharing is. For fine-grain sharing, this data-shipping approach is more efficient than distributed locking, because forwarding an operation requires fewer messages than a token exchange, and it avoids the overhead of flushing dirty data to disk when a token is revoked. A sketch of the round-robin assignment follows below.
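
      Below is a minimal Python sketch of the round-robin block-to-node assignment described in the quoted passage. The function names and the forwarding stub are illustrative assumptions, not GPFS code.

      def responsible_node(block_index, nodes):
          """Round-robin: block i is handled by node i mod N."""
          return nodes[block_index % len(nodes)]

      def write(offset, data, block_size, nodes, self_node):
          """Perform the write locally if we own the block, otherwise ship it."""
          target = responsible_node(offset // block_size, nodes)
          if target == self_node:
              return ("local write", target)
          return ("forwarded to", target)  # data shipping: send the data to target

      nodes = ["node1", "node2", "node3", "node4"]
      print([responsible_node(b, nodes) for b in range(6)])
      # ['node1', 'node2', 'node3', 'node4', 'node1', 'node2']
      print(write(5 * 262144, b"payload", 262144, nodes, "node1"))
      # ('forwarded to', 'node2')

      Because every node can compute this mapping independently, there is no central server to overload; the forwarding load is spread round-robin across the nodes.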

  4. Byte-range tokens can sometimes result in false sharing, right?
    Then how does GPFS solve this issue?

    Replies
    1. Since the smallest unit of I/O is one sector, the byte-range token granularity can be no smaller than one sector; otherwise, two nodes could write to the same sector at the same time, causing lost updates. Hence multiple nodes writing into the same data block will cause token conflicts even if the individual write operations do not overlap (“false sharing”).
      For fine-grain sharing GPFS avoids this with the data shipping mode described above, where each block is read and written by a single responsible node. For updates to the inode itself, GPFS uses a shared write lock, so multiple nodes can append to the same file concurrently, with one elected node (the metanode) responsible for writing the inode back to disk. The sketch below shows how non-overlapping writes in the same block still conflict at block granularity.
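
      A small Python illustration of the point above (the sector and block sizes are arbitrary example values, not GPFS defaults): two writes to disjoint byte ranges are independent at sector granularity but conflict once ranges are rounded to whole data blocks.

      SECTOR = 512          # smallest unit of I/O
      BLOCK = 256 * 1024    # example data block size

      def rounded_range(offset, length, granularity):
          """Round a byte range outward to multiples of granularity."""
          start = (offset // granularity) * granularity
          end = -(-(offset + length) // granularity) * granularity  # ceiling
          return (start, end)

      def overlap(a, b):
          return a[0] < b[1] and b[0] < a[1]

      w1 = (0, 4096)      # node1 writes bytes [0, 4096)
      w2 = (8192, 4096)   # node2 writes bytes [8192, 12288)

      # Disjoint at sector granularity ...
      print(overlap(rounded_range(*w1, SECTOR), rounded_range(*w2, SECTOR)))  # False
      # ... but both writes fall into the same 256 KB data block, so they conflict.
      print(overlap(rounded_range(*w1, BLOCK), rounded_range(*w2, BLOCK)))    # True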

  5. This comment has been removed by the author.

  6. How is fault tolerance achieved in GPFS?

    Replies
    1. Fault tolerance is handled at three levels:
      1) Node failures: periodic heartbeat messages are used to detect failed nodes.

      2) Communication failures: continued operation after a network partition could corrupt the file system, so the file system remains accessible only to the group containing a majority of the nodes in the cluster (see the quorum sketch below).

      3) Disk failures: files can be replicated.
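
      A one-function Python sketch of the majority rule in point 2; the function is illustrative, not GPFS code.

      def has_quorum(group_size, cluster_size):
          """A group may access the file system only if it holds a strict
          majority of the nodes in the cluster."""
          return group_size > cluster_size // 2

      # An 8-node cluster partitioned 5/3: only the larger group keeps access.
      print(has_quorum(5, 8), has_quorum(3, 8))   # True False
      # A 4/4 split leaves neither group with a majority.
      print(has_quorum(4, 8))                     # False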

  7. 1. Can you explain a little more about communication failures in a two-node configuration and about disk fencing?
    2. When the cluster is broken exactly in half, how is the failure dealt with? (In one of the cluster file systems I am aware of, an extra weight called epsilon is given to one of the nodes. When the system breaks in half, the half containing the node with the epsilon takes over.)

    Replies
    1. If a nodeset has only two nodes, then losing one of them results in a loss of quorum, and GPFS will attempt to restart its daemons on both nodes. Thus at least three nodes in a nodeset are needed to avoid shutting down the daemons on all nodes before restarting them.
      Alternatively, one can specify a single-node quorum when there are only two nodes in the nodeset. In that case, a node failure results in GPFS fencing the failed node, and the remaining node continues operation. This is an important consideration, since a GPFS cluster using RAID can have at most two nodes in a nodeset. A small decision sketch follows below.
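
      A minimal Python sketch of the behaviour described above; the function name, arguments, and return strings are illustrative assumptions, not GPFS code or configuration names.

      def on_node_failure(nodeset_size, surviving, single_node_quorum=False):
          """What the surviving nodes do after a failure, per the rules above."""
          if nodeset_size == 2 and single_node_quorum:
              # Two-node nodeset with single-node quorum: fence the failed
              # node and let the survivor continue.
              return "fence failed node, continue"
          if surviving > nodeset_size // 2:
              return "quorum held, continue"
          return "quorum lost, restart GPFS daemons"

      print(on_node_failure(2, 1))                           # quorum lost, restart GPFS daemons
      print(on_node_failure(2, 1, single_node_quorum=True))  # fence failed node, continue
      print(on_node_failure(3, 2))                           # quorum held, continue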

  8. In the case of replication, what happens when there is disk space to store only one copy of new data? In such a scenario, how is a disk failure handled?
