Monday, February 25, 2013

Shark: Scaling File Servers via Cooperative Caching

by S. Annapureddy et al., NSDI 2005.

Abstract:
Network file systems offer a powerful, transparent interface for accessing remote data. Unfortunately, in current network file systems like NFS, clients fetch data from a central file server, inherently limiting the system’s ability to scale to many clients. While recent distributed (peer-to-peer) systems have managed to eliminate this scalability bottleneck, they are often exceedingly complex and provide non-standard models for administration and accountability. We present Shark, a novel system that retains the best of both worlds—the scalability of distributed systems with the simplicity of central servers.
Shark is a distributed file system designed for large-scale, wide-area deployment, while also providing a drop-in replacement for local-area file systems. Shark introduces a novel cooperative-caching mechanism, in which mutually-distrustful clients can exploit each others’ file caches to reduce load on an origin file server. Using a distributed index, Shark clients find nearby copies of data, even when files originate from different servers. Performance results show that Shark can greatly reduce server load and improve client latency for read-heavy workloads both in the wide and local areas, while still remaining competitive for single clients in the local area. Thus, Shark enables modestly-provisioned file servers to scale to hundreds of read-mostly clients while retaining traditional usability, consistency, security, and accountability.

Link to the full paper:
http://www.cse.buffalo.edu/faculty/tkosar/cse710_spring13/papers/shark.pdf

Comments:

  1. In Shark, when a node retrieves a file, it becomes a proxy for further access requests for that file by other nodes. How does this single proxy node handle multiple access and write requests?
    Wouldn't the proxy be saturated with multiple requests?

    Replies
    1. Once a second client fetches the copy, it also registers itself as a proxy for that chunk. This new proxy is then ready to transfer the chunk to other clients. This is how the load is distributed among the clients.
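
      A minimal Python sketch of that registration step, assuming a DSHT-style index with put/get; the names (SharkClient, read_chunk) are illustrative, not the paper's:

        class SharkClient:
            def __init__(self, index, server):
                self.index = index    # distributed index: chunk id -> proxies
                self.server = server  # origin file server, used as a fallback
                self.cache = {}       # locally cached chunks

            def read_chunk(self, chunk_id):
                # Serve a cached chunk to another client (proxy role).
                return self.cache.get(chunk_id)

            def fetch_chunk(self, chunk_id):
                # Try registered proxies first; fall back to the origin server.
                data = None
                for proxy in self.index.get(chunk_id):
                    data = proxy.read_chunk(chunk_id)
                    if data is not None:
                        break
                if data is None:
                    data = self.server.read_chunk(chunk_id)
                self.cache[chunk_id] = data
                # Register as a proxy so later requests for this chunk can be
                # served by this client instead of the origin server.
                self.index.put(chunk_id, self)
                return data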

  2. How does a Shark client ensure that it is fetching chunks from a nearby client?

    Replies
    1. Clients form overlay clusters characterized by RTT (round-trip time). A query first goes to the client's own cluster; if the chunk is found within that cluster, the client downloads it from there. This is how the system ensures locality awareness.
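
      A rough sketch of that locality preference, with an assumed rtt() probe and cluster threshold (both illustrative, not the paper's actual mechanism):

        CLUSTER_RTT_MS = 20  # assumed cluster radius; the real value may differ

        def pick_proxies(candidates, rtt):
            # Prefer proxies inside the client's RTT-defined cluster;
            # fall back to farther proxies (or the origin server) if none.
            nearby = [p for p in candidates if rtt(p) <= CLUSTER_RTT_MS]
            return nearby if nearby else candidates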

  3. If the nearest client holds only part of the file while a farther client holds the complete file, how does Shark handle this scenario?

    Replies
    1. The client downloads the file in parallel by spawning k threads, fetching it chunk by chunk from multiple clients. If a proxy also holds subsequent chunks, the client reuses the connection to download them.
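
      A minimal sketch of the k-thread parallel fetch using Python's standard thread pool; fetch_chunk and the connection-reuse detail are simplifying assumptions:

        from concurrent.futures import ThreadPoolExecutor

        def parallel_fetch(fetch_chunk, chunk_ids, k=8):
            # k workers pull chunks concurrently, possibly from different
            # proxies; a worker can keep its proxy connection open when that
            # proxy also holds the next chunk it is assigned.
            with ThreadPoolExecutor(max_workers=k) as pool:
                chunks = list(pool.map(fetch_chunk, chunk_ids))
            return b"".join(chunks)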

  4. If several different clients ask for a lease on the same file, what will the file server do?

    Replies
    1. The lease offered by the server is a promise to notify the client of modifications for a given time, e.g. 5 minutes. The server grants a lease to each requester, even if the file is already leased by another client.
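
      A toy sketch of such non-exclusive read leases, assuming clients expose an invalidate() callback; the 300-second length mirrors the 5-minute example above:

        import time

        LEASE_SECONDS = 300  # e.g. 5 minutes, as in the reply above

        class FileServer:
            def __init__(self):
                self.leases = {}  # path -> {client: lease expiry time}

            def grant_lease(self, path, client):
                # Read leases are not exclusive: every requester gets one.
                self.leases.setdefault(path, {})[client] = time.time() + LEASE_SECONDS

            def on_modify(self, path):
                # Notify every client whose lease is still live that its
                # cached copy is now stale.
                now = time.time()
                for client, expiry in self.leases.get(path, {}).items():
                    if expiry > now:
                        client.invalidate(path)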

  5. Why does Shark use a sloppy DHT (several values for the same key) instead of a regular DHT for its distributed index?

    Replies
    1. There is a one-to-many relationship between a chunk and its proxies. A DSHT provides a similar interface to a DHT, except that a key may have multiple values: put(key, value) stores a value under key, and get(key) need only return some subset of the values stored. This suits Shark because many proxies register for the same chunk.
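
      An in-memory toy illustrating that interface (not the actual distributed implementation):

        import random

        class SloppyDHT:
            def __init__(self):
                self.table = {}  # key -> set of values

            def put(self, key, value):
                # Store value under key without displacing earlier values.
                self.table.setdefault(key, set()).add(value)

            def get(self, key, limit=3):
                # Returning only some subset of the stored values is enough:
                # any registered proxy can serve the chunk.
                values = list(self.table.get(key, ()))
                return random.sample(values, min(limit, len(values)))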

  6. In what order should we fetch the chunks of a file? It is unclear to me when to use random fetch and when to use sequential fetch.

    Replies
    1. The paper describes an experiment the authors conducted, which found that random fetch is better suited to parallel downloads and increases throughput.
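
      As a small sketch of the intuition: randomizing the chunk order makes concurrent clients start on different chunks, so they can serve one another sooner (names are illustrative):

        import random

        def fetch_order(chunk_ids, parallel=True):
            # Random order for parallel downloads: concurrent clients grab
            # different chunks first and can then trade them. A lone client
            # streaming a file front to back keeps sequential order.
            order = list(chunk_ids)
            if parallel:
                random.shuffle(order)
            return order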
