140 likes | 297 Views
Policies in Distributed Data Storage. Presented by: Ajay Potnis Professor: Craig E. Wills December 2000. Acknowledgements. Professor Craig E. Wills Uresh Vahalia Xiaoye Jiang Gary Ma Jiannan Zheng Yael Melman Mrunal Potnis Aditya Potnis. Goal. Study various distributed architectures
E N D
Policies in Distributed Data Storage Presented by: Ajay Potnis Professor: Craig E. Wills December 2000
Acknowledgements • Professor Craig E. Wills • Uresh Vahalia • Xiaoye Jiang • Gary Ma • Jiannan Zheng • Yael Melman • Mrunal Potnis • Aditya Potnis
Goal • Study various distributed architectures • NFS • xFS • SAN • SSFS • Conclusion
Direct Attached Storage Data Storage Unit Data on SCCI/Fibre
Direct Attached Storage • Channel I/O Speed • Can select the desired storage • No sharing • No flexibility
NFS Data Storage Unit Data and Metadata Data on SCCI/Fibre File Server
NFS • Simple client server semantics using LAN/WAN • Server is the owner of file system • Uses RPC for communication • NFS with Mount protocol makes FS locally available • Support for advisory locking using NLM • Caching in memory/on disk improves performance on clients (stale data) • Data can be shared by multiple clients • Administration is easy due to centralization of data • Server can provide scalability, high availability, online maintenance, etc.
xFS Client Client and Manager Forward Request Data Storage Unit Data Request Forward Request Data on SCCI/Fibre Data Storage Server
xFS • Server-less architecture (clients, storage servers and managers) • Distributes cache, storage and metadata management over machines • Uses a stateful protocol • Uses Active Messages instead of RPC for communication • Multiple managers could lead to more network traffic • Flexibility adding many clients, managers and storage servers • Client write data to storage server before giving it to another client to avoid consistency problems in failures • Increased circular dependencies and complexities
SAN (switched fabric) File Server Data Storage Unit Switch Data Data Data on Fibre
SAN • Connects multiple clients to multiple storage drives • Storage can be configured at will • Needs good management tools and good Sys Admin skills • No seamless sharing of same storage by multiple clients/servers
SSFS Client 1 Data on SCCI / Fibre 1.Metadata Request 2. Metadata Data Storage Unit uses SAN Data on SCCI / Fibre for NAS Server 2. Metadata 1.Metadata Request Data on SCCI / Fibre Client 2
SSFS • Client-Server-Storage semantics using NAS and SAN • NAS for sharing, SAN for channel I/O speed • File sharing among heterogeneous clients • Concurrent sharing among NAS and SAN clients • Server is the owner of metadata and storage • RPC communication for metadata exchange over network • Caching in memory improves performance on clients (stale data not possible because of callbacks) • Read/Write throughput increased since data goes at channel speed (can use multiple SCSI/fibre paths to write) • Multiple clients can use the same storage device • Scalability, high availability, transparent to applications • Failure recovery built in the protocol
Conclusion • Why not get two birds in one shot?