160 likes | 175 Views
This survey explores the integration of SSDs in large-scale storage systems, including their benefits, challenges, and potential future directions. It covers different storage architectures, the use of SSDs in cloud storage, and case studies of SSD integration in HPC clusters.
E N D
Data Storage Systems: A Survey Abdullah Aldhamin July 29, 2013 CMPT 880: Large-Scale Multimedia Systems and Cloud Computing Course Project
Motivation • Research interest in storage systems, specifically in SSDs
Outline • Objective • Overview • Solid-State Drives Use Cases
Objective • Storage system architectures in enterprise data centers. • What is cloud storage? • Integrating flash-based solid-state drives in large-scale storage systems.
Overview • Different storage architectures in data centers: • Block I/O interface (DAS and SAN) • File I/O interface (NAS) • Is there a “better” solution? • Shortcomings for today’s computing…
Overview • Cloud Storage: • What is cloud storage? • Object-based storage • Example: Windows Azure Storage (WAS) • Some research problems
Overview • Flash-based solid state drive: • What is it? • Pros and Cons. • How can we integrate it in large-scale storage systems? • Future direction
Integrating SSDs in Large-Scale Storage Systems • Considerations and Facts: • Non-uniform read access latencies, correlated with workload access pattern • Internal drive-specific operations impacts the performance • Internal fragmentation leads to performance degradation • More writes write amplification bad wear leveling • Performance vs. Lifetime vs. Cost
Gordon: SSD-only HPC Cluster • The first HPC cluster designed with SSD-only storage • Optimized to utilize SSDs high bandwidth, for data-intensive applications
Gordon … (Cont’d) • Design goals: • Reduce performance gap between processor and I/O in large-scale data-intensive computing • Improve the system performance • Less power
Gordon… (Cont’d) • How SSD is integrated? • Replaced conventional hard disks with SSDs • Major device-level modification: New flash translation layer
Gordon… (Cont’d) • Costly $$$ • Not suitable for widespread adoption • Requires major device level modification • Optimized for specific workloads
Hybrid: Griffin • Griffin hybrid storage system • SSD is an end-point store for the data • Uses HDDs as write-back buffers • Log-structured HDDs to buffer incoming writes • Extends SSD lifetime
Hybrid: Hystor • Hybrid storage system • SSD is used to improve I/O performance • Read cache • Write-back buffer • Challenge: • What data to should be cached to gain from SSD performance and improve performance? • Minimum system changes.
Conclusion • The choice of I/O interface allows for different storage access features • Cloud storage continues to grow to accommodate for the overflowing of data collected • Solid state drive has become an instrumental player in storage systems: but how can we best use it?