Object-based Storage Long Liu 2010-10-23
Outline • Why do we need object based storage? • What is object based storage? • How to take advantage of it? • What's the status of object based storage? • What can we do about it?
Background • Existing enterprise storage infrastructures are under strain • The volume of data generated by network-based applications continues to escalate
Background Two technologies: • Files – Flexible data sharing – Secure • Blocks – High performance – Scalable
Comparison • Objects can be regarded as the convergence of two technologies: files and blocks • Object-based disk – Operations: create object, delete object, read object, write object – Addressing: [object, byte range] • Block-based disk – Operations: read block, write block – Addressing: block range
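The difference between the two interfaces can be sketched in a few lines of Python. This is only an in-memory illustration, not a real driver: the class names `BlockDisk` and `ObjectDisk`, the method names, and the 512-byte default block size are assumptions made for the example. The only things taken from the slide are the operation lists and the two addressing schemes.

```python
class BlockDisk:
    """Block-based disk: read/write whole blocks, addressed by block range."""

    def __init__(self, num_blocks, block_size=512):
        self.block_size = block_size
        self.blocks = [bytes(block_size) for _ in range(num_blocks)]

    def read_blocks(self, start, count):
        # Addressing is a block range: [start, start + count)
        return b"".join(self.blocks[start:start + count])

    def write_blocks(self, start, data):
        if len(data) % self.block_size != 0:
            raise ValueError("data must be a whole number of blocks")
        count = len(data) // self.block_size
        if start < 0 or start + count > len(self.blocks):
            raise IndexError("block range out of bounds")
        for i in range(count):
            lo = i * self.block_size
            self.blocks[start + i] = data[lo:lo + self.block_size]


class ObjectDisk:
    """Object-based disk: create/delete/read/write objects,
    addressed by [object, byte range]."""

    def __init__(self):
        self.objects = {}
        self._next_id = 1

    def create_object(self):
        object_id = self._next_id
        self._next_id += 1
        self.objects[object_id] = bytearray()
        return object_id

    def delete_object(self, object_id):
        del self.objects[object_id]

    def read_object(self, object_id, offset, length):
        return bytes(self.objects[object_id][offset:offset + length])

    def write_object(self, object_id, offset, data):
        buf = self.objects[object_id]
        if len(buf) < offset:
            # Zero-fill any gap; the device, not the host, tracks the layout.
            buf.extend(bytes(offset - len(buf)))
        buf[offset:offset + len(data)] = data
```

With the object interface the host never sees block numbers: it names an object and a byte range, and space allocation stays inside the device.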
Comparison • Traditional storage (top to bottom): Applications, File System, Logical Block Interface, Hard Drive • Object-based storage (top to bottom): Applications, File System, Object Interface, Storage component, Object-based Storage Device (OSD)
Comparison • Two changes: – Object-based storage offloads the storage component to the storage device – The device interface changes from blocks to objects [Figure: (a) Traditional model and (b) Object storage model. Layers shown: Applications, System Call Interface, File System User Component, File System Storage Component, Block I/O Manager, Storage Device; the interface to the device is the Block Interface in (a) and the Object Interface in (b).]
Motivation • Improved device and data sharing – Platform-dependent metadata moved to device • Improved scalability & security – Devices directly handle client requests – Object security • Improved performance – Data types can be differentiated at the device • Improved storage management – Self-managed, policy-driven storage – Storage devices become more autonomous
Outline • Why do we need object based storage? • What is object based storage? • How to take advantage of it? • What's the status of object based storage? • What can we do about it?
Object-based Storage • Object • OSD (Object-based Storage Device) • MDS (Metadata Server)
Object • An object is a logical unit of storage, made up of – ID (identification) – Application data – Metadata, which includes block allocation and length – Attributes that are accessible by users • Objects have file-like methods – open, close, read, write
Object • The root object -- The OSD itself • User object -- Created by SCSI commands from the application or client • Collection object -- A group of user objects, such as all .mp3 files • Partition object -- A container for objects that share common security and space management characteristics
Object [Figure: object hierarchy on an OSD. Root Object (one per device); Partition Objects (P1 to P4); Collection Objects; User Objects (for user data, e.g. U1), each with an Object ID, User Data, Metadata, and Attributes.]
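As a rough illustration of how the four object types nest, the following Python sketch models a root object that holds partitions, which in turn hold user objects and collections. All class and field names, and the byte quota used to stand in for "space management", are assumptions made for the example; the slides only name the object types and their roles.

```python
from dataclasses import dataclass, field


@dataclass
class UserObject:
    """User object: ID, application data, and user-accessible attributes."""
    object_id: int
    data: bytearray = field(default_factory=bytearray)
    attributes: dict = field(default_factory=dict)

    @property
    def length(self):
        # Length is metadata the device derives itself.
        return len(self.data)


@dataclass
class Partition:
    """Partition object: objects sharing one space-management policy."""
    partition_id: int
    quota_bytes: int  # hypothetical stand-in for space management
    user_objects: dict = field(default_factory=dict)
    collections: dict = field(default_factory=dict)  # name -> set of IDs

    def used_bytes(self):
        return sum(obj.length for obj in self.user_objects.values())

    def create_object(self, object_id, data=b"", **attributes):
        if self.used_bytes() + len(data) > self.quota_bytes:
            raise ValueError("partition quota exceeded")
        obj = UserObject(object_id, bytearray(data), dict(attributes))
        self.user_objects[object_id] = obj
        return obj

    def add_to_collection(self, name, object_id):
        # A collection is a named group of user objects in this partition.
        if object_id not in self.user_objects:
            raise KeyError(object_id)
        self.collections.setdefault(name, set()).add(object_id)


@dataclass
class RootObject:
    """Root object: one per device, i.e. the OSD itself."""
    partitions: dict = field(default_factory=dict)

    def create_partition(self, partition_id, quota_bytes):
        partition = Partition(partition_id, quota_bytes)
        self.partitions[partition_id] = partition
        return partition
```

For example, a partition with a 16-byte quota accepts a 3-byte object tagged `suffix=".mp3"`, which can then be added to an `"mp3"` collection; a second object that would exceed the quota is rejected.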
Object-based Storage • Object • OSD (Object-based Storage Device) • MDS (Metadata Server)
Object Storage Devices • Two changes: – Object-based storage offloads the storage component to the storage device – The device interface changes from blocks to objects [Figure: (a) Traditional model and (b) Object storage model. Layers shown: Applications, System Call Interface, File System User Component, File System Storage Component, Block I/O Manager, Storage Device; the interface to the device changes from blocks in (a) to the Object Interface in (b).]
Object Storage Devices • Expect a wide variety of Object Storage Devices • Disk array subsystem • 2 SATA disks (240/500 GB) • Highly integrated, single disk • 4 Gbps per shelf to cluster • Orchestrates system activity • Balances objects across OSDs • Stores up to 5 TB per shelf
Object-based Storage • Object • OSD (Object-based Storage Device) • MDS (Metadata Server)
Dataflow of Metadata [Figure: Clients (Applications, Client File System), Metadata Servers (MDS, Metadata Manager), and Object Storage Devices (OSDs, each an Object-based Storage Device holding the storage component).]
Outline • Why do we need object based storage? • What is object based storage? • How to take advantage of it? • What's the status of object based storage? • What can we do about it?
Interface • An OSD Client talks to an OSD Target through the OSD Interface • T10 OSD commands: CREATE/REMOVE, GET/SET ATTR, READ/WRITE • An object holds user data and attribute pages – Opaque attributes (stored only) – Shared attributes (stored & processed)
OSD Commands (Interface) • Security – Authorization – Integrity – SET KEY – SET MASTER KEY • Groups – CREATE COLLECTION – REMOVE COLLECTION – LIST COLLECTION • Management – CREATE PARTITION – REMOVE PARTITION – FLUSH PARTITION – PERFORM SCSI COMMAND – PERFORM TASK MGMT • Basic Protocol – READ – WRITE – CREATE – REMOVE – GET ATTR – SET ATTR • Specialized – APPEND – CREATE & WRITE – FLUSH – LIST • Slide annotations: very basic; shared secrets; space mgmt • Attributes – opaque – internal – shared
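The basic protocol and the three attribute kinds can be illustrated with a toy in-memory target. This is a sketch under stated assumptions, not the T10 wire protocol: the class `OsdTarget`, the internal attribute `length`, and the shared attribute `read_only` are invented for the example. The slides only say that opaque attributes are stored, shared attributes are stored and processed, and that an internal kind exists.

```python
class OsdTarget:
    """Toy in-memory OSD target covering the basic commands."""

    INTERNAL_ATTRS = {"length"}   # assumed: maintained by the device itself
    SHARED_ATTRS = {"read_only"}  # hypothetical: stored AND acted on

    def __init__(self):
        self._data = {}
        self._attrs = {}

    def create(self, object_id):
        if object_id in self._data:
            raise KeyError(f"object {object_id} already exists")
        self._data[object_id] = bytearray()
        self._attrs[object_id] = {}

    def remove(self, object_id):
        del self._data[object_id]
        del self._attrs[object_id]

    def write(self, object_id, offset, payload):
        # A shared attribute changes device behaviour.
        if self._attrs[object_id].get("read_only"):
            raise PermissionError("object is marked read_only")
        buf = self._data[object_id]
        if len(buf) < offset:
            buf.extend(bytes(offset - len(buf)))
        buf[offset:offset + len(payload)] = payload

    def read(self, object_id, offset, length):
        return bytes(self._data[object_id][offset:offset + length])

    def set_attr(self, object_id, name, value):
        if name in self.INTERNAL_ATTRS:
            raise PermissionError(f"{name} is maintained by the device")
        # Any other name is opaque: stored, never interpreted.
        self._attrs[object_id][name] = value

    def get_attr(self, object_id, name):
        if name == "length":
            return len(self._data[object_id])
        return self._attrs[object_id][name]
```

Setting an arbitrary attribute such as `owner_tag` only stores it, while setting `read_only` makes later `write` calls fail, which is the stored-only versus stored-and-processed distinction in miniature.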
Storage Technology Today • Direct Attached Storage (DAS) • Fabric Attached Storage (FAS) – Network Attached Storage (NAS) – Storage Area Networks (SAN)
Direct Attached Storage [Figure: a traditional Direct Attached Storage model, showing Windows and UNIX hosts on a LAN, RAID units, and a backup server with tape.]
Fabric Attached Storage [Figure: client/server (C/S) Fabric Attached Storage, showing clients, a server, and data.]
Network Attached Storage • This figure illustrates NAS being used to share files among a number of clients • The files themselves may be stored on a fast SAN [Figure: Clients, File I/O, IP network, File server, Block I/O, Storage area network, Block storage.]
Storage Area Networks • This figure illustrates a SAN file system • The files themselves are stored on a fast storage area network to which the clients are also attached [Figure: Clients, Servers, Storage area network, and Block-based storage devices; arrows labeled Metadata, Data, and Management.]
Security [Figure: Clients, Metadata Servers, and Object-based Storage Devices connected by a network; arrows labeled Attribute, Capability, Data, and Management.]
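The figure labels a "Capability" passing between the parties, and the command slide mentions shared secrets. One common way to realize that idea is sketched below, purely as an assumption about how it could work: the metadata server signs a small capability with a key it shares with the OSD, and the OSD checks the signature before serving a request. The function names, the JSON encoding, the HMAC-SHA256 choice, and the `expires_at` field are all illustrative and are not taken from the slides.

```python
import hashlib
import hmac
import json

SHARED_SECRET = b"demo-secret"  # hypothetical key known only to MDS and OSD


def _sign(body, key):
    # Canonical JSON so both sides sign exactly the same bytes.
    payload = json.dumps(body, sort_keys=True).encode("utf-8")
    return hmac.new(key, payload, hashlib.sha256).hexdigest()


def issue_capability(object_id, allowed_ops, expires_at, key=SHARED_SECRET):
    """MDS side: state what the client may do, then sign it."""
    body = {
        "object_id": object_id,
        "ops": sorted(allowed_ops),
        "expires_at": expires_at,
    }
    return {"body": body, "tag": _sign(body, key)}


def osd_check(capability, object_id, op, now, key=SHARED_SECRET):
    """OSD side: verify the signature, then the request against the body."""
    body = capability["body"]
    expected = _sign(body, key)
    if not hmac.compare_digest(expected, capability["tag"]):
        return False  # forged or altered capability
    return (
        body["object_id"] == object_id
        and op in body["ops"]
        and now < body["expires_at"]
    )
```

In this sketch the client never learns the secret: it only forwards the capability it was given, so the OSD can authorize requests on its own without contacting the metadata server for each one.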
Outline • Why do we need object based storage? • What is object based storage? • How to take advantage of it? • What's the status of object based storage? • What can we do about it?
Status • Industrial – Lustre – Panasas • Academic
Lustre • First open-source system with object storage • High-performance parallel file system • Consists of clients, MDS, and OSTs (Object Storage Targets)
Lustre [Figure: Clients, a Metadata Server Cluster, and Object Storage Targets (OSTs) connected by a network; arrows labeled Metadata, Data, and Management.]
Panasas • High-performance file system • Consists of OSDs, the Panasas File System, and MDS
Panasas • Key Object Storage Features – Intelligent space management in storage layer: media-geometry-aware placement; data-aware prefetching, caching & recovery – Encapsulation of data and attributes: native object interface, good programming model; storage-interpreted attributes for per-file properties • Key Object Storage Advantages – Robust, shared access by many clients – Scalable performance via an offloaded data path – Strong fine-grained end-to-end security
Panasas • Customers come from the energy, government, finance, manufacturing, and higher-education sectors
Status • Industrial – Lustre – Panasas • Academic
A Design of Metadata Server Cluster In Large Distributed Object-based Storage Motivation: • The metadata server cluster may be the bottleneck • Frequent metadata access and movement • Poor load balancing
A Design of Metadata Server Cluster In Large Distributed Object-based Storage [Figure: Object-based Storage System Architecture. An Application Server Cluster (VoD, Web, E-mail, File, and Database servers), an MDS Cluster, and an Object-based Storage Device Cluster on a storage network (Fibre Channel); arrows labeled Metadata, Data, and Security.]
A Design of Metadata Server Cluster In Large Distributed Object-based Storage • Hashing Partition: a total solution for – File hashing – Metadata partitioning – Metadata storage [Figure 3. Hashing Partition. Components shown: Application Servers, Application, File Hashing Manager, Mapping Manager, Metadata Server Cluster, Logical Partition Manager, Metadata Server Backend, Common Storage Space.]
A Design of Metadata Server Cluster In Large Distributed Object-based Storage • Pathname: /Dir1/Dir2/filename • ① Filename hashing • ② Selecting MDS through Mapping Manager • ③ Accessing metadata by pathname hashing result • ④ Returning metadata to application server [Figure 4. Metadata Access Pattern. Components shown: Pathname, Pathname Hashing Results (i) and (i+1), Mapping Manager, Metadata Server Cluster, Logical Partitions, Metadata & etc.]
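Steps ① to ④ can be sketched as follows. The slides do not specify the hash function, the number of logical partitions, or how partitions are assigned to servers, so SHA-1, eight partitions, and a round-robin assignment are all assumptions made for the example, as are the names `hash_pathname` and `MetadataCluster`.

```python
import hashlib


def hash_pathname(pathname, num_partitions):
    """Step 1: hash the pathname to a logical partition index."""
    digest = hashlib.sha1(pathname.encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % num_partitions


class MetadataCluster:
    """Toy model: mapping manager plus a common store of partitions."""

    def __init__(self, mds_names, num_partitions=8):
        self.num_partitions = num_partitions
        # Mapping manager: logical partition -> owning MDS (round-robin,
        # an assumed policy).
        self.mapping = {
            p: mds_names[p % len(mds_names)] for p in range(num_partitions)
        }
        # Common storage space: partition -> {pathname: metadata}.
        self.partitions = {p: {} for p in range(num_partitions)}

    def store(self, pathname, metadata):
        p = hash_pathname(pathname, self.num_partitions)
        self.partitions[p][pathname] = metadata

    def lookup(self, pathname):
        p = hash_pathname(pathname, self.num_partitions)  # step 1
        mds = self.mapping[p]                             # step 2
        metadata = self.partitions[p][pathname]           # step 3
        return mds, metadata                              # step 4
```

Because the partition index depends only on the pathname, any application server can compute it locally and go straight to the right metadata server.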
A Design of Metadata Server Cluster In Large Distributed Object-based Storage [Figure 5. MDS cluster failover procedure. Components shown: Hashing Partition, Mapping Manager, Metadata Server Cluster, Logical Partitions, Common Storage Space, with steps numbered 1 to 4.]
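The slide shows the failover figure without spelling out its four steps, so the following is only a guess at the core idea: if metadata lives in a common storage space, recovering from a failed metadata server could amount to pointing its logical partitions at surviving servers in the mapping manager. The function name `fail_over` and the round-robin reassignment are assumptions, not the authors' procedure.

```python
def fail_over(mapping, failed_mds, surviving_mds):
    """Return a new partition -> MDS mapping with the failed server's
    logical partitions spread round-robin over the survivors."""
    if not surviving_mds:
        raise ValueError("no surviving metadata servers")
    new_mapping = dict(mapping)  # leave the caller's mapping untouched
    orphaned = sorted(p for p, owner in mapping.items() if owner == failed_mds)
    for i, partition in enumerate(orphaned):
        new_mapping[partition] = surviving_mds[i % len(surviving_mds)]
    return new_mapping
```

No metadata is copied in this sketch: only the mapping changes, which is what a shared backend store would make possible.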
Outline • Why do we need object based storage? • What is object based storage? • How to take advantage of it? • What's the status of object based storage? • What can we do about it?
The End Thank you