
Communications and Protocols


Presentation Transcript


  1. Communications and Protocols: Active Networks and Active Object Storage • John A. Chandy, Department of Electrical and Computer Engineering • Janardhan Singaraju, Ajith Thamarakuzhi, Cengiz Karakoyunlu, Orko Momin, Mike Runde, Paul Wortman

  2. Active Storage Networks • Active Disks • Intelligence at the disk can distribute computation to parallel disks • Process data in streams • Disks only have local view of data • Active Storage Network • Network has a global view of data • Distributed caching of file system metadata and data • Redundancy optimizations Active Storage Networks HEC FSIO Workshop August 9, 2011

  3. Active storage networks • An ASN consists of a smart switch along with intelligence embedded in the I/O network. • Network switches have a global view of the data and can perform in-stream data reduction and transformation operations. • An ASN can enhance storage node performance as well as improve the computational performance of parallel I/O systems. [Figure: network diagram with links labeled 1 Gbps and 1/3 Gbps]

  4. Network switch topology • 2-dilated flattened butterfly [Figure: switch topology diagram with numbered nodes]

  5. Hardware Implementation • NetFPGA board from Stanford • 4 GigE ports • 2 SATA connectors for node to node communication • PCI bus for node to node communication

  6. Active Storage Networks • Application operations • Reduction operations - min/max, k-means clustering, search • Transformational operations - streaming, sort • File System Operations • Locking • Redundancy optimizations
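
The reduction operations listed above (min/max, k-min, search) all share one property: they can be computed in a single pass over the data as it flows through the switch, with only a small amount of state. The following is a minimal software sketch of that idea, not the authors' hardware design. The function names (`stream_min_max`, `stream_k_min`, `stream_search`) are hypothetical and chosen only for illustration.

```python
import heapq
from typing import Iterable, Iterator, List, Tuple


def stream_min_max(values: Iterable[float]) -> Tuple[float, float]:
    """Single-pass min/max: only two values of state are kept."""
    it = iter(values)
    try:
        lo = hi = next(it)
    except StopIteration:
        raise ValueError("empty stream")
    for v in it:
        if v < lo:
            lo = v
        elif v > hi:
            hi = v
    return lo, hi


def stream_k_min(values: Iterable[float], k: int) -> List[float]:
    """Return the k smallest values seen, keeping at most k items of state.

    A max-heap of the current k smallest values is simulated by
    storing negated values in Python's min-heap.
    """
    if k <= 0:
        return []
    heap: List[float] = []
    for v in values:
        if len(heap) < k:
            heapq.heappush(heap, -v)
        elif v < -heap[0]:
            # v beats the largest of the current k smallest; swap it in.
            heapq.heapreplace(heap, -v)
    return sorted(-x for x in heap)


def stream_search(records: Iterable[bytes], needle: bytes) -> Iterator[bytes]:
    """Forward only the records that contain the search pattern."""
    for record in records:
        if needle in record:
            yield record
```

In each case the output is much smaller than the input, which is why reductions are the natural fit for in-network processing: the switch forwards only the result instead of the full stream.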

  7. Parallelization techniques • Functional units are re-used on reaching the reconfigurable hardware area limits. • Data level parallelism by distributing the data to several functional units in several switch elements. • Functional level parallelism by distributing functions to several elements.
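
As an illustration of data-level parallelism, the sketch below runs one k-means iteration over several data shards. Each shard stands in for the data handled by one functional unit, which produces per-cluster partial sums and counts; the partial results are then merged to update the centroids. The slides do not describe how the k-means kernel was actually partitioned, so this decomposition, and the names `partial_sums` and `kmeans_iteration`, are assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def nearest(point: Point, centroids: Sequence[Point]) -> int:
    """Index of the centroid closest to point (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda c: sum((p - q) ** 2 for p, q in zip(point, centroids[c])),
    )


def partial_sums(shard: Sequence[Point], centroids: Sequence[Point]):
    """Work done by one (hypothetical) functional unit on its own shard."""
    dim = len(centroids[0])
    sums = [[0.0] * dim for _ in centroids]
    counts = [0] * len(centroids)
    for point in shard:
        c = nearest(point, centroids)
        counts[c] += 1
        for d in range(dim):
            sums[c][d] += point[d]
    return sums, counts


def kmeans_iteration(
    shards: Sequence[Sequence[Point]], centroids: Sequence[Point]
) -> List[Point]:
    """Merge per-shard partial results and return the updated centroids."""
    dim = len(centroids[0])
    total_sums = [[0.0] * dim for _ in centroids]
    total_counts = [0] * len(centroids)
    # In hardware the shards would be processed concurrently; here the
    # loop is sequential, but the merge step is the same.
    for shard in shards:
        sums, counts = partial_sums(shard, centroids)
        for c in range(len(centroids)):
            total_counts[c] += counts[c]
            for d in range(dim):
                total_sums[c][d] += sums[c][d]
    return [
        tuple(s / total_counts[c] for s in total_sums[c])
        if total_counts[c]
        else tuple(centroids[c])  # keep an empty cluster's centroid unchanged
        for c in range(len(centroids))
    ]
```

Only the per-cluster sums and counts cross between units, so the merge traffic depends on the number of clusters rather than the number of data points.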

  8. K-means clustering

  9. Runtime per iteration

  10. Data search

  11. Kmin/Kmax

  12. Data sort

  13. Redundancy optimizations • RAIDed files, parity calculated in switch
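
The slide does not say which RAID scheme is used. Assuming single-parity XOR (as in RAID 5), the parity computation a switch could perform on the data blocks passing through it looks like the sketch below. The function names `xor_parity` and `rebuild_block` are hypothetical.

```python
from typing import Sequence


def xor_parity(blocks: Sequence[bytes]) -> bytes:
    """Byte-wise XOR of equally sized blocks (assumed single-parity scheme)."""
    if not blocks:
        raise ValueError("need at least one block")
    length = len(blocks[0])
    if any(len(b) != length for b in blocks):
        raise ValueError("all blocks must have the same length")
    parity = bytearray(length)
    for block in blocks:
        for i, byte in enumerate(block):
            parity[i] ^= byte
    return bytes(parity)


def rebuild_block(surviving: Sequence[bytes], parity: bytes) -> bytes:
    """Recover one missing block by XOR-ing the survivors with the parity."""
    return xor_parity(list(surviving) + [parity])
```

Because the switch already sees every block of a stripe on its way to the storage nodes, computing parity there would avoid a separate read-modify-write round trip by the client.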

  14. File locking • Lock table in switch
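
A lock table held in the switch can be pictured as a map from file identifier to the current lock mode and its holders. The sketch below assumes shared/exclusive semantics and a non-blocking, non-reentrant interface; neither is stated in the slides, and the `LockTable` class is purely illustrative.

```python
from typing import Dict, Set, Tuple


class LockTable:
    """Toy shared/exclusive lock table keyed by file identifier."""

    def __init__(self) -> None:
        # file_id -> (mode, holders), where mode is "S" (shared) or "X" (exclusive)
        self._locks: Dict[str, Tuple[str, Set[str]]] = {}

    def acquire(self, file_id: str, client: str, exclusive: bool) -> bool:
        """Try to take the lock; return False instead of blocking."""
        entry = self._locks.get(file_id)
        if entry is None:
            self._locks[file_id] = ("X" if exclusive else "S", {client})
            return True
        mode, holders = entry
        if not exclusive and mode == "S":
            holders.add(client)  # shared locks can be held concurrently
            return True
        return False

    def release(self, file_id: str, client: str) -> bool:
        """Drop the client's hold; remove the entry when no holders remain."""
        entry = self._locks.get(file_id)
        if entry is None or client not in entry[1]:
            return False
        entry[1].discard(client)
        if not entry[1]:
            del self._locks[file_id]
        return True
```

Since every client's requests pass through the switch, a table like this can grant or refuse a lock without a round trip to a separate lock server.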

  15. Active Storage Networks • Lessons Learned • Hardware design is hard • HW libraries can help • ASNs make most sense for reductions • Storage system optimizations show promise • What needs to get done • Better HW design • Application and FS hooks • When to do ASN and when to do SW?

  16. Active Object Storage • Active Disks • Intelligence at the disk can distribute computation to parallel disks • Active Object Storage for Parallel File Systems • Active Disks for OSDs • Use Active Storage to improve parallel file system performance • Use Active Storage to improve parallel file system reliability • Application aware storage and autonomic storage using active OSDs.

  17. Active Disks • Can we use OSDs to make Active Disks a reality? • Application-aware storage • Object attributes can give hints to the disk • Application specific • Parallel File Systems • Felix et al. added a filtering layer to Lustre to provide active processing • T10 OSD?

  18. Active Disks using OSD • Previous Implementation • Based on disc-osd • Object-oriented (Java) • Attach object types to storage objects • Define methods for object types • New Implementation • Based on osc-osd (supported by Panasas) • RPC - Call functions on OSD remotely • Execute Engines - C, Java, Python, etc.

  19. Active Disks using OSD • How do you move code from client to target within OSD framework? • Create an object with the code • Each function object has a special attribute that defines the type of associated execute engine • OSD can support multiple execute engines

  20. Active Disks using OSD • How do you execute the method remotely within the OSD framework? • New EXECUTE FUNCTION command so that we can invoke a function • We use the CDB continuation to specify the parameters • Results (if any) returned directly or written to a new object • (From T10/08-185r5 changes to OSD-2)
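
The flow described on the last two slides (store code as an object, tag it with an engine attribute, invoke it with an EXECUTE FUNCTION command, return the result or write it to a new object) can be mimicked in memory as follows. This is a toy model only: it does not use the osc-osd API, iSCSI, or real CDB encoding, and `ToyOSD`, `python_engine`, and the `run(params, osd)` entry-point convention are all hypothetical names introduced here.

```python
from typing import Any, Callable, Dict, Optional

# An engine takes (function object data, parameters, the OSD) and returns bytes.
Engine = Callable[[bytes, Dict[str, Any], "ToyOSD"], bytes]


class ToyOSD:
    """In-memory stand-in for an OSD target with pluggable execute engines."""

    def __init__(self) -> None:
        self.objects: Dict[int, Dict[str, Any]] = {}
        self.engines: Dict[str, Engine] = {}

    def register_engine(self, name: str, engine: Engine) -> None:
        self.engines[name] = engine

    def create_object(
        self, oid: int, data: bytes, attrs: Optional[Dict[str, Any]] = None
    ) -> None:
        self.objects[oid] = {"data": data, "attrs": dict(attrs or {})}

    def execute_function(
        self, function_oid: int, params: Dict[str, Any], result_oid: Optional[int] = None
    ) -> Optional[bytes]:
        """Toy analogue of EXECUTE FUNCTION: dispatch on the engine attribute."""
        func = self.objects[function_oid]
        engine = self.engines[func["attrs"]["engine"]]
        result = engine(func["data"], params, self)
        if result_oid is None:
            return result  # result returned directly to the caller
        self.create_object(result_oid, result)  # or written to a new object
        return None


def python_engine(code: bytes, params: Dict[str, Any], osd: ToyOSD) -> bytes:
    """Hypothetical engine: run Python source that defines run(params, osd).

    exec() on untrusted code is unsafe; a real target would need sandboxing.
    """
    namespace: Dict[str, Any] = {}
    exec(code.decode("utf-8"), namespace)
    return namespace["run"](params, osd)
```

A client would create a data object, upload a function object whose `engine` attribute is `"python"`, and then call `execute_function` with the function object's identifier and a parameter dictionary. Supporting another language means registering another engine under a different attribute value, which mirrors the slides' point that one OSD can host multiple execute engines.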

  21. Active Disks using OSD [Figure: architecture diagram showing a client connected over iSCSI to an OSD target, which holds function objects (.so, .class) and runs them in a C engine or a Java engine]

  22. Active Disks using OSD • Status: • C and Java engines complete • Python engine soon • OrangeFS support for OSDs

  23. Summary • Active storage networks • Improve performance of computation kernels • Useful in parallel file system optimizations • Active storage for improved file system performance • Acknowledgements: NSF CCF-0621448, CCF-093787

  24. Communications and Protocols • Coherence schemes • Scalable abstractions for scientific data • Scalable replication, relocation, failure detection, and fault tolerance • Topology aware storage layout • Wide area storage access protocols • Cloud storage? • Inter-stack communication? • Memory hierarchy?
