1 / 25

Disk & RAID

Disk & RAID. 2012-10-05. The first HDD (1956). IBM 305 RAMAC 4 MB 50x24” disks 1200 rpm 100 ms access 35k$/y rent Included computer & accounting software (tubes not transistors). 10 years later. 1.6 meters. Transportation of HDD. 1 inch disk drive!. 2000 IBM MicroDrive :

kassia
Download Presentation

Disk & RAID

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Disk & RAID 2012-10-05

  2. The first HDD (1956) • IBM 305 RAMAC • 4 MB • 50x24” disks • 1200 rpm • 100 ms access • 35k$/y rent • Included computer & accounting software(tubes not transistors)

  3. 10 years later 1.6 meters

  4. Transportation of HDD

  5. 1 inch disk drive! • 2000 IBM MicroDrive: • 1.7” x 1.4” x 0.2” • 1 GB, 3600 RPM, 5 MB/s, 15 ms seek • Digital camera, PalmPC? • 2006 MicroDrive • 8 GB, 50 MB/s!

  6. The internal look of HDD (now)

  7. Data access of HDD • Access Time = Seek Time + Rotational Delay + Transfer Time

  8. Redundant Array of Inexpensive Disks (RAID): 1987-1993 • Randy Katz and David Patterson: “Use many PC disks to build better storage?” • RAID I built on 1st SPARC, 28 disks • RAID II custom HW, 144 disks • Today, RAID ~$25B industry • RAID students join industry and academia, started own companies (VMware, Panassas)

  9. The RAID paper • D. A. Patterson, G. Gibson, and R. H. Katz, "A case for redundant arrays of inexpensive disks (RAID)," in SIGMOD'88 Proceedings of the 1988 ACM SIGMOD International Conference on Management of Data, 1988, vol. 17, no. 3, pp. 109-116. • One of the important publications in computer science. http://en.wikipedia.org/wiki/List_of_important_publications_in_computer_science • EMC, HP, IBM, NetApp… have produced so many RAID-related storage products.

  10. Better Storage? • Capacity? • Performance? • Availability? • ……

  11. RAID introduction • A RAID is a Redundant Array of Inexpensive Disks. • In industry, “I” is for “Independent” • The alternative is SLED, single large expensive disk • Disks are small and cheap, so it’s easy to put lots of disks (10s to 100s) in one box for increased storage, performance, and availability. • The RAID box with a RAID controller looks just like a SLED to the computer. Data plus some redundant information is Striped across the disks in some way. • How that Striping is done is key to performance and reliability----Different RAID levels 0-5, 6…

  12. RAID0 • Level 0 is non-redundant disk array • Files are Striped across disks, no redundant info • High read throughput • Best write throughput (no redundant info to write) • Any disk failure results in data loss • Reliability worse than SLED Stripe 0 Stripe 1 Stripe 2 Stripe 3 Stripe 7 Stripe 4 Stripe 5 Stripe 6 Stripe 8 Stripe 11 Stripe 9 Stripe 10 data disks

  13. RAID1 • Mirrored Disks, data is written to two places • On failure, just use surviving disk • On read, choose fastest to read • Write performance is same as single drive, read performance is 2x better • Expensive Stripe 0 Stripe 1 Stripe 2 Stripe 3 Stripe 0 Stripe 1 Stripe 2 Stripe 3 Stripe 7 Stripe 7 Stripe 4 Stripe 5 Stripe 6 Stripe 4 Stripe 5 Stripe 6 Stripe 8 Stripe 11 Stripe 8 Stripe 11 Stripe 9 Stripe 10 Stripe 9 Stripe 10 data disks mirror copies

  14. RAID4 • Block-level parity with Stripes • A read accesses all the data disks • A write accesses all data disks plus the parity disk • Heavy load on the parity disk Stripe 0 Stripe 1 Stripe 2 Stripe 3 P0-3 Stripe 7 Stripe 4 Stripe 5 Stripe 6 P4-7 Stripe 8 Stripe 11 P8-11 Stripe 9 Stripe 10 Parity disk data disks

  15. RAID5 • Block Interleaved Distributed Parity • Like parity scheme, but distribute the parity info over all disks (as well as data over all disks) • Better read performance, large write performance Stripe 0 Stripe 1 Stripe 2 Stripe 3 P0-3 Stripe 6 P4-7 Stripe 4 Stripe 5 Stripe 7 Stripe 8 Stripe 10 Stripe 11 P8-11 Stripe 9 data and parity disks

  16. RAID6 • Level 5 with an extra parity • Can tolerate two failures • What are the odds of having two concurrent failures? • May outperform Level-5 on reads, slower on writes Stripe 0 Stripe 1 Stripe 2 Stripe 3 P0-3 Q0-3 Stripe 6 P4-7 Stripe 4 Stripe 5 Q4-7 Stripe 7 Stripe 8 Q8-11 Stripe 10 Stripe 11 P8-11 Stripe 9 data and parity disks

  17. Comparison of RAIDs Note: S indicates the capacity of a single disk, N indicates the number of the disks in a RAID set.

  18. RAID4/5/6 Reconstruction

  19. Trend & Discussion… • Disk replace Tape? • Flash replace Disk? • RAID is dead?

  20. Backup slides

  21. Backup slides

  22. Backup

  23. Backup slides

  24. Backup slides

  25. Backup slides

More Related