Flash Technology in the IBM Storage Portfolio. Ivailo Djilianov Technical Sales Specialist IBM. Agenda. Flash Technology Overview – Server Cache vs . SAN SSD vs. Dedicated IBM FlashSystem Portfolio – update Apr 2014 Detailed Architecture for FlashSystem 840/V840 Performance
Flash Technology in the IBM Storage Portfolio
Ivailo Djilianov, Technical Sales Specialist, IBM
Agenda
• Flash Technology Overview – Server Cache vs. SAN SSD vs. Dedicated
• IBM FlashSystem Portfolio – update Apr 2014
• Detailed Architecture for FlashSystem 840/V840
• Performance
• Flash Quality (SLC vs. eMLC vs. MLC)
• Added Value – 2D RAID, VSR
• FlashSystem Solutions
• Deployment Scenarios – Easy Tier, Preferred Read, Manual Data Placement
What is Flash? Consumer grade flash memory (MLC) is used to store data in everyday electronics:
Storage Latencies (fastest to slowest)
• DRAM
• X3850 X6 DDR3 Flash: 5 µs
• IBM FlashSystem: 100 µs (ex: Novotel – Mr. Bricolage, 5 min)
• 2.5” SSD (on the SAN): 1-3 ms (ex: flight Sofia – Varna, 45 min)
• SAN Disk: 3-10 ms (ex: flight Sofia – London, 4 h 30 min)
• Tape: slow
Latency Translates to Application Efficiency
Servers, applications and databases are FASTER!
Disk/Hybrid/SSD
• Time waiting for I/O (waiting for array): 5,000 µs (5 ms)
• Time processing data (server CPU): 200 µs (0.2 ms)
• Total application processing time: 5,200 µs (5.2 ms)
• CPU utilization & application efficiency: 4%
IBM FlashSystem
• Time waiting for I/O (waiting for array): 200 µs (0.2 ms)
• Time processing data (server CPU): 200 µs (0.2 ms)
• Total application processing time: 400 µs (0.4 ms)
• CPU utilization & application efficiency: 50%
What do you do with the extra time?
• Benefits & economics outweigh disk
• No application or architecture changes
• Reduce floor space, power & cooling
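The efficiency figures on this slide follow from dividing processing time by total time. A minimal sketch of that arithmetic, using only the microsecond values quoted above (the function name is illustrative, not from the deck):

```python
def cpu_efficiency(wait_us: float, process_us: float) -> float:
    """Fraction of total request time spent processing rather than waiting on I/O."""
    return process_us / (wait_us + process_us)

# Values taken from the slide: I/O wait time and CPU processing time per request.
disk = cpu_efficiency(wait_us=5000, process_us=200)   # Disk/Hybrid/SSD
flash = cpu_efficiency(wait_us=200, process_us=200)   # IBM FlashSystem

print(f"Disk/Hybrid/SSD efficiency: {disk:.1%}")   # the slide rounds this to 4%
print(f"IBM FlashSystem efficiency: {flash:.1%}")  # the slide quotes 50%
```

The model is deliberately simple: it assumes one outstanding I/O per request and no overlap between waiting and processing.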
Flash Reality
• Clients will buy flash – sooner or later
• Minimal application changes needed – non-disruptive
• Can co-exist with any other storage vendors – no holy wars with IT
• Flash could be quick and easy to deploy – various methods of deployment
• Flash has a quick ROI – at least 3X storage performance improvement
• Flash Solves Business Problems – TODAY
Performance Gap
• From 1980 to 2010, CPU performance has grown 60% per year*
• ...and yet, disk performance has grown ~5% per year during that same period**
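To see how large the performance gap becomes, the two annual growth rates can be compounded over the 30 years. This is a rough sketch that assumes constant year-over-year growth at exactly the quoted rates, which the slide does not state explicitly:

```python
YEARS = 2010 - 1980  # 30 years, per the slide

# Assumption: growth compounds annually at a constant rate.
cpu_growth = 1.60 ** YEARS    # 60% per year
disk_growth = 1.05 ** YEARS   # ~5% per year
gap = cpu_growth / disk_growth

print(f"CPU performance multiplier:  {cpu_growth:,.0f}x")
print(f"Disk performance multiplier: {disk_growth:.1f}x")
print(f"Relative gap after {YEARS} years: {gap:,.0f}x")
```

Under that assumption the CPU multiplier is on the order of a million while the disk multiplier stays in single digits.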
What is TMS?
• 34 Year History: Deep Expertise
• 30+ Patents: Strong IP Base
• 300+ Customers: In over 50 countries
• Recognized by IBM Research
Product timeline, newest first (2012 back to 2002):
• RamSan-720: 5/10 TB SLC Flash, 4 FC (8 Gb)/IB
• RamSan-820: 12-24 TB eMLC Flash, 4 FC (8 Gb)/IB
• RamSan-810: 10 TB eMLC Flash, 4 FC (8 Gb)/IB (QDR)
• RamSan-710: 5 TB SLC Flash, 4 FC (8 Gb)/IB (QDR)
• RamSan-70: 900 GB SLC Flash, PCIe x8 2.0
• RamSan-640: 8 TB SLC Flash, 10 FC (8 Gb)/IB (QDR)
• RamSan-630: 10 TB SLC Flash, 10 FC (8 Gb)/IB (QDR)
• RamSan-620: 5 TB SLC Flash, 8 FC (4 Gb)
• RamSan-20: 450 GB SLC Flash, PCIe x4
• RamSan-440: 512 GB RAM, 8 FC (4 Gb)
• RamSan-500: 2 TB SLC Flash, 64 GB RAM, 8 FC (4 Gb)
• RamSan-400: 128 GB RAM, 8 FC (4 Gb), 4 IB (4x)
• RamSan-320: 64 GB RAM, 8 FC (2 Gb)
• RamSan-210/220: 32 GB RAM, 4 FC (2 Gb)
• SAM 500: DSP/SSD, 64 GB RAM, 15 FC (1 Gb)
Earlier history:
• 1997 SAM-350/SAM-450: DSP system
• 1990 SAM-2000: DSP system; custom systems for seismic industry
• 1978 Company founded by Holly Frost; CMPS: custom SSD for Gulf Oil
IBM/TMS Shared Flash Systems - 2013 All units 1U form factor, less than 500 Watts
IBM FlashSystem 840 (announced Jan 2014)
Data Center Optimized
Macro Efficiency
• 2U
• 625 watts
• Field upgradeable, granular capacity: 4, 8, 12, 16, 20, 24, 32, 40, 48 TB options
• Intuitive standardized GUI
Enterprise Reliability
• Fully redundant and hot swappable architecture: flash modules, power supplies, batteries, interfaces, fans, RAID controllers, etc.
• Concurrent code load
• Data-at-rest encryption
MicroLatency™
• Low latency: 135/90 µs R/W
• Purpose-built, FPGA-based design
• Boosts host CPU efficiency and productivity
Extreme Performance
• 1.1M IOPS
• 8 GB/s bandwidth
• Multiple connectivity interfaces
  • 16Gb/8Gb Fibre Channel
  • 40Gb QDR InfiniBand
  • 10Gb FCoE
FlashSystem 840 Performance at-a-glance Data center optimized to deliver extreme performance, flexible capacity and total system protection Capacity Options
IBM Flash Offerings Strategy
IBM FlashSystem addresses…
1. Absolute performance for throughput, IOPS, and latency
2. Enterprise features that still maximize flash performance
IBM FlashSystem / IBM FlashSystem Solutions
• Standalone all-flash array building block
• Lowest latency
• Eliminate IO bottleneck or cap BW
• Use application-level or SDS services
• Software Defined Storage
• FlashSystem Enterprise Performance Solution
• Storwize V7000 FlashSystem Edition
• FlashSystem is a smart storage shelf
• Use traditional SAN-based services
IBM Disk and FlashSystem Portfolio
Winning Means: Knowing when, where and how to deploy Flash
Focus on Storage Solutions for Improved Data Center Economics
[Portfolio diagram. Tiers: Enterprise, Midrange, Entry. Products: XIV, DS8870, Storwize V7000, Storwize V5000, Storwize V3700. Families: Storwize Family; FlashSystem Family (FlashSystem, FlashSystem Solutions); IBM Flash Drawers, Adapters, & SSDs]
IBM FlashSystem 840
• 2U - Purpose-built/FPGA – eMLC 24nm flash chips
• Granular Capacities: 2TB and 4TB modules
  • 4, 8, 12, 16, 20, 24, 32, 40, 48 TB
• NEW GUI
• 1 or 3 year warranty
  • 24x7 – 4hrs – WW support
  • 9843 Model AE1 – 3 year
  • 9840 Model AE1 – 1 year
• IMPROVED RAS features: ALL hot swappable components
  - Flash modules, power supplies, batteries, interfaces, fans
• Concurrent code load
• Encryption
• Low latency - 135/90 µs R/W parallel design
• CPU efficiency and productivity: 625 watts
• 1.1M IOPS
• 8 GB/s bandwidth
• Multiple connectivity interfaces
  • 16Gb/8Gb Fibre Channel
  • 40Gb QDR InfiniBand
  • 10Gb FCoE
[Photo caption: Flash Modules (not SSDs)]
IBM FlashSystem 840 Internal Performance at-a-glance
Data center optimized to deliver extreme performance, flexible capacity and total system protection
Capacity Options
IBM and Business Partners Confidential use ONLY
IBM FlashSystem 840 Internal Capacity Details
IBM FlashSystem 840 Internal
• Capacity & Performance Configuration Guidelines
  • Cannot mix 2TB with 4TB modules
  • 4TB modules provide HIGHER capacity and scalability options
  • Module type has little effect on read performance
  • Write performance continues to be based primarily on number of chips & flash controllers
    • More chips and flash controllers = more write bandwidth
    • Only significant in “write-heavy” environments
    • Only significant in “sustained” write environments
  • Capacity upgrades are not yet available online, unless virtualized
IBM FlashSystem 840 (Compared to 820) Internal
• General Observations
  • 4x more front-end interface (FC) bandwidth: 128 Gb/s vs. 32 Gb/s
  • 2x flash chip density (64GiB 24nm) = double capacity per box
  • Similar FPGA design to FlashSystem 820: 12 modules, 48 flash controllers, 960 chips, etc.
  • Improved RAS features – concurrent code load, concurrent maintenance on everything
• Performance Baseline Premises
  • 2x read performance – 1.1M IOPS
  • More linear & consistent write performance – 4GB/s
  • Latency continues to be WAY AHEAD OF THE COMPETITION
    • Consistent low read latency – 135µs
    • Low write latency with scalable write performance – 90µs – 4GB/s
IBM FlashSystem 840 (Compared to 820) Internal
Improved . . .
• Performance
• Serviceability
• Management
• Connectivity
Successful Design Factors
• FPGA, Flash-Optimized design
• MicroLatency™
• VSR™ and Two-Dimensional (2D) Flash RAID
• eMLC NAND technology
• RAID-5 or RAID-0
IBM FlashSystem 840 (Compared to 820) Internal
IBM FlashSystem 840
The GUI that revolutionized storage, now available in FlashSystem 840
• Easy to install
• Easy to manage
• Easy to service
• Easy to troubleshoot
IBM FlashSystem 840
• 2U - Purpose-built/FPGA – eMLC 24nm flash chips
• Granular Capacities: 2TB and 4TB modules
  • 4, 8, 12, 16, 20, 24, 32, 40, 48 TB
• NEW GUI
• Encryption
  • Optional, licensable feature
  • AES-XTS 256-bit data-at-rest
  • Local management
  • Module-level implementation (Self-Encrypted Flash Controller)
  • Zero impact on performance
• IMPROVED RAS features: ALL hot swappable components
  - Flash modules, power supplies, batteries, interfaces, fans
• Concurrent code load
• Encryption
• Low latency - 135/90 µs R/W parallel design
• CPU efficiency and productivity: 625 watts
• 1.1M IOPS
• 8 GB/s bandwidth
• Multiple connectivity interfaces
  • 16Gb/8Gb Fibre Channel
  • 40Gb QDR InfiniBand
  • 10Gb FCoE
IBM FlashSystem 840: Hardware View
• Improved RAS features
  • Front/back accessible hot-swap flash modules, power supplies, batteries, fans, controller canisters w/ interface cards
  • Non-disruptive maintenance and firmware updates (concurrent code load)
Components: Flash Modules (12), Battery Modules (2), RAID Controllers (2), Interface Modules (4), Canisters (2), Fan Modules (4), Management Modules (2), Power Supplies (2)
IBM FlashSystem 840: Logical View
• Hardware-only data path with extremely LOW latency
• Custom FPGA-based data movement decreases latency vs. software
• Distributed out-of-data-path CPU processing
• High Performance (IOPS), High Bandwidth
[Diagram: FC, FCoE and IB host interfaces; Interface Controller FPGA; Management Module; CPUs (18); RAID Controller; Flash Modules (12)]
IBM FlashSystem 840: Flash Module, Logical View
Scalability and Parallelism: 7,680 parallel flash operations and 1,600 DMA parallel operations
FPGA for a second: In most hybrid or SSD-based solutions, most of these actions are done in SW-heavy layers that ultimately impact LATENCY.
• Boards: Primary Board (2TB); Secondary Board (2TB) - Optional
• Flash Controller - FPGA: 2 or 4 per module
  • Data path, hardware I/O logic
  • Look-up tables and write buffer
  • Each controls 20 flash chips
• Gateway Interface FPGA
  • I/O interface and Direct Memory Access path
• Control PPC and DRAM
  • Out-of-data-path operations
  • Garbage collection, error handling, system health, wear leveling, statistics, etc.
• Flash Chips
  • 20 per flash controller
  • 40 or 80 per module
• Data XOR parity
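The per-module counts above can be multiplied out to check them against the system totals quoted elsewhere in the deck (12 modules, 48 flash controllers, 960 chips). A small sketch of that arithmetic follows; the final ratio of parallel operations per chip is derived here and is not stated on the slide:

```python
CHIPS_PER_CONTROLLER = 20          # "Each controls 20 flash chips"
CONTROLLERS_PER_MODULE = (2, 4)    # "2 or 4 per module"
MODULES = 12                       # "Flash Modules (12)" in the hardware view
PARALLEL_FLASH_OPS = 7680          # quoted on this slide

# Chips per module for the small and large module variants.
chips_per_module = [n * CHIPS_PER_CONTROLLER for n in CONTROLLERS_PER_MODULE]

# Fully populated system with the larger (4-controller) modules.
total_controllers = MODULES * max(CONTROLLERS_PER_MODULE)
total_chips = total_controllers * CHIPS_PER_CONTROLLER

# Derived value (not from the deck): parallel operations available per chip.
ops_per_chip = PARALLEL_FLASH_OPS / total_chips

print(chips_per_module, total_controllers, total_chips, ops_per_chip)
```

The results line up with the slide's "40 or 80 per module" and the 48 controllers and 960 chips cited in the 820 comparison.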
IBM FlashSystem 840: Reliability Ingredients
Superior Durability: Using the Best Flash
• SLC market demand decreasing
• eMLC data protection techniques delivering more wear life than what market demands
• eMLC delivers best price/performance
Superior Protection: Beyond Disk RAID
• Chip/Plane/Die level protection
• Variable Stripe Sizes
• Read Disturb Mitigation
• Automatic Read Sweeper
• High-Speed Clock Recovery
• Protection Within And Across Flash Modules
• Self-Recovering Flash Modules
• Avoid system rebuilds
Advanced Engineering = Less Maintenance
[Chart labels: 3X, 10X]
Flash Quality Matters!
• Flash type matters
• P/E cycles vary
  • MLC: Multilevel cell flash is consumer-grade
  • eMLC: Enterprise-grade MLC
    • 10x improvement over MLC
  • SLC: Single-level cell
    • 33x improvement over MLC
• eMLC will handle most enterprise application workload requirements
• FlashSystem technologies like Variable Stripe RAID™ and 2D Flash RAID™, wear leveling and over-provisioning lengthen system life by improving endurance of both eMLC and SLC
Flash Quality and Endurance (Technical)
• Writes will always be sequential
  • Via “wear leveling”
• Total Flash Capacity = 1 Full Write Cycle
  • 1st P/E Cycle would start on 2nd Write Cycle
• Flash quality is driven by the chip type
  • MLC, eMLC and SLC
  • Flash Quality = 3K, 30K or 100K P/E cycles respectively
• Media Write Bandwidth
  • How much data will be written per second
  • This is driven by the application
  • Is NOT going to change!
• How to increase endurance?
  • 1. Increase capacity
  • 2. Increase quality of chip
  • For endurance, eMLC is worth 10x the capacity of cMLC
Flash Quality and Endurance (Technical) cont.
Let’s assume a 720, 820 and other MLC arrays
• Use a reference 1GB/s Sustained BW
• 720 (SLC), 5TB = 15.85 Years
• 820 (eMLC), 10TB = 9.51 Years
• Both at same $ cost of capacity
• OTHER Arrays (cMLC)
  • 1 Year at 1GB/s before degradation
  • Sustained Writes don’t help cMLC
  • Will require a lot more capacity to increase endurance levels
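The endurance figures above appear to follow from capacity × P/E cycles ÷ sustained write bandwidth. The sketch below reproduces them under stated assumptions: decimal units (1 TB = 10^12 bytes, 1 GB/s = 10^9 bytes/s), a 365-day year, perfect wear leveling, and no write amplification or over-provisioning. The function and the 10 TB cMLC example are illustrative, not from the deck.

```python
SECONDS_PER_YEAR = 365 * 24 * 3600

# P/E cycle ratings quoted in the deck for each chip type.
PE_CYCLES = {"cMLC": 3_000, "eMLC": 30_000, "SLC": 100_000}

def endurance_years(capacity_tb: float, chip_type: str, write_gb_per_s: float = 1.0) -> float:
    """Years until every cell has used its rated P/E cycles at a sustained write rate."""
    total_writable_bytes = capacity_tb * 1e12 * PE_CYCLES[chip_type]
    seconds = total_writable_bytes / (write_gb_per_s * 1e9)
    return seconds / SECONDS_PER_YEAR

slc_720 = endurance_years(5, "SLC")      # RamSan/FlashSystem 720, 5 TB SLC
emlc_820 = endurance_years(10, "eMLC")   # RamSan/FlashSystem 820, 10 TB eMLC
cmlc_10tb = endurance_years(10, "cMLC")  # hypothetical 10 TB cMLC array

print(f"720 (SLC, 5 TB):    {slc_720:.2f} years")
print(f"820 (eMLC, 10 TB):  {emlc_820:.2f} years")
print(f"cMLC array (10 TB): {cmlc_10tb:.2f} years")
```

With these assumptions the first two values round to the slide's 15.85 and 9.51 years, and a 10 TB cMLC array lands just under the "1 year at 1GB/s" quoted for other arrays.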
2D Flash RAID™ (for increased protection)
• External Interfaces (FC, IB): Interface A, Interface B
• RAID Controllers: RAID Controller A, RAID Controller B
• RAID 5 within Flash Modules (9 data + 1 parity)
• RAID 5 across Flash Modules (10 data + 1 parity + 1 hot spare)
FlashSystem 2D Flash RAID™
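Both RAID 5 dimensions rest on XOR parity: the parity chunk is the XOR of the data chunks, so any single lost chunk can be rebuilt from the survivors. The following is a generic illustration of that principle using the 9 data + 1 parity layout from the slide. It is not IBM's implementation, and the chunk contents and helper name are made up for the example.

```python
from functools import reduce

def xor_parity(chunks: list[bytes]) -> bytes:
    """XOR equally sized chunks together, byte by byte."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*chunks))

# Nine illustrative data chunks (4 bytes each) plus one parity chunk.
data = [bytes([i] * 4) for i in range(1, 10)]
parity = xor_parity(data)

# Simulate losing one data chunk and rebuilding it from the rest plus parity.
lost_index = 3
survivors = data[:lost_index] + data[lost_index + 1:] + [parity]
rebuilt = xor_parity(survivors)

# Share of raw capacity holding data in each dimension (ignores over-provisioning).
within_module = 9 / (9 + 1)         # 9 data + 1 parity
across_modules = 10 / (10 + 1 + 1)  # 10 data + 1 parity + 1 hot spare
combined = within_module * across_modules

print(rebuilt == data[lost_index], round(combined, 2))
```

The capacity fractions are a simplification: they only count parity and spare overhead as described on the slide.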
SVC Easy Tier with FlashSystem: Solution Architecture
• Gain advantage of FlashSystem write performance
• Capacity enhanced by using traditional storage
• Enhance read/write performance of traditional storage using Easy Tier (FlashSystem can be utilized as high performance tier)
• Cost savings by using heterogeneous storage devices
[Diagram: IBM Power hosts; SVC stretched cluster of two IBM SAN Volume Controllers with Easy Tier switched on, short distance (10 km) to avoid latency; each controller paired with an IBM FlashSystem 840 and IBM Storwize or XIV]
Leverage economical FlashSystem 840 as performance accelerator of SVC Easy Tier. Also enables enterprise business continuity via stretched cluster.
IBM Flash Software
• IBM FlashCache Storage Accelerator (IBM System x Caching): close storage to CPU performance gap
• 5th Generation Easy Tier (IBM DS8870 & IBM Power 7 Cooperative Caching): tier data and boost storage performance with Easy Tier Server
[Diagram: Server, Cache, Disks]
IBM Flash Software: IBM FlashCache Storage Accelerator (IBM System x Caching)
Close storage to CPU performance gap
Intelligent caching – right out of the box
• Transforms Flash into a transparent acceleration cache for virtual or physical servers
• Immediate, dynamic and seamless - no tuning, reconfiguration or data placement needed; adapts to changing workloads
• Caches most active data closer to application
• Eliminates I/O bottlenecks and minimizes latency
• Increases VM density per host
• Preserve IT agility with VMware vMotion, DRS, and HA support
Flexible flash caching
• Supports extensive portfolio of IBM Flash including PCIe adapters, SSDs and eXFlash
• Leverage SAN/NAS/DAS investments with server side cache
• One caching for multiple servers: VMware, Windows and Linux
[Diagram: Server, Cache, Disks]
"Information being released represents IBM's current intent, is subject to change or withdrawal, and represents only goals and objectives"
IBM Flash Software: 5th Generation Easy Tier (IBM DS8870 & IBM Power 7 Cooperative Caching)
Tier data and boost storage performance with Easy Tier Server
• Easy Tier Server – A new IBM Easy Tier feature that automatically places a copy of frequently accessed data on server Flash Drawers attached to Power7/7+ Servers running AIX
• IO intensive apps realize 5x improvement in app performance
• Improved user experience, number of transactions, better data insight, increased business opportunity and revenue
• Savings in licensing costs, floor space, energy and SAN utilization
• Target apps – OLTP, Analytics, Big Data
• Cooperative Caching: Continuous server/storage communication to optimize hot data placement
http://www.youtube.com/watch?v=SLQfNoidG3I
Cooperative Caching value over Caching
• Provide, end to end, ONE point of storage data placement
• Enable mission critical customer with storage replication (flash copy and remote copy)
  • Will coordinate the server cache to support storage replication function
• Enable cache and data sharing between multiple servers
  • Leverage network storage as sharing point to support shared storage application such as vMotion or clustered DB
  • With each new write, network storage will broadcast the cache invalidation to all registered servers to purge old copy of cache data
• Adapt to workload changes
  • Based on workload, storage can instruct to bypass cache
• Leverage same API for applications to pre-stage hot data or accelerate pushing down of cold data
  • Applications can instruct storage to pre-populate hot data via CLI
  • Enable applications to predict the hot data and place the data in right place before the event happens
[Diagram: two IBM Servers, each with a Client Driver and Flash Cache; IBM Storage with Easy Tier Server, Easy Tier, HDD and SSD; connected over the SAN with storage coherency]
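The invalidation behavior described above (each write makes shared storage tell every registered server to purge its stale copy) can be sketched in a few lines. This is a toy model with hypothetical class and method names. It is not the Easy Tier Server protocol or any IBM API, and it ignores networking, ordering and failures.

```python
class ServerCache:
    """Hypothetical server-side flash cache holding copies of storage blocks."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.entries: dict[int, bytes] = {}

    def invalidate(self, block: int) -> None:
        # Purge the stale copy, if this server holds one.
        self.entries.pop(block, None)


class SharedStorage:
    """Hypothetical network storage acting as the single sharing point."""

    def __init__(self) -> None:
        self.blocks: dict[int, bytes] = {}
        self.registered: list[ServerCache] = []

    def register(self, cache: ServerCache) -> None:
        self.registered.append(cache)

    def read(self, cache: ServerCache, block: int) -> bytes:
        # Serve from the server cache on a hit; otherwise fetch and populate.
        if block not in cache.entries:
            cache.entries[block] = self.blocks[block]
        return cache.entries[block]

    def write(self, block: int, value: bytes) -> None:
        # Update storage, then broadcast invalidation to all registered servers.
        self.blocks[block] = value
        for cache in self.registered:
            cache.invalidate(block)


storage = SharedStorage()
server_a, server_b = ServerCache("A"), ServerCache("B")
storage.register(server_a)
storage.register(server_b)

storage.write(7, b"old")
first_a = storage.read(server_a, 7)    # both servers now cache block 7
first_b = storage.read(server_b, 7)
storage.write(7, b"new")               # invalidates both cached copies
second_a = storage.read(server_a, 7)   # re-fetched from storage
second_b = storage.read(server_b, 7)
```

Without the broadcast in `write`, both servers would keep returning the old value, which is the coherency problem the slide says cooperative caching addresses for shared-storage applications.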
DS8870 Easy Tier with FlashSystem: Solution Architecture
Flash Everywhere: Flash Adapter 90 acting as Easy Tier Server solution. FlashSystem 840 as performance accelerator of SVC Easy Tier. Easily leverage mirroring for business continuity.
• Enhance server Easy Tier via Flash Adapter 90
• Cost savings by using heterogeneous DS8870 SSD & HDD storage devices
[Diagram: IBM Power with Flash Adapter 90; DS8870 with Easy Tier, SSD and HDD]
IBM Flash Drawer: EXP30 Ultra SSD I/O Drawer
Most dense high performance SSD Power Systems has ever shipped
Ultra Performance
• Up to 480,000 IOPS (100% read)
• Up to 410,000 IOPS (60/40% read/write)
• Up to 325,000 IOPS (100% write)
• Up to 4.5 GB/s bandwidth
Ultra Enterprise RAS
• Multiple RAID options, hot-plug
• Up to 30 hot-plug SSD (up to 11.6 TB)
• Up to 48 drives & 43 TB downstream HDD
• SOD for Easy Tier capability within drawers with AIX
• Six-packs ease SSD ordering and save money
Ultra Dense
• 1U direct attached drawer
• ZERO PCIe slots used – direct connect to GX++
• High density & performance in any op environments
• AIX, IBM i, Linux supported
IBM Flash Adapters
IBM offers a distinctive set of solutions for your storage performance dilemmas
It’s no longer a question of if Flash will be part of your computer center, but rather when and how it will be implemented.
IBM’s Flash Adapters and SSDs satisfy a need for extremely fast and reliable access to data within the server.
• Included within the server. No additional rack space required.
• High IOPS performance co-located with compute processors for low latency access
• Dedicated capacity for individual server performance
IBM’s Flash Adapters and SSDs unleash dramatic performance increases such that every server can now readily contain the enterprise I/O performance
• Accelerate applications
• Improve response times
• Boost efficiency
• Reduce storage latencies and eliminate IO bottlenecks
IBM Enterprise Solid-State Drive Technology is pioneering the way to new levels of price performance.
IBM Flash Adapter 90 Enterprise Reliability
Identity Insight with Flash Adapter 90
The IBM Flash Adapter 90 outperformed EXP30 and V7000 (HDD) from a TPM perspective, and outperforms all three from a $/Performance perspective.
Steady-state is more important than the peak.
[Chart: FA 90 - TPM]
Optimized IBM Flash Portfolio Options (Software, Servers, Storage)
• Flash in Server: Flash Adapter 90, SSDs, EXP30 or EXP24S Direct Attached Drawers. Example: EXP30 Ultra SSD Drawer
• Flash in Storage: Flash Optimized Systems, All-Flash, Hybrid Flash, Flash Caching. Example: IBM FlashSystem
• Flash Server Caching: Caching w/ Flash Adapters and SSDs. Example: IBM High IOPS Adapters + FlashCache
• Flash Cooperative Caching: Advanced Tiering Software w/ Flash Drawers, Flash Adapter 90, SSDs. Example: EXP30 Ultra SSD Drawer + IBM DS8870 Storage System + Easy Tier Server