Demystifying Deduplication
What is deduplication?
Deduplication eliminates redundant copies of data by using pointers to point duplicate files or blocks to a single object.
Approach:
• Eliminate redundant data
• Maintain references to single instances of data across the data store
• Start with the backup environment as the first phase
Deduplication can decrease disk capacity requirements by up to 98% and decrease bandwidth requirements for data transfer by a factor of up to 50.
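The pointer idea described above can be sketched as a toy content-addressed store. This is a minimal illustration, not how any particular product works: the class name `DedupStore`, the fixed 4-byte block size, and the use of SHA-256 as the block fingerprint are all assumptions made for the example.

```python
import hashlib


class DedupStore:
    """Toy store (hypothetical): each unique block is kept once."""

    def __init__(self, block_size=4):
        self.block_size = block_size  # tiny on purpose, for illustration
        self.blocks = {}              # digest -> block bytes (single instance)
        self.objects = {}             # object name -> list of digests (pointers)

    def put(self, name, data):
        pointers = []
        for i in range(0, len(data), self.block_size):
            block = data[i:i + self.block_size]
            digest = hashlib.sha256(block).hexdigest()
            # Store the block only if it has not been seen before.
            self.blocks.setdefault(digest, block)
            pointers.append(digest)
        self.objects[name] = pointers

    def get(self, name):
        # Rebuild the object by following its pointers.
        return b"".join(self.blocks[d] for d in self.objects[name])

    def stored_bytes(self):
        return sum(len(b) for b in self.blocks.values())


store = DedupStore(block_size=4)
store.put("backup_monday", b"AAAABBBBCCCCDDDD")
store.put("backup_tuesday", b"AAAABBBBCCCCEEEE")
logical_bytes = 32  # two 16-byte objects were written
physical_bytes = store.stored_bytes()
```

The two objects share three of their four blocks, so the store holds five unique blocks while both objects remain fully readable through their pointer lists.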
Dell’s point of view on deduplication
• Data deduplication is a capacity optimization feature, not a capacity optimization solution
  • Need to understand what problem you are trying to fix
  • Dell can help find the right solution to your storage challenges
• As deduplication matures it will be ubiquitous across a wide range of storage products
  • Deduplication integrated into software functionality provides the greatest benefits
• Deduplication technology will expand beyond backup to include static archive data and inactive primary data
Deduplication: confusion abounds
• Different architectures
• Different technologies
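Types of deduplication
Data deduplication eliminates common data at a file, block, or sub-block level.
[Figure: Data object #1 and Data object #2 are each made up of blocks drawn from A to F, with several blocks repeated. Only the unique blocks A, B, C, D, E, F are saved to disk, which reduces the disk capacity required.]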
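To make the difference between file-level and block-level deduplication concrete, here is a small sketch that counts how many bytes would be written to disk at each granularity. The function name `unique_bytes`, the sample objects, and the one-byte block size are hypothetical; they do not reproduce the objects in the slide's figure.

```python
import hashlib


def unique_bytes(objects, chunk_size=None):
    """Bytes stored after deduplication.

    chunk_size=None deduplicates whole objects (file level);
    an integer deduplicates fixed-size chunks (block level).
    """
    seen = set()
    stored = 0
    for data in objects:
        size = chunk_size or len(data) or 1
        for i in range(0, len(data), size):
            chunk = data[i:i + size]
            digest = hashlib.sha256(chunk).digest()
            if digest not in seen:
                seen.add(digest)
                stored += len(chunk)
    return stored


# Hypothetical data: the third object is an exact copy of the first,
# the second differs from the first only in its last block.
objects = [b"ABCD", b"ABCE", b"ABCD"]
raw = sum(len(o) for o in objects)             # no deduplication
file_level = unique_bytes(objects)             # exact duplicates removed
block_level = unique_bytes(objects, chunk_size=1)  # shared blocks removed too
```

File-level deduplication only catches the exact copy, while block-level deduplication also collapses the blocks shared between the two different objects. Finer granularity finds more duplicates at the cost of a larger index to maintain.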
Deduplication enables cost-effective disk-to-disk backup
[Figure: backup to disk with dedupe. Labels: Primary Disk, Secondary Disk, Replication, Backup, Archive, Deduplication.]
Why optimize disk-based backup with deduplication?
• Example: 20 TB of data growing 20% per year
• How long until the deduplicated storage capacity required equals what 3 years of growth would require without dedupe?
• Just over 15 years
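The slide does not state the deduplication ratio or the retention model behind its figure, so the following is only a sketch of the arithmetic under stated assumptions: capacity grows by a fixed percentage each year, deduplication divides the required capacity by a constant ratio, and that ratio is 10:1 (an assumed value, not taken from the source). The function name `breakeven_years` is hypothetical.

```python
import math


def breakeven_years(growth_rate, dedupe_ratio, baseline_years):
    """Years until deduplicated capacity equals the non-deduplicated
    capacity reached after `baseline_years` of growth.

    Solves start * (1 + g)**n / ratio == start * (1 + g)**baseline for n.
    The starting capacity cancels out of the equation.
    """
    return baseline_years + math.log(dedupe_ratio) / math.log(1 + growth_rate)


start_tb = 20.0        # from the slide
growth_rate = 0.20     # from the slide
dedupe_ratio = 10.0    # ASSUMPTION: not given in the source
years = breakeven_years(growth_rate, dedupe_ratio, baseline_years=3)

# Capacity on both sides of the comparison, in TB.
without_dedupe_3y = start_tb * (1 + growth_rate) ** 3
with_dedupe = start_tb * (1 + growth_rate) ** years / dedupe_ratio
```

With the assumed 10:1 ratio this simple model lands between 15 and 16 years, which is in the same range as the slide's answer. A different ratio or retention policy would shift the result.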
How deduplication fits into the backup environment
[Figure: application servers, a backup server, and either JBOD/NAS/SAN storage or a deduplication appliance. Deduplication can take place at any of these three points.]
Appliance-based (Target)
• Advantages:
  • Ease of implementation
  • Works with a variety of backup software
• Disadvantages:
  • Can be a more expensive solution
  • Replication target restrictions
  • Greater network traffic overhead
  • Often on proprietary hardware
Server-based (Source) & Integrated (Hybrid)
• Advantages:
  • Common management
  • Ease of use
  • Can be a less expensive solution (lower TCO)
  • Reduces network traffic
  • Global deduplication opportunity
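The network traffic difference between the two placements can be illustrated with a short sketch. It assumes a simple exchange in which the source asks the target which block fingerprints it is missing before sending any data. The names `BackupTarget`, `source_side_backup`, and `target_side_backup` are hypothetical, and the small cost of sending the fingerprints themselves is ignored.

```python
import hashlib


def fingerprint(block):
    return hashlib.sha256(block).hexdigest()


class BackupTarget:
    """Toy backup target (hypothetical) that stores each unique block once."""

    def __init__(self):
        self.blocks = {}

    def missing(self, digests):
        return [d for d in digests if d not in self.blocks]

    def receive(self, blocks):
        for block in blocks:
            self.blocks[fingerprint(block)] = block


def source_side_backup(target, blocks):
    """Deduplicate before sending: only unseen blocks cross the network."""
    by_digest = {fingerprint(b): b for b in blocks}
    to_send = [by_digest[d] for d in target.missing(list(by_digest))]
    target.receive(to_send)
    return sum(len(b) for b in to_send)  # payload bytes sent


def target_side_backup(target, blocks):
    """Send everything; the appliance deduplicates on arrival."""
    target.receive(blocks)
    return sum(len(b) for b in blocks)   # payload bytes sent


night1 = [b"AAAA", b"BBBB", b"CCCC"]
night2 = [b"AAAA", b"BBBB", b"DDDD"]  # one block changed since night 1

src = BackupTarget()
src_sent = [source_side_backup(src, night1), source_side_backup(src, night2)]

tgt = BackupTarget()
tgt_sent = [target_side_backup(tgt, night1), target_side_backup(tgt, night2)]
```

Both targets end up storing the same four unique blocks, but in the source-side case the second night's backup sends only the one changed block, whereas the target-side case sends all three blocks again.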