The Age of Infinite Storage has begun

The AgeofInfinite Storagehas begun Many of us have enough money in our pockets right now to buy all the storage we will be able to fill for the next 5 years. So having the storage capacity is no longer a problem. Managing it is a problem (especially when the volume gets large). How much data is there?

Googi 10100 . . . Yotta 1024 Zetta 1021 Exa 1018 Peta 1015 Tera 1012 Giga 109 Mega 106 Kilo 103 • Tera Bytes (TBs) are Here • 1 TB costs  1k$ to buy • 1 TB costs ~300k$/year to own • Management and curation are the expensive part • Searching 1 TB takes hours • I’m Terrified byTeraBytes • I’m Petrified by PetaBytes We are here • I’ll soon be Exafied by ExaBytes • I’m too old to ever be Zettafied by ZettaBytes • But you may be in your lifetime • You may even be Yottafied by YottaBytes • You may never be Googified byGoogiBytes • But the next generation may be?

How much information is there? Yotta Zetta Exa Peta Tera Giga Mega Kilo Everything! Recorded • Soon everything can be recorded and indexed. • Most of it will never be seen by humans. • Data summarization, trend detection, anomaly detection, data mining, are key technologies All Books MultiMedia All books (words) .Movie A Photo A Book 10-24 Yocto, 10-21 zepto, 10-18 atto, 10-15 femto, 10-12 pico, 10-9 nano, 10-6 micro, 10-3 milli

First Disk, in 1956 • IBM 305 RAMAC • 4 MB • 50 24” disks • 1200 rpm (revolutions per minute) • 100 milli-seconds (ms) access time • 35k$/year to rent • Included computer & accounting software(tubes not transistors) W. P. 7th Grade C.S. lab Asst.

10 years later 30 MB 1.6 meters

MemexAs We May Think, Vannevar Bush, 1945 “A memex is a device in which an individual stores all his books, records, and communications, and which is mechanized so that it may be consulted with exceeding speed and flexibility” “yet if the user inserted 5000 pages of material a day it would take him hundreds of years to fill the repository, so that he can enter material freely”

Can you fill a terabyte in a year?

On a Personal Terabyte,How Will We Find Anything? • Need Queries, Indexing, Data Mining, Scalability, Replication… • If you don’t use a DBMS, you will implement one of your own! • Need for Data Mining, Machine Learning is more important then ever! Of the digital data in existence today, • 80% is personal/individual • 20% is Corporate/Governmental DBMS

I made up these Name! Projected data sizes are overrunning our ability to name their orders of magnitude! We’re awash with data! • Network data: • 10 terabytes by 2004 ~ 1013 Bytes • US EROS Data Center archives Earth Observing System (near Soiux Falls SD) Remotely Sensed satellite and aerial imagery data • 15 petabytes by 2007 ~ 1016 Bytes • National Virtual Observatory (aggregated astronomical data) • 10 exabytes by 2010 ~ 1019 Bytes • Sensor data from sensors (including Micro & Nano -sensor networks) • 10 zettabytes by 2015 ~ 1022 Bytes • WWW (and other text collections) • 10 yottabytes by 2020 ~ 1025 Bytes • Genomic/Proteomic/Metabolomic data (microarrays, genechips, genome sequences) • 10 gazillabytes by 2030 ~ 1028 Bytes? • Stock Market prediction data (prices + all the above?) • 10 supragazillabytes by 2040 ~ 1031 Bytes? Useful information must be teased out of these large volumes of raw data. AND these are some of the 1/5th of "Corporate" or "Governmental" data collections. The other 4/5ths of data sets are personnel!

Parkinson’s Law(for data) • Data expands to fill available storage • Disk-storage version of Moore’s Law • Available storage doubles every 9 months! • How do we get the information we need from the massive volumes of data we will have? • Querying (for the information we know is there) • Data mining (for the answers to questions we don't know to ask precisely).

The Age of Infinite Storage has begun

The Age of Infinite Storage has begun

Presentation Transcript

A new era of combination therapy has begun

Equity has begun to outperform gold

StarTrek 2004: The Quest for Planets has Begun

Instant Results with Infinite Storage

The Journey has begun…it’s day 1!!!

Bitcasa offers infinite storage!

THE COUNTDOWN HAS BEGUN!

your journey has begun…

Managing Economic Future: The future has already begun

A Journey has Begun…. Towards Repositioning the Status of Teachers in Pakistan

The Normans charge and the Battle of Hastings has begun.

The Dawning of the Age of Infinite Storage

ST 3 – The countdown has begun!

Summer Conference Madness Has Begun !

Corporate social responsibility : the journey has jus begun!

The Recovery Has Begun But It’s Hard To See

IF THE BELL HAS RUNG, THE CLASS HAS BEGUN .

inf idea , a journey of discovery – that has just begun

THE RACE HAS BEGUN

Fiscal Notes – the 81 st Legislative Session has begun!

ITIL 4 Foundation Launch - Countdown Has Begun!

The Dawning of the Age of Infinite Storage