370 likes | 725 Views
Big Data. By: John Lawrence, Patrick Fletcher, Travis Stancil , Chasten Caluya. What is Big Data?. http:// www.youtube.com / watch?v =ahZGEusG13A. What is Big Data?.
E N D
Big Data By: John Lawrence, Patrick Fletcher, Travis Stancil, Chasten Caluya
What is Big Data? http://www.youtube.com/watch?v=ahZGEusG13A
What is Big Data? “Big Data is a popular term used to describe the exponential growth and availability of data, both structured and unstructured.” -SAS Institute
The Three “V’s” Volume- Exponentially increasing amounts of data require more comprehensive methods of organization, and interpretation. Velocity- The unprecedented speed with which data is streamed in the modern world requires timely analysis. Variety- Increasingly complex data sources and formats require integration and interpretation into a common intellectual medium in order to make informed decisions.
Additional Dimensions Variability- Inconsistency of data flows marked by periodic peaks/trends whether daily, seasonal, or event-triggered. Complexity- Similar to variety. Data is harvested across multiple sources with differing formats. Must consolidate and integrate these sources to make informed decisions.
The First Data Miner • John Gruant (1620-1674) • Bubonic Plague • Bills of Mortality • Refuted beliefs • London’s population
1989 British computer scientist Tim Bemers-Lee Internet pioneered by U.S. Gov. in the 60’s “Hypertext” system called World Wide Web
1997 NASA researchers Michael Cox and David Ellsworth “Big Data” Supercomputers generating massive amounts of information
2002 The U.S. Government mining data to thwart terrorism since the 9/11 attacks. Former National Security Advisor John Poindexter Defense Department
2004 Wal-Mart boasts a cache of 460 terabytes Customers’ shopping and personal habits 460 terabytes > 2x total data on the internet 1 terabyte = 1,024 gigabytes
May 2009 U.S. President Barack Obama’s Administration data.gov as part of Open Government Initiative More than 445,000 data sets
February 2011 200 million pages of information or 4 terabytes of disk storage IBM’s Watson Computer System (AI) Won quiz show “Jeopardy!”
Digital Revolution Growing number of sensors, digital devices, corporate databases, and social media sites 90% of World’s data generated over the last 2 years (SINTEF, 2013) Marketers to policy makers looking to Big Data
“In 2012, every day 2.5 quintillion bytes of data (1 followed by 18 zeros) are created, with 90% of the world’s data created in the last two years alone” -Maria Conner
Components of Big Data in Business Speed (Velocity)- Rate of data creation Type (Variety)- Use of semi-structured and unstructured data Volume- Ability to derive needed information
Data Production Online Transactions- E-commerce, Stock Trading and Bank Transactions Social Media- Facebook, Twitter, Instagram Mobile Devices- World population to active mobile devices
Processing Large Data Traditional database and computing software Commodity software Parallel processing Non-Relational data storage
Analytic Software Hadoop by Doug Cutting of Yahoo Inspired by Google’s MapReduce; Used to index the web Data processing over large networks of commodity software
Database Management System No SQL compared to SQL Analysis of large unstructured data sets Upgrade over traditional database
Why Utilize Big Data To harness relevant data and analyze it with the following objectives: 1) Cost reductions 2) Time reductions 3) New product development and optimized offerings 4) Smarter business decision-making
Teradata Workload specific platforms Hadoop Data warehouse software Discovery platform Tools and utilities Marketing applications Business consulting services
Cabela’s “…the extract, transform and load process into the SAS data mart was run biweekly, but loading often took as long as two weeks…” “…By the time it was done building, the data could be up to four weeks old, and we'd have to start again…"
Ebay http://www.teradata.com/videos/ebay-conquers-complexity/
Other Companies U.S. Department of Health Resources and Services Administration Zinch and UH Hilo
Example: IBM Solutions for NYSE http://www.youtube.com/watch?v=YJd62ooBVvI
Security Identify and understand data sensitivity levels Understand impact of data release Develop policies and procedures related to Big Data security and privacy Develop and execute a technical security approach that complements the security of your analytics platform (Smith, 2013).
Talent Gap There will be a shortage of employees who possess the analytical talents organizations need to take advantage of Big Data. • Projected shortage in the United States alone: • 140,000-190,000 analytical specialists • 1.5 million managers who can make decisions based on analytics
Conclusion “As modern businesses seek to not only gain a competitive edge in their respective industries but remain on par with competitors, it becomes essential that management not only recognizes but implements Big Data in their operating environment. While this will present various challenges including organization of information, security, and analytics, the potential for companies who chose Big Data is exponentially increasing.”