290 likes | 676 Views
BIG Data. Presented: January 2013. Together we build the right solution. Agenda. Introductions. Dennis J Perlot: Founder & CTO, Theia Solutions Over 25 years experience providing award winning, innovative IT solutions Smithsonian Innovators Award Global Innovation Award
E N D
BIG Data Presented: January 2013 Together we build the right solution
Introductions Dennis J Perlot: Founder & CTO, Theia Solutions • Over 25 years experience providing award winning, innovative IT solutions • Smithsonian Innovators Award • Global Innovation Award • Artificial Intelligence/Machine Learning • Technology Community Advocate • Speaker/ Technology Evangelist Megan Cocuzzo: Director, Business Intelligence • Over 15 years experience leveraging “BIG DATA” to deliver innovative financial and resource optimization strategies and tools • Financial Planning & Analysis • Capacity & Resource Planning • Opportunity and Risk Assessment • Capital Funding • Six Sigma Black Belt Professional • ISO 9000 Quality System Auditor
Theia Solutions LLC “Together we build the right solution” • Socially responsible technology services • Application Development • Data Optimization • Cloud Hosting • Data Analytics • Why Theia? • We put people first • Partnerships not just contracts • Innovative solutions
What is “Big Data”? • Data sets that can not be processed with traditional tools such as relational databases, requiring “massively parallel” approaches. • What is considered "big data" varies depending on the organization and the applications that are used to process and analyze the data set in its domain. • Traditional tools can not handle the 3 V’s: A visualization created by IBM of Wikipedia edits. At multiple terabytes in size, the text and images of Wikipedia are a classic example of big data.
Just how BIG… • 1000 Megabytes = 1 Gigabyte • 1000 Gigabytes = 1 Terabyte • 1000 Terabytes = 1 Petabyte [where most corporations are] • 1000 Petabytes = 1 Exabyte • 1000 Exabytes = 1 Zettabyte [where Facebook and Google are] • 1000 Zettabytes = 1 Yottabyte • 1000 Yottabytes = 1 Brontobyte
Where does it come from? • Web logs and blogs • eCommerce • Mobile - 4.5 billion phones • Sensors – temp, vibration, etc. • Smartphones • 400 million worldwide • Over 50% of US cell users
How fast is it generated? • eCommerce – 56 million plus transactions in Q3 2012 • RFID – location reporting • Large Hadron Collider: 700MB to 1 TB per second • Cell phone location tracking • Must consider data in motion vs. data at rest
Consider the electricity model • Do you build a power plant? • Do you run wires to your home? • Do you buy transformers, etc. • Let someone else worry about all that and just pay for what you use. • This is cloud computing • Pay for what you use • Rapid elasticity • Location transparent resources
Cloud Offerings • Infrastructure as a Service (IaaS) “… servers, servers, get your servers here” • Platform as a Service (PaaS) “… just give me a place for my application and data” • Software as a Service (SaaS) “… like Salesforce.com”
On-Premises Separation of Responsibilities Infrastructure (as a Service) Software (as a Service) Platform (as a Service) You manage Applications Applications Applications Applications You manage Data Data Data Data You manage Runtime Runtime Runtime Runtime Middleware Middleware Middleware Middleware Other Manages Other Manages O/S O/S O/S O/S Virtualization Virtualization Virtualization Virtualization Other Manages Servers Servers Servers Servers Storage Storage Storage Storage IaaS PaaS SaaS Networking Networking Networking Networking
Is it Secure? • Microsoft Azure Platform • SAS 70 Type 1 and Type 2 (now SSAE 16) • ISO 27001 • Safe Harbor • HIPPA • SOX • PCI DSS • Over 250 internal controls • More guards than engineers at most facilities
Who is using the cloud today? Who is NOT using the cloud today……..
For your information…. • 1 billion: Windows Live ID authentications each day • 3 to 4 billion: junk emails filtered daily • 2 billion: queries each month on Bing • 100 million plus: Windows Update users • 6 Regional Data Centers : 2 each in US, Europe, Asia • 400,000 plus: square footage in each datacenter
What the scoop? • Breaks problem down into smaller “chunks” • Why is it called Hadoop? • Doug Cutting was trying to think of a name for his “map reduce” system • His son said “Why don’t you name it after my toy elephant?
Comparison Hadoop Cluster Traditional Data Center
Who is using? • Amazon/A9 • Facebook • Fox interactive media • Google • IBM’s Watson • New York Times • J.P. Morgan • Rackspace • eBay • Yahoo! • More at http://wiki.apache.org/hadoop/PoweredBy
Next Steps & Recommendations • Monitor Hadoop in marketplace • Revise thinking on problems “Why not record every mouse click?” “If we capture it, we can process it” • Think about “recommender” apps • More is better!
Who Are They? • Computer skills • Understands Relational Databases • Write SQL queries • Linking internal and external data • Statistics skills • Design “experiments” • Create analytical models • Top Job on LinkedIn
Why BIG Data Matters and the importance of data agility The next frontier for innovation, competition, and productivity
Theia Solutions LLC Data Analytics Offerings • Our process begins with an end to end assessment and documentation of your current capabilities and data structure • No two organizations are alike • No two data sets are alike • We partner with you to develop a data strategy to exceed your goals in the form of a strategic roadmap • The key drivers to operational health vary as do the regulatory and compliance needs of each organization in each market/sector
Theia Solutions LLC Data Analytics Offerings
Theia Solutions LLC • So, no matter what your need, Theia Solutions can help you get there • Experienced, agile, specialized teams • Innovative Ideas, Old School Values • Long Terms Partnership with Clients
Questions www.TheiaSolutionsLLC.com Dennis.Perlot@theiasolutionsllc.com Megan.Cocuzzo@theiasolutionsllc.com