Fitting Microsoft Hadoop Into Your Enterprise BI Strategy. Cindy Gross | @SQLCindy | SQLCAT PM http://blogs.msdn.com/cindygross. Big Agenda. SQL, NoSQL, Hive. Data, Insights, Visualization. What Big Data Is and Isn’t. Microsoft Hadoop. I don’t need no NoSQL… Do I?
Fitting Microsoft Hadoop Into Your Enterprise BI Strategy Cindy Gross | @SQLCindy | SQLCAT PM http://blogs.msdn.com/cindygross
Big Agenda • SQL, NoSQL, Hive • Data, Insights, Visualization • What Big Data Is and Isn’t • Microsoft Hadoop
I don’t need no NoSQL… Do I? • SQL Server: Structured, SQL • Hadoop: Unstructured, NoSQL • Fulfill Different Needs
How do I leverage my #SQLAwesomeness? • HiveQL • TSQL • SELECT deviceplatform, state, country FROM hivesampletable LIMIT 200; • Hive
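To make the HiveQL/TSQL comparison concrete, here is a minimal Python sketch that rewrites the slide's HiveQL query into a T-SQL shape. It assumes the only dialect difference in play is the row limit (HiveQL `LIMIT n` versus T-SQL `TOP (n)`); the helper name `hive_limit_to_tsql_top` is hypothetical, and this is an illustration rather than a general translator.

```python
import re

# Matches a single SELECT statement that ends in "LIMIT <n>" (optional semicolon).
_LIMIT_QUERY = re.compile(
    r"^\s*SELECT\s+(?P<body>.*?)\s+LIMIT\s+(?P<n>\d+)\s*;?\s*$",
    re.IGNORECASE | re.DOTALL,
)


def hive_limit_to_tsql_top(hiveql: str) -> str:
    """Rewrite 'SELECT ... LIMIT n' as 'SELECT TOP (n) ...'.

    Hypothetical helper: queries that do not match the simple
    pattern above are returned unchanged.
    """
    match = _LIMIT_QUERY.match(hiveql)
    if match is None:
        return hiveql
    return f"SELECT TOP ({match.group('n')}) {match.group('body')};"


# The HiveQL query from the slide.
HIVEQL = "SELECT deviceplatform, state, country FROM hivesampletable LIMIT 200;"
TSQL = hive_limit_to_tsql_top(HIVEQL)
print(TSQL)
```

The column list and FROM clause carry over untouched, which is the point of the slide: most of the SELECT syntax a SQL Server user already knows applies in Hive.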
All your data are belong to us • Sqoop to/from relational • Azure Blob Store • SFTP • Amazon S3 • Hive ODBC Driver • Azure Data Market
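As a rough sketch of the "Sqoop to/from relational" item, the Python below assembles Sqoop command lines for an import from SQL Server into Hadoop and an export back. The host, database, table, user, and directory names are all hypothetical placeholders; the flags shown (`--connect`, `--username`, `-P`, `--table`, `--target-dir`, `--export-dir`, `--num-mappers`) are standard Sqoop 1 arguments, but check them against the Sqoop version you actually run. The code only builds and prints the commands; it does not execute them.

```python
import shlex

# Hypothetical SQL Server JDBC URL; replace host and database with your own.
JDBC_URL = "jdbc:sqlserver://sqlhost.example.com:1433;databaseName=SalesDW"


def sqoop_import_args(jdbc_url, table, target_dir, username, mappers=1):
    """Build argv for copying a relational table into HDFS."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "-P",  # prompt for the password instead of embedding it
        "--table", table,
        "--target-dir", target_dir,
        "--num-mappers", str(mappers),
    ]


def sqoop_export_args(jdbc_url, table, export_dir, username):
    """Build argv for copying HDFS files back into a relational table."""
    return [
        "sqoop", "export",
        "--connect", jdbc_url,
        "--username", username,
        "-P",
        "--table", table,
        "--export-dir", export_dir,
    ]


IMPORT_ARGS = sqoop_import_args(JDBC_URL, "DeviceUsage", "/data/deviceusage", "etl_user", mappers=4)
EXPORT_ARGS = sqoop_export_args(JDBC_URL, "DeviceSummary", "/data/devicesummary", "etl_user")

# shlex.join quotes the JDBC URL so the printed line is safe to paste into a shell.
print(shlex.join(IMPORT_ARGS))
print(shlex.join(EXPORT_ARGS))
```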
Big Data is what again? • Streaming • Hadoop • HDFS • Machine Learning • MapReduce • Massively Parallel Processing • Unstructured
Hadoop Ecosystem Snapshot • ETL Tools • BI Reporting • RDBMS • Mahout (ML) • Lucene/Solr (search indexing) • HCatalog • ZooKeeper (Coordination) • Pig (Data Flow) • Hive (SQL / DW) • Sqoop (SSIS) • Serialization (Thrift, Protobuf, Writable) • MapReduce (Job Scheduling / Execution System) • HBase (Column DB) • Cassandra (Column DB) • HDFS (Hadoop Distributed File System) • External Stores (S3, Azure Blobs, Azure Data Market, etc.) • Inspired by Tom White’s Hadoop: The Definitive Guide
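For readers new to the MapReduce layer in the snapshot above, here is a minimal single-process Python sketch of the map, shuffle, and reduce phases using the classic word count. It runs entirely in memory and does not use Hadoop; the function names and sample input are illustrative assumptions, and a real job would distribute each phase across the cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: collapse the values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over the input lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


# Illustrative input only.
COUNTS = word_count(["big data big insights", "data to action"])
print(COUNTS)
```

Because each map call sees one line and each reduce call sees one key, both phases can run in parallel on separate nodes, which is what lets the model scale out.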
When is big data a big fit? • IT Management • SLA Monitoring • Cyber Security • Forensic Analysis • Financial Services • Risk Modeling • Threat Analysis • Fraud Detection • Credit Scoring • Telemetry Management • Clickstream and Application Log Analysis • Sensor Data • Online Commerce • Sentiment Analysis • Recommendation Engines • Search Indexing / Quality
Big data is not the only tool • The answer to everything - NO • Simply a VLDB - NO • Fast for subsets & filtered data - NO • Replacement for relational - NO
VVVVroom • Volume – beyond what environment can handle • Velocity – need decisions fast • Variety – many formats • Variability – multiple interpretations
What does Microsoft bring to the table? • Sqoop • Open Source Apache Hadoop • Hadoop On Azure - CTP • JavaScript • Hive ODBC Driver
Why is Microsoft Hadoop a fit for my Enterprise? • Self Service • Interactivity • Familiar, reusable skills • Visualization • Ease of data movement • Elasticity
Who uses Big Data? • Data Scientists / Data Teams • Information Workers • Those Seeking Insights • Anyone who uses BI now
How do we visualize the results? • Custom Tools • PowerPivot • Power View • Hive ODBC Driver + Excel Add In
Insights to Action • Discover Insights • Take Action • Rinse and Repeat
Big Summary • SQL, NoSQL, Hive • Data, Insights, Visualization • What Big Data Is and Isn’t • Microsoft Hadoop
Big Data References • Hadoop: The Definitive Guide by Tom White • SQL Server Sqoop http://bit.ly/rulsjX • JavaScript http://bit.ly/wdaTv6 • Twitter https://twitter.com/#!/search/%23bigdata • Hive http://hive.apache.org • Excel to Hadoop via Hive ODBC http://tinyurl.com/7c4qjjj • Hadoop On Azure Videos http://tinyurl.com/6munnx2 • Klout http://tinyurl.com/6qu9php • Microsoft Big Data http://microsoft.com/bigdata • Denny Lee http://dennyglee.com/category/bigdata/ • Carl Nolan http://tinyurl.com/6wbfxy9 • Cindy Gross http://tinyurl.com/SmallBitesBigData