1 / 0

N oSQL for the SQL Server Pro

N oSQL for the SQL Server Pro. Lynn Langit. Feb 2013 – SDC, Sweden. Is NoSQL just Hadoop? . HUGE Hype factor over last few years. Apache Hadoop  is a  software framework  that supports data-intensive  distributed applications  under a  free license

jemima
Download Presentation

N oSQL for the SQL Server Pro

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. NoSQL for the SQL Server Pro

    Lynn Langit Feb 2013 – SDC, Sweden
  2. Is NoSQL just Hadoop? HUGE Hype factor over last few years Apache Hadoop is a software framework that supports data-intensive distributed applications under a free license enables applications to work with thousands of nodes and petabytes of data was inspired by Google's MapReduce and Google File System (GFS) papers
  3. Hadoop in the Enterprise
  4. Working with Hadoop Common Tools / Languages Java (JDK) / Eclipse MapReduce Map (query/format) Reduce (aggregate) plug-in for Eclipse (Java) Pig (ETL -- Java) Hive (HQL Query) HBase tables Others Mahout (analyze) Karmasphere (analyze) R (analyze)
  5. Demo -HDInsight– Cluster Allocation
  6. What is the relationship?
  7. BigData = Exponentially More Data Retail Example -> ‘Feedback Economy’ Number of transactions Number of behaviors (collected every minute)
  8. BigData = ‘Next State’ Questions
  9. Demo - HDInsight - MapReduce
  10. Hitting (Relational) Walls CA Highly-available consistency CP Enforced consistency AP Eventual consistency
  11. So many NoSQL options More than just the Elephant in the room Over 120+ types of NoSQL databases
  12. Flavors of NoSQL
  13. Key / Value Database Schema-less State (Persistentor Volatile) Examples AWS Dynamo DB Riak
  14. Column Database Wide, sparse column sets Examples: Cassandra HBase BigTable GAE HR DS Azure Tables SQL 2012 Tabular Model
  15. More about Column Databases Type A Column-families Non-relational Sparse Examples: HBase, Cassandra, xVelocity (SQL 2012 Tabular) Type B Column-stores Relational Dense Example: SQL Server 2012 Columnstore index
  16. Demo - Document Database (Mongo DB) document-oriented (collection of JSON documents)w/semi structured data Encodings includeBSON, JSON, XML… binary forms PDF, Microsoft Office documents -- Word, Excel…)
  17. Demo - Graph Database (Neo4j) a lot of many-to-many relationships recursive self-joins when your primary objective is quickly finding connections, patterns and relationships between the objects within lots of data
  18. So which type of NoSQL? Back to CAP… CP = NoSQL/column Hadoop Big Table H-baseMemCacheDB CA = SQL/RDBMS SQL Sever / Oracle MySQL AP = NoSQL/document or key/value DynamoDB CouchDB CassandraVoldemort
  19. Which type of NoSQL for which type of data?
  20. Cloud-hosted NoSQL up to 50x CHEAPER
  21. The reality…two pivots
  22. NoSQL (Cloud) BLOB Storage Buckets Amazon – S3 or Glacier The gold standard Google – Cloud Storage Free for developers Microsoft Azure BLOBS DropBox, Box…
  23. Cloud-hosted RDBMS AWS RDS – SQL Server, mySQL, Oracle Medium cost Solid feature set, i.e. backup, snapshot Use existing tooling Google – mySQL Lowest cost Most limited RDBMS functionality Microsoft – SQLAzure Highest cost
  24. Demo - AWS RDS SQL Server, MySQL or Oracle Essential to understand pricing models
  25. Cloud Offerings– RDBMS AND NoSQL
  26. Data Scientists…
  27. Comparing…
  28. Karmasphere Studio for AWS
  29. Hadoop Connector to Excel
  30. Google BigQuery Hadoop-like (Dremel) based service For massive amounts of data SQL-like query language
  31. Dremel Realized => Impala Interactive Hadoop?
  32. Other types of cloud data services
  33. NoSQL To-Do List
  34. The Changing Data Landscape
  35. recipes) www.TeachingKidsProgramming.org Free Courseware ( Do a Recipe  Teach a Kid (Ages 10 ++) Java or Microsoft SmallBasic 
  36. Toward Data Craftsmanship…
More Related