Big Data Technologies and the Evolution of NoSQL. Dwight Merriman dwight@10gen.com. Why #1: the Imperative to Scale (horizontally). http://en.wikipedia.org/wiki/Moore's_law. cloud, virtualization. commodity. power/cooling. data explosion.
Big Data Technologies and the Evolution of NoSQL Dwight Merriman dwight@10gen.com
cloud, virtualization. commodity. power/cooling. data explosion. http://www.globalnerdy.com/2007/09/07/multicore-musings/
UI compute data processing / ETL caching database / datastore network not just the database
UI √ compute data processing / ETL caching √ database / datastore network √
{
  _id : ObjectId("4e2e3f92268cdda473b628f6"),
  title : "Too Big to Fail",
  when : Date("2011-07-26"),
  author : "joe",
  text : "blah",
  tags : ["business", "news", "north america"],
  votes : 3,
  voters : ["dmerr", "sj", "jane"],
  comments : [
    { by : "tim157", text : "great story" },
    { by : "gora", text : "i don't think so" },
    { by : "dmerr", text : "also check out..." }
  ]
}
NoSQL = non-relational, next-generation operational data stores and databases. no joins + light transactional semantics = horizontally scalable architectures
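The link between "no joins" and horizontal scaling can be sketched in a few lines. This is a minimal illustration, not how any particular database implements sharding: `SHARD_COUNT`, `hashString`, `shardFor`, `insertDoc`, and `findById` are hypothetical names, and the in-memory `Map` objects stand in for separate servers. Because each document is self-contained, a lookup by key touches exactly one shard.

```javascript
// Hypothetical sketch: hash-based routing of self-contained documents.
// None of these names come from a real database API.
const SHARD_COUNT = 4;

// Each Map stands in for one independent server.
const shards = Array.from({ length: SHARD_COUNT }, () => new Map());

// Simple deterministic string hash (illustrative, not production quality).
function hashString(s) {
  let h = 0;
  for (let i = 0; i < s.length; i++) {
    h = (h * 31 + s.charCodeAt(i)) >>> 0; // keep h an unsigned 32-bit integer
  }
  return h;
}

// Pick the shard that owns a given key.
function shardFor(key) {
  return hashString(key) % SHARD_COUNT;
}

// Writes go to exactly one shard; no cross-shard coordination is needed.
function insertDoc(doc) {
  shards[shardFor(doc._id)].set(doc._id, doc);
}

// Reads by key also go to exactly one shard. The comments travel with
// the post, so no join against another shard is required.
function findById(id) {
  return shards[shardFor(id)].get(id);
}

insertDoc({
  _id: "post-1",
  title: "Too Big to Fail",
  comments: [{ by: "tim157", text: "great story" }],
});
insertDoc({ _id: "post-2", title: "Another Post", comments: [] });

const foundPost = findById("post-1");
console.log(foundPost.title, "has", foundPost.comments.length, "comment(s)");
```

Adding capacity in this model means adding shards; the cost is that operations spanning many documents (joins, multi-document transactions) would have to cross shard boundaries, which is why they are the features given up.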
legal SFA CMS customer preferences, behavior, relationships organizational knowledge team (human) process information EMR
no joins + light transactional semantics -> new data models
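To make the "new data models" point concrete, here is a small sketch contrasting a normalized layout, which must be joined at read time, with a single embedded document. The table names (`postRows`, `tagRows`, `commentRows`) and both loader functions are assumptions made for illustration; plain arrays and a `Map` stand in for real storage.

```javascript
// Normalized, relational-style layout: three "tables" linked by post_id.
// (Hypothetical data shapes, for illustration only.)
const postRows = [{ id: 1, title: "Too Big to Fail", author: "joe" }];
const tagRows = [
  { post_id: 1, tag: "business" },
  { post_id: 1, tag: "news" },
];
const commentRows = [
  { post_id: 1, by: "tim157", text: "great story" },
  { post_id: 1, by: "gora", text: "i don't think so" },
];

// Reading one post means combining rows from all three tables.
function loadPostWithJoins(id) {
  const post = postRows.find((p) => p.id === id);
  if (!post) return undefined;
  return {
    ...post,
    tags: tagRows.filter((t) => t.post_id === id).map((t) => t.tag),
    comments: commentRows
      .filter((c) => c.post_id === id)
      .map(({ by, text }) => ({ by, text })),
  };
}

// Document layout: tags and comments are embedded in the post itself.
const postDocuments = new Map([
  [
    1,
    {
      id: 1,
      title: "Too Big to Fail",
      author: "joe",
      tags: ["business", "news"],
      comments: [
        { by: "tim157", text: "great story" },
        { by: "gora", text: "i don't think so" },
      ],
    },
  ],
]);

// Reading one post is a single lookup; nothing has to be joined.
function loadPostDocument(id) {
  return postDocuments.get(id);
}

const joinedPost = loadPostWithJoins(1);
const embeddedPost = loadPostDocument(1);
console.log(joinedPost.comments.length === embeddedPost.comments.length);
```

Both paths yield the same logical post. The embedded form trades some duplication and update flexibility for reads and writes that stay within one document.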
{
  _id : ObjectId("4e2e3f92268cdda473b628f6"),
  title : "Too Big to Fail",
  when : Date("2011-07-26"),
  author : "joe",
  text : "blah",
  tags : ["business", "news", "north america"],
  votes : 3,
  voters : ["dmerr", "sj", "jane"],
  comments : [
    { by : "tim157", text : "great story" },
    { by : "gora", text : "i don't think so" },
    { by : "dmerr", text : "also check out..." }
  ]
}
db.posts.find( { author : "joe" } ).sort( { when : 1 } )
db.posts.find( { tags : "news", votes : { $gt : 100 } } )
db.posts.find( { "comments.by" : "gora" } )
db.posts.ensureIndex( { "comments.by" : 1 } )
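The matching rules behind these queries can be approximated in plain JavaScript. The sketch below is an in-memory emulation written for illustration; it is not MongoDB's implementation, and `valuesAtPath`, `matchesValue`, `findDocs`, and `sortAscending` are hypothetical helpers. It handles only equality, `$gt`, array containment, and dotted paths into arrays of embedded documents. The second and third documents, and the use of plain date strings for `when`, are assumptions added so that each query has something to distinguish.

```javascript
// Sample collection. Only the first document comes from the slide;
// the other two are made up for this example.
const collection = [
  {
    title: "Too Big to Fail", when: "2011-07-26", author: "joe",
    tags: ["business", "news", "north america"], votes: 3,
    comments: [
      { by: "tim157", text: "great story" },
      { by: "gora", text: "i don't think so" },
      { by: "dmerr", text: "also check out..." },
    ],
  },
  {
    title: "Earlier Post", when: "2011-06-01", author: "joe",
    tags: ["news"], votes: 150, comments: [{ by: "sj", text: "nice" }],
  },
  {
    title: "Match Report", when: "2011-07-01", author: "jane",
    tags: ["sports"], votes: 200, comments: [],
  },
];

// Resolve a dotted path such as "comments.by", descending into arrays
// of embedded documents along the way.
function valuesAtPath(doc, path) {
  let current = [doc];
  for (const key of path.split(".")) {
    current = current
      .flatMap((v) => (Array.isArray(v) ? v : [v]))
      .filter((v) => v !== null && typeof v === "object")
      .map((v) => v[key])
      .filter((v) => v !== undefined);
  }
  return current;
}

// Equality against an array field means "the array contains this value".
function matchesValue(fieldValue, expected) {
  return Array.isArray(fieldValue)
    ? fieldValue.includes(expected)
    : fieldValue === expected;
}

// Keep documents for which every clause of the query holds.
function findDocs(docs, query) {
  return docs.filter((doc) =>
    Object.entries(query).every(([path, cond]) => {
      const values = valuesAtPath(doc, path);
      if (cond !== null && typeof cond === "object" && "$gt" in cond) {
        return values.some((v) => v > cond.$gt);
      }
      return values.some((v) => matchesValue(v, cond));
    })
  );
}

// Ascending sort on one field, returning a new array.
function sortAscending(docs, field) {
  return [...docs].sort((a, b) =>
    a[field] < b[field] ? -1 : a[field] > b[field] ? 1 : 0
  );
}

const byJoe = sortAscending(findDocs(collection, { author: "joe" }), "when");
const popularNews = findDocs(collection, { tags: "news", votes: { $gt: 100 } });
const goraCommented = findDocs(collection, { "comments.by": "gora" });
console.log(byJoe.map((d) => d.title), popularNews.length, goraCommented.length);
```

The emulation scans every document on every query. The `ensureIndex` call on the slide exists to avoid exactly that scan in a real database, and it is not modeled here.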
{
  _id : ObjectId("4e2e3f92268cdda473b628f6"),
  title : "Too Big to Fail",
  when : Date("2011-07-26"),
  author : "joe",
  text : "blah",
  tags : ["business", "news", "north america"],
  votes : 3,
  voters : ["dmerr", "sj", "jane"],
  comments : [
    { by : "tim157", text : "great story" },
    { by : "gora", text : "i don't think so" },
    { by : "dmerr", text : "also check out..." }
  ]
}
db.posts.find( { author : "joe" } ).sort( { when : 1 } )
db.posts.find( { "comments.by" : "gora" } )
NoSQL = non-relational, next-generation operational data stores and databases
Benefits:
- scale
- leverage the vast swathes of semi-structured data
- agility, nimbleness