This presentation delves into MongoDB, a popular NoSQL database, outlining its difference from SQL databases, distinguishing characteristics, examples, and usage scenarios. Explore key concepts such as BASE transactions, Brewer's CAP Theorem, and examples like CouchDB and Column Store. Learn about the features of MongoDB, scalability, high performance, and its advantages over traditional SQL databases. Discover the world of NoSQL databases through MongoDB and its practical applications.
MongoDB Introduction, Installation & Execution By Prof. B.A. Khivsara Note: The material used to prepare this presentation has been taken from the internet and is intended only for students' reference, not for commercial use.
Difference Between SQL and NoSQL • SQL Databases • SQL Standard • SQL Characteristics • SQL Database Examples • NoSQL Databases • NoSQL Definition • General Characteristics • NoSQL Database Types • NoSQL Database Examples
SQL Database Examples • Commercial • IBM DB2 • Oracle RDBMS • Microsoft SQL Server • Sybase SQL Anywhere • Open Source (with commercial options) • MySQL • Ingres. Significant portions of the world’s economy use SQL databases!
NoSQL Products/Projects http://www.nosql-database.org/ lists 122 NoSQL Databases • Cassandra • CouchDB • Hadoop & HBase • MongoDB • StupidDB • Etc.
BASE Transactions • Acronym contrived to be the opposite of ACID • Basically Available, • Soft state, • Eventually Consistent • Characteristics • Weak consistency – stale data OK • Availability first • Best effort • Approximate answers OK • Aggressive (optimistic) • Simpler and faster
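The "eventually consistent" and "stale data OK" points can be illustrated with a toy model. This is a minimal sketch, not how any particular NoSQL product replicates data: the `Replica` and `EventuallyConsistentStore` classes and the explicit `sync()` step are assumptions made purely for illustration.

```python
# Toy model of eventual consistency (hypothetical classes, illustration only).
# A write lands on one replica first and reaches the others when sync() runs.

class Replica:
    def __init__(self):
        self.data = {}


class EventuallyConsistentStore:
    def __init__(self, n_replicas=3):
        self.replicas = [Replica() for _ in range(n_replicas)]
        self.pending = []  # writes accepted but not yet propagated

    def write(self, key, value):
        # Availability first: accept the write on replica 0 immediately.
        self.replicas[0].data[key] = value
        self.pending.append((key, value))

    def read(self, key, replica_index):
        # May return stale data (here: None) until sync() has run.
        return self.replicas[replica_index].data.get(key)

    def sync(self):
        # Stand-in for background propagation; afterwards all replicas agree.
        for key, value in self.pending:
            for replica in self.replicas[1:]:
                replica.data[key] = value
        self.pending.clear()


store = EventuallyConsistentStore()
store.write("city", "Nashik")
stale = store.read("city", replica_index=2)  # replica 2 has not seen the write yet
store.sync()
fresh = store.read("city", replica_index=2)  # all replicas now hold the value
```

Between `write()` and `sync()` a reader may see old data, which is the trade-off BASE accepts in exchange for availability.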
Consistency • all nodes see the same data at the same time – Wikipedia • client perceives that a set of operations has occurred all at once – Pritchett • More like Atomic in ACID transaction properties
Availability • node failures do not prevent survivors from continuing to operate – Wikipedia • Every operation must terminate in an intended response – Pritchett
Partition Tolerance • the system continues to operate despite arbitrary message loss – Wikipedia • Operations will complete, even if individual components are unavailable – Pritchett
Other Non-SQL Databases • XML Databases • Graph Databases • CODASYL Databases • Object Oriented Databases • Etc…
Column Store Comments • More efficient than row (or document) store if: • Multiple row/record/documents are inserted at the same time so updates of column blocks can be aggregated • Retrievals access only some of the columns in a row/record/document
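The second condition (retrievals that touch only some columns) can be sketched with plain Python structures. This is an illustration under assumptions, not a real column-store engine: the sample records and the `to_columns` helper are made up, and every record is assumed to have the same fields.

```python
# Row layout: one dict per record (sample data invented for illustration).
rows = [
    {"id": 1, "city": "Nashik", "pin": 423201},
    {"id": 2, "city": "Pune", "pin": 411001},
    {"id": 3, "city": "Mumbai", "pin": 400001},
]


def to_columns(records):
    """Pivot a list of row dicts into one list per column."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


columns = to_columns(rows)

# Column layout: reading one attribute touches a single list.
pins = columns["pin"]

# Row layout: the same query has to visit every record.
pins_from_rows = [record["pin"] for record in rows]
```

Both queries return the same values; the difference is how much data has to be read to produce them.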
NoSQL Examples: Key-Value Store • Hash tables of Keys • Values stored with Keys • Fast access to small data values • Example – Project-Voldemort • http://www.project-voldemort.com/ • LinkedIn • Example – MemcacheDB • http://memcachedb.org/ • Backend storage is Berkeley-DB
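The key-value idea can be sketched in a few lines, since a Python `dict` is itself a hash table. The `KeyValueStore` class and its `put`/`get`/`delete` methods below are hypothetical names chosen for illustration; they are not the API of Project Voldemort or MemcacheDB.

```python
class KeyValueStore:
    """In-memory key-value store backed by a hash table (illustrative only)."""

    def __init__(self):
        self._table = {}  # Python's dict is a hash table

    def put(self, key, value):
        # The value is stored as an opaque blob next to its key.
        self._table[key] = value

    def get(self, key, default=None):
        # Lookup goes straight to the key; no query language is involved.
        return self._table.get(key, default)

    def delete(self, key):
        self._table.pop(key, None)


kv = KeyValueStore()
kv.put("session:42", b"user=ramesh")
value = kv.get("session:42")
kv.delete("session:42")
missing = kv.get("session:42")
```

The store never inspects the value, which is why access to small values by key is fast but queries on the value's contents are not supported.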
Map Reduce • Technique for indexing and searching large data volumes • Two Phases, Map and Reduce • Map • Extract sets of Key-Value pairs from underlying data • Potentially in Parallel on multiple machines • Reduce • Merge and sort sets of Key-Value pairs • Results may be useful for other searches
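The two phases can be sketched with the classic word-count example. This is a single-process illustration, not a distributed implementation: the function names `map_phase` and `reduce_phase` and the sample documents are assumptions.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) key-value pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def reduce_phase(pairs):
    """Reduce: merge pairs that share a key, sum their counts, sort by key."""
    totals = defaultdict(int)
    for key, count in pairs:
        totals[key] += count
    return dict(sorted(totals.items()))


documents = ["NoSQL stores documents", "SQL stores rows", "NoSQL scales"]

# Each map_phase call is independent of the others, so in a real system
# the calls could run in parallel on multiple machines.
intermediate = [pair for doc in documents for pair in map_phase(doc)]

word_counts = reduce_phase(intermediate)
```

The resulting dictionary maps each word to its total count across all documents and could itself serve as an index for later searches.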
Map Reduce Patent Google granted US Patent 7,650,331, January 2010 System and method for efficient large-scale data processing A large-scale data processing system and method includes one or more application-independent map modules configured to read input data and to apply at least one application-specific map operation to the input data to produce intermediate data values, wherein the map operation is automatically parallelized across multiple processors in the parallel processing environment. A plurality of intermediate data structures are used to store the intermediate data values. One or more application-independent reduce modules are configured to retrieve the intermediate data values and to apply at least one application-specific reduce operation to the intermediate data values to provide output data.
CouchDB JSON Example { "_id": "guid goes here", "_rev": "314159", "type": "abstract", "author": "Keith W. Hare", "title": "SQL Standard and NoSQL Databases", "body": "NoSQL databases (either no-SQL or Not Only SQL) are currently a hot topic in some parts of computing.", "creation_timestamp": "2011/05/10 13:30:00 +0004" }
CouchDB JSON Tags • "_id" • GUID – Global Unique Identifier • Passed in or generated by CouchDB • "_rev" • Revision number • Versioning mechanism • "type", "author", "title", etc. • Arbitrary tags • Schema-less • Could be validated after the fact by user-written routine
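The last point, validating a schema-less document after the fact, can be sketched as a small user-written routine. This is not CouchDB's own validation mechanism: the `REQUIRED_FIELDS` rules and the `validate` function are hypothetical, and the document is a shortened copy of the example above.

```python
import json

# Hypothetical rules: the field names and types this application expects.
REQUIRED_FIELDS = {"_id": str, "type": str, "author": str, "title": str}


def validate(document):
    """Return a list of problems; an empty list means the document passes."""
    problems = []
    for field, expected_type in REQUIRED_FIELDS.items():
        if field not in document:
            problems.append(f"missing field: {field}")
        elif not isinstance(document[field], expected_type):
            problems.append(f"wrong type for field: {field}")
    return problems


raw = (
    '{"_id": "guid goes here", "_rev": "314159", "type": "abstract", '
    '"author": "Keith W. Hare", "title": "SQL Standard and NoSQL Databases"}'
)
doc = json.loads(raw)

ok_problems = validate(doc)                                # well-formed document
bad_problems = validate({"_id": 7, "type": "abstract"})    # malformed document
```

The database itself accepts both documents; it is the application-level routine that decides which ones are acceptable.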
What is MongoDB? • Scalable, High-Performance, Open-source, Document-oriented database. • Built for Speed. • Rich Document-based queries for Easy readability. • Full Index Support for High Performance. • Replication and Failover for High Availability. • Auto Sharding for Easy Scalability. • Map / Reduce for Aggregation.
Why use MongoDB? • SQL was invented in the 1970s to store data. • MongoDB stores documents (or objects). • Nowadays, everyone works with objects (Python/Ruby/Java/etc.), and we need databases to persist our objects. Then why not store objects directly? • Embedded documents and arrays reduce the need for joins. • No joins and no multi-document transactions.
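The claim that embedded documents reduce the need for joins can be sketched with plain Python dictionaries, without any database. The two-table layout, the `join_city_postman` helper, and the field names are assumptions for illustration; the sample values are borrowed from the deck's BSON example.

```python
# Relational style: two "tables" linked by a key and joined at read time.
cities = [{"pin": 423201, "city": "Nashik", "postman_id": 1}]
postmen = [{"id": 1, "name": "Ramesh Jadhav", "address": "Panchavati"}]


def join_city_postman(pin):
    """Look up a city, then follow its foreign key into the second table."""
    city = next(c for c in cities if c["pin"] == pin)
    postman = next(p for p in postmen if p["id"] == city["postman_id"])
    return city["city"], postman["name"]


# Document style: the related record is embedded, so one lookup is enough.
city_doc = {
    "city": "Nashik",
    "pin": 423201,
    "postman": {"name": "Ramesh Jadhav", "address": "Panchavati"},
}

joined = join_city_postman(423201)
embedded = (city_doc["city"], city_doc["postman"]["name"])
```

Both approaches yield the same pair of values, but the document form needs no second lookup.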
What is MongoDB great for? • RDBMS replacement for Web Applications. • Semi-structured Content Management. • Real-time Analytics & High-Speed Logging. • Caching and High Scalability. Web 2.0, Media, SaaS, Gaming, Healthcare, Finance, Telecom, Government
Not great for? • Highly Transactional Applications. • Problems requiring SQL. Some Companies using MongoDB in Production
BSON Example { "_id" : "37010", "City" : "Nashik", "Pin" : 423201, "state" : "MH", "Postman" : { "name" : "Ramesh Jadhav", "address" : "Panchavati" } }
Data Types • String : This is the most commonly used datatype for storing data. Strings in MongoDB must be valid UTF-8. • Integer : This type is used to store a numerical value. An integer can be 32 bit or 64 bit depending upon your server. • Boolean : This type is used to store a boolean (true/false) value. • Double : This type is used to store floating point values. • Min/Max keys : This type is used to compare a value against the lowest and highest BSON elements. • Arrays : This type is used to store arrays, lists, or multiple values under one key. • Timestamp : This type is used to store a timestamp. This can be handy for recording when a document has been modified or added. • Object : This datatype is used for embedded documents.
Data Types • Null : This type is used to store a Null value. • Symbol : This datatype is used identically to a string; however, it is generally reserved for languages that use a specific symbol type. • Date : This datatype is used to store the current date or time in UNIX time format. You can specify your own date and time by creating a Date object and passing the day, month, and year into it. • Object ID : This datatype is used to store the document’s ID. • Binary data : This datatype is used to store binary data. • Code : This datatype is used to store JavaScript code in a document. • Regular expression : This datatype is used to store a regular expression.
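To relate the type names above to values a program actually handles, the following sketch maps Python values to the names used on these slides. The mapping is illustrative only: `bson_type_name` is a hypothetical helper, not part of any MongoDB driver, and it covers only the types that have an obvious standard-library counterpart.

```python
import datetime
import re

_PATTERN_TYPE = type(re.compile(""))  # type of a compiled regular expression


def bson_type_name(value):
    """Return the slide's type name for a Python value (illustrative mapping)."""
    if value is None:
        return "Null"
    # bool must be tested before int: in Python, bool is a subclass of int.
    if isinstance(value, bool):
        return "Boolean"
    if isinstance(value, int):
        return "Integer"
    if isinstance(value, float):
        return "Double"
    if isinstance(value, str):
        return "String"
    if isinstance(value, (bytes, bytearray)):
        return "Binary data"
    if isinstance(value, datetime.datetime):
        return "Date"
    if isinstance(value, _PATTERN_TYPE):
        return "Regular expression"
    if isinstance(value, (list, tuple)):
        return "Arrays"
    if isinstance(value, dict):
        return "Object"
    raise TypeError(f"no mapping for {type(value).__name__}")


# The document from the BSON example slide, written as a Python dict.
doc = {
    "_id": "37010",
    "City": "Nashik",
    "Pin": 423201,
    "state": "MH",
    "Postman": {"name": "Ramesh Jadhav", "address": "Panchavati"},
}
field_types = {field: bson_type_name(value) for field, value in doc.items()}
```

In this document `Pin` is an Integer, `Postman` is an embedded Object, and the remaining fields are Strings.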
Installation in Windows • Download MongoDB from the website: https://www.mongodb.org/downloads • Select the Windows option • Download and run
Installation in Ubuntu • Download MongoDB from the website: https://www.mongodb.org/downloads • Select the Linux option • Download and run