Oracle NoSQL Database and Big Data
Chao Huang, Senior Manager, Oracle NoSQL Database Development
Relational Era
• Relational databases replace IMS for ‘business data processing’
• Relational databases become the standard for OLTP
• SQL is better
• Query optimizers beat programmers
• 1970s: OLTP; relational model introduced by Edgar F. Codd
Big Data Era - Volume
• What’s wrong with sharding
  • Unhappy developers
    • More code to write
  • Unhappy DBAs
    • Complicates backup, schema changes, indexing, hotspots
  • Unhappy users
    • No cross-shard applications
Big Data Era - Variety
• What’s wrong with my data
  • More and more semi-structured data sources
  • Consumers are talking and so are sensors
  • Schema-less is the critical need for agility
Big Data Era - Simplicity
• Simple data models: key-value (K-V)
  • User profiles
  • Real-time ads
  • Logs, sensors
Trends Driving NoSQL Technology
• Why NoSQL
  • Horizontal scale
    • Data volume requires scaling across multiple machines
  • Leading technology giants have embraced NoSQL
    • Google, Amazon, Facebook et al.
  • New kinds of data
    • More semi-structured and unstructured data sources
  • New speeds of data
    • Real-time customer experience, real-time sensor feedback
  • Cost sensitivity
    • Scale is not a yes/no question. Scale is an ROI question.
What is Oracle NoSQL Database
• Scalable, HA, key-value
  • Simple key-value data model
  • Horizontally scalable
  • Highly available
  • Simple administration
  • ACID transactions at scale
  • Transparent load balancing
  • Elastic configuration
  • Commercial-grade software and support
Performance
• YCSB
  • 1.25M ops/sec
  • 2 billion records
  • 2 TB of data
  • 95% read, 5% update
• Low latency
• High scalability
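The benchmark figures above can be read as a back-of-the-envelope calculation. The sketch below assumes decimal terabytes (1 TB = 10^12 bytes) and that the 95/5 read/update mix applies to the full 1.25M ops/sec; neither assumption is stated on the slide, and the function name is purely illustrative.

```python
def workload_breakdown(ops_per_sec, records, data_bytes, read_percent):
    """Split total throughput into reads and updates and estimate record size."""
    reads = ops_per_sec * read_percent // 100   # integer arithmetic avoids float rounding
    updates = ops_per_sec - reads
    avg_record_bytes = data_bytes // records
    return reads, updates, avg_record_bytes


# Figures from the slide; 2 TB taken as 2 * 10**12 bytes (assumption).
reads, updates, avg_bytes = workload_breakdown(
    ops_per_sec=1_250_000,
    records=2_000_000_000,
    data_bytes=2 * 10**12,
    read_percent=95,
)
print(reads, updates, avg_bytes)
```

Under these assumptions the workload is roughly 1.19M reads/sec and 62.5K updates/sec over records averaging about 1 KB each.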
Use Case – Web Scale Transactions
• NoSQL DB for high-velocity, high-volume, high-variety, low-information-density ACID transactional data capture
• Hadoop and Data Warehouse for analytics
• Applications: Web browsing, Web retail, CDR processing, Sensor data capture
[Diagram labels: Application, NoSQL DB Driver, Oracle NoSQL DB, Hadoop Cluster, Enterprise Data Warehouse; stages: Acquire, Organize, Analyze]
Use Case – Last Mile Content Delivery
• Platform for real-time content delivery
• Content and market segmentation acquired and analyzed in Hadoop and RDBMS
• NoSQL provides low-latency content lookup and delivery to end customers
• Applications: Advertising, Marketing segmentation, Location-aware content delivery
[Diagram labels: End Customers, Business Users, Application, NoSQL DB Driver, Oracle NoSQL DB, Web Click-stream, Hadoop Cluster, Multi-Dimensional Reporting, Content Delivery; stages: Acquire, Analyze, Prepare]
Use Case – Real Time Event Processing
• Real-time events trigger rule execution in Oracle Event Processor (OEP)
• OEP rules perform low-latency lookups to Oracle NoSQL DB for additional data
• Applications: Utilities, Medical Monitoring, Factory Automation, Oil & Gas Exploration, Geo-location Tracking
[Diagram labels: Event Stream, Oracle Event Processor (Rule, Action), Low Latency Lookups, NoSQL DB Driver, Oracle NoSQL DB]
NoSQL for Social Online Betting
Challenge / Objectives
• Scalable in-play sports betting platform
• Increase new business revenue
• Improve operational efficiency
Solution: Real-Time, In-Play Sports Betting
• Match in-play bets with incoming events
• Promote interaction between customers
• Scale system with customers and events
• Feeds MySQL database for revenue tracking and operational reporting
“Oracle NoSQL Database enabled the rapid, scalable processing of incoming XML, ensuring high availability and guaranteed event ordering.” James Anthony, Chief Technology Officer, Passoker
[Diagram labels: XML Providers, App, Customers, Event Capture & Store (NoSQL DB), Accounting & Operations (MySQL)]
NoSQL for Scalable PaaS
Benefits
• Low-latency application object persistence
• Flexible data format and serialization techniques
• Highly reliable data store that can scale as the number of hosted apps and app objects grows
Challenge
• Provide special-purpose application server services to financial institutions
• Provide cost-competitive subscriptions
Solution
• Oracle NoSQL Database for unstructured data capture and application object persistence
• Oracle Database for business analytics and insight into the data collected
[Diagram labels: Application, NoSQL DB Driver]
NoSQL for Oracle Communications Mgmt
Benefits
• Extend Coherence data caching to disk
• Manage growth in data volume, 400M customers
• Handle extreme TXN volumes with low latency
• Always on and highly available
Challenge
• Improve billing and revenue management
• Calculate charges for any service combo
• Provide scalable CDR processing
Solution
• Coherence cluster for real-time event rating
• NoSQL database for rated event persistence and consumption by downstream systems
• Coherence memory optimization using NoSQL database for out-of-band data
[Diagram labels: Application, Data Ingestion, NoSQL DB Driver, OCM Rated Events Processor]
Compatibility
• Oracle NoSQL
  • Supported OS: Linux and Solaris 10
  • Oracle NoSQL DB is written in Java
  • Java and C APIs
  • Bindings available for Jython, JRuby, Clojure, Groovy, Rhino
Features - Sharding
• Auto-sharding
  • Hash function maps a key to a partition
  • Each partition is routed to a single shard
  • Intelligent driver: aware of hash function and shard location
  • No need to develop sharding logic in the application
  • Provides linear scale-out of write ops/sec
[Diagram: Hash Fn maps keys to Partitions; partitions map to Shard 0 … Shard 11; each shard shown with one master Rep Node and two replica Rep Nodes]
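The two-step routing described above (key to partition via a hash function, partition to a single shard) can be sketched as follows. This is an illustrative sketch only, not the Oracle NoSQL driver: the MD5 hash, the partition count of 120, the modulo placement, and all function names are assumptions. Only the shard count of 12 follows the diagram labels (Shard 0 … Shard 11).

```python
import hashlib

N_PARTITIONS = 120  # hypothetical; the slide does not give a partition count
N_SHARDS = 12       # the diagram labels run from Shard 0 to Shard 11


def partition_for(key, n_partitions=N_PARTITIONS):
    """Hash a key to a partition id (MD5 is an assumption, chosen for stability)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % n_partitions


def build_partition_map(n_partitions=N_PARTITIONS, n_shards=N_SHARDS):
    """Assign every partition to exactly one shard (simple modulo placement)."""
    return {p: p % n_shards for p in range(n_partitions)}


def shard_for(key, partition_map):
    """What a shard-aware driver would compute before sending a request."""
    return partition_map[partition_for(key, len(partition_map))]


partition_map = build_partition_map()
print(shard_for("user/42", partition_map))
```

Keeping the key-to-partition hash fixed and changing only the partition-to-shard map is one way a store could rebalance without rehashing keys; whether Oracle NoSQL does exactly this is not stated on the slide.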
Features - Replication
• Master-slave replication
• Supports heterogeneous platforms: hardware/OS/JVM
• Provides linear scale-out for read ops/sec
• High availability
[Diagram: Shard 0 with one master Rep Node and two replica Rep Nodes]
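One way to picture why replication scales reads but not writes: every write goes to the shard's single master, while reads can be spread across all nodes in the shard. The sketch below is a toy model under that assumption; the `Shard` class, the round-robin policy, and the node names are hypothetical and do not describe the actual driver's routing.

```python
import itertools


class Shard:
    """Toy shard: one master plus replicas, with round-robin read routing."""

    def __init__(self, master, replicas):
        self.master = master
        self.replicas = list(replicas)
        # Reads rotate over every node; a real driver may also weigh
        # consistency requirements and node load (not modeled here).
        self._read_cycle = itertools.cycle([self.master] + self.replicas)

    def node_for_write(self):
        return self.master

    def node_for_read(self):
        return next(self._read_cycle)


shard0 = Shard("rn1-master", ["rn2-replica", "rn3-replica"])
print([shard0.node_for_read() for _ in range(3)])
```

With three nodes per shard, each node serves about a third of the reads in this model, so adding replicas adds read capacity while write capacity stays tied to the master.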
Features – Configurable CAP
• Greater flexibility
  • Configurable durability per operation
  • Configurable consistency per operation
• ACID by default
  • Transaction scope is a single API call
  • Records share the same major key
  • Multiple operations supported
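The transaction scope described above (one API call, several operations, all records under the same major key) can be illustrated with a small in-memory model. This is a sketch under stated assumptions, not the Oracle NoSQL API: `ToyStore`, its `execute` method, and the tuple-based operation format are hypothetical names invented for the example.

```python
class ToyStore:
    """In-memory model of all-or-nothing batches scoped to one major key."""

    def __init__(self):
        self._data = {}  # (major_key, minor_key) -> value

    def execute(self, ops):
        """Apply a batch atomically.

        ops: list of ("put", major, minor, value) or ("delete", major, minor).
        """
        majors = {op[1] for op in ops}
        if len(majors) != 1:
            raise ValueError("all operations must share one major key")
        staged = dict(self._data)  # work on a copy so failures leave no trace
        for op in ops:
            kind, major, minor = op[0], op[1], op[2]
            if kind == "put":
                staged[(major, minor)] = op[3]
            elif kind == "delete":
                if (major, minor) not in staged:
                    raise KeyError((major, minor))
                del staged[(major, minor)]
            else:
                raise ValueError("unknown operation: " + str(kind))
        self._data = staged  # commit only after every operation succeeded

    def get(self, major, minor):
        return self._data.get((major, minor))


store = ToyStore()
store.execute([
    ("put", "user/42", "name", "Ada"),
    ("put", "user/42", "email", "ada@example.com"),
])
print(store.get("user/42", "name"))
```

Because records sharing a major key live together in this model, the batch never needs to coordinate across shards, which is one plausible reason for limiting the scope this way.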
Features - Failover
• Automatic failover
  • Automatic election of new master
  • Rejoining nodes automatically synchronize with the master
  • Isolated nodes can still service reads
  • All nodes are symmetric
[Diagram: replication factor = 5; one master Rep Node and four replica Rep Nodes, with one replica elected as the new master]
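A minimal sketch of the failover idea above, assuming a majority-quorum election in which the reachable node with the most up-to-date log wins. The slide does not describe the election protocol, so the quorum rule, the tie-break by node name, and the `elect_master` function are all assumptions made for illustration.

```python
def elect_master(reachable, replication_factor):
    """Pick a new master from reachable nodes, or None without a majority.

    reachable: dict mapping node name -> last applied log sequence number.
    """
    quorum = replication_factor // 2 + 1
    if len(reachable) < quorum:
        # Minority side of a partition: no master is elected, although such
        # nodes could still answer reads at relaxed consistency.
        return None
    # Highest log position wins; ties go to the alphabetically first name.
    return max(sorted(reachable), key=lambda name: reachable[name])


# Replication factor 5, as in the diagram: the master fails, four nodes remain.
survivors = {"rn2": 1040, "rn3": 1055, "rn4": 1055, "rn5": 990}
print(elect_master(survivors, replication_factor=5))
```

With only two of five nodes reachable the function returns `None`, which matches the slide's point that isolated nodes can still service reads but should not accept writes as a master.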
Enterprise Integration – Extensions
• Integration with Oracle Database: External Tables
• RDF Graph for NoSQL: support for SPARQL
• Event processing with Oracle Event Processor
• Integration with Hadoop
Why Oracle NoSQL Database
• It’s Oracle
  • A trusted vendor here for the long term
• Scalable, available, with predictable latency
• Differentiating features
  • Always-on elastic processing
  • Configurable ACID transactions at scale
  • Easy-to-use smart data center topologies
  • Integration with the Oracle technology stack