BigData - NoSQL Hadoop - Couchbase. Tugdual “Tug” Grall. Technical Evangelist. email: tug@couchbase.com. twitter: @tgrall. About me. Web @tgrall http://blog.grallandco.com tgrall NantesJUG co-founder Pet Project : http://www.resultri.com. Tugdual “Tug” Grall Couchbase
BigData - NoSQL Hadoop - Couchbase • Tugdual “Tug” Grall • Technical Evangelist • email: tug@couchbase.com • twitter: @tgrall
About me • Tugdual “Tug” Grall • Web: @tgrall, http://blog.grallandco.com, tgrall • NantesJUG co-founder • Pet Project: http://www.resultri.com • Couchbase: Technical Evangelist • eXo: CTO • Oracle: Developer/Product Manager, mainly Java/SOA • Developer in consulting firms
$30B Database Market Being Disrupted • 2012: Relational Technology 95%, Other • 2027: Relational Technology <50%?, NoSQL Technology • All new database growth will be NoSQL
Operational vs. Analytic Databases • NoSQL real-time, interactive databases: fast access to data (Couchbase, MongoDB, Cassandra, HBase) • Analytic databases: get insights from data (Cloudera, Hortonworks, MapR)
What Is Biggest Data Management Problem Driving Use of NoSQL in Coming Year? • Lack of flexibility/rigid schemas: 49% • Inability to scale out data: 35% • Performance challenges: 29% • Cost: 16% • All of these: 12% • Other: 11% • Source: Couchbase Survey, December 2011, n = 1351.
What is Sqoop? Sqoop is a tool designed to transfer data between Hadoop and relational databases. You can use Sqoop to import data from a relational database management system (RDBMS) such as MySQL or Oracle into the Hadoop Distributed File System (HDFS), transform the data in Hadoop MapReduce, and then export the data back into an RDBMS. sqoop.apache.org
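The import and export directions described above map onto the `sqoop import` and `sqoop export` commands. The following is a minimal sketch, not part of the original slides, that only builds the command lines in Python without running them. The JDBC URL, table names, user name, and HDFS paths are hypothetical placeholders; the flags shown (`--connect`, `--username`, `-P`, `--table`, `--target-dir`, `--export-dir`, `--num-mappers`) are standard Sqoop 1 options.

```python
import shlex


def sqoop_import_args(jdbc_url, table, target_dir, username, num_mappers=4):
    """Build the argument list for copying one RDBMS table into HDFS."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,          # JDBC URL of the source database
        "--username", username,
        "-P",                           # prompt for the password at run time
        "--table", table,               # source table to read
        "--target-dir", target_dir,     # HDFS directory to write into
        "--num-mappers", str(num_mappers),  # number of parallel map tasks
    ]


def sqoop_export_args(jdbc_url, table, export_dir, username):
    """Build the argument list for writing HDFS files back into an RDBMS table."""
    return [
        "sqoop", "export",
        "--connect", jdbc_url,
        "--username", username,
        "-P",
        "--table", table,               # destination table (must already exist)
        "--export-dir", export_dir,     # HDFS directory holding the data to export
    ]


def as_command_line(args):
    """Render an argument list as a shell-safe command string."""
    return " ".join(shlex.quote(a) for a in args)


# Hypothetical values, for illustration only.
URL = "jdbc:mysql://db.example.com/shop"
import_cmd = sqoop_import_args(URL, "orders", "/data/orders", "etl_user")
export_cmd = sqoop_export_args(URL, "order_totals", "/data/order_totals", "etl_user")
print(as_command_line(import_cmd))
print(as_command_line(export_cmd))
```

Passing the list to `subprocess.run` on a machine with Sqoop installed would execute the transfer; keeping the arguments as a list avoids shell quoting problems.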
What is Sqoop? Traditional ETL (diagram: Data, a single transform step T, Data, Application)
What is Sqoop? A different paradigm (diagram: Application, Data, Data)
What is Sqoop? A very scalable different paradigm (diagram: several Application and Data pairs, plus Data)
What is Sqoop? Where did the Transform go? (diagram: Application, Data, and many parallel transform steps T)
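In the Sqoop description earlier in the deck, the transform step runs inside Hadoop MapReduce after the data has been loaded, which is one way to read the many parallel "T" steps. The sketch below is an illustration in plain Python, not the Hadoop API: it mimics the map, shuffle, and reduce phases on a handful of rows. The field names `country` and `amount` and the sample rows are hypothetical.

```python
from collections import defaultdict


def map_row(row):
    """Map phase: clean one imported row and emit (key, value) pairs."""
    country = row["country"].strip().upper()  # normalise the grouping key
    yield country, row["amount"]


def reduce_key(key, values):
    """Reduce phase: combine all values that share a key."""
    return key, sum(values)


def run_job(rows):
    """Run map, shuffle (group by key), and reduce over the rows in one process."""
    groups = defaultdict(list)
    for row in rows:
        for key, value in map_row(row):
            groups[key].append(value)
    return dict(reduce_key(key, values) for key, values in groups.items())


# Hypothetical rows, as they might look after an import from an RDBMS.
rows = [
    {"country": "fr ", "amount": 10},
    {"country": "US", "amount": 5},
    {"country": "FR", "amount": 7},
]
totals = run_job(rows)
print(totals)
```

On a real cluster each map call runs next to the data block it reads, so the transform scales with the number of nodes instead of sitting in one ETL server.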
Sqoop Details • Default connection is via JDBC • Lots of custom connectors: Couchbase, VoltDB, Vertica, Teradata, Netezza, Oracle, MySQL, Postgres
Ad and offer targeting • 40 milliseconds to respond with the decision. (diagram with three numbered steps; labels: events • profiles, campaigns • profiles, real time campaign statistics)
Couchbase Server Core Principles • Easy Scalability: Grow cluster without application changes, without downtime, with a single click • Consistent High Performance: Consistent sub-millisecond read and write response times with consistent high throughput • Always On 24x365: No downtime for software upgrades, hardware maintenance, etc. • Flexible Data Model: JSON document model with no fixed schema.
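To make "JSON document model with no fixed schema" concrete, here is a small sketch that uses a plain Python dict as a stand-in for a key-value bucket. It does not use the Couchbase SDK; the `upsert` and `get` helpers, the key format, and the documents are all hypothetical. The point is that two documents of the same type can carry different fields, and a field can be added later without any schema migration.

```python
import json

# Stand-in for a document bucket: key -> JSON text. Not a real Couchbase client.
bucket = {}


def upsert(key, doc):
    """Store a document as JSON text under the given key."""
    bucket[key] = json.dumps(doc)


def get(key):
    """Load a document back into a Python dict."""
    return json.loads(bucket[key])


# Two "user" documents with different shapes can live side by side.
upsert("user::1", {"type": "user", "name": "Ann", "twitter": "@ann"})
upsert("user::2", {"type": "user", "name": "Bob", "interests": ["hadoop", "nosql"]})

# Adding a field later touches only the one document that needs it.
doc = get("user::1")
doc["email"] = "ann@example.com"
upsert("user::1", doc)

print(get("user::1"))
print(get("user::2"))
```

In a relational table the new `email` field would need an `ALTER TABLE` affecting every row; here the application simply reads whichever fields a document has.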