Apache Hive What to Expect in the Next Release Carl Steinbach • Real-Time Big Data Meetup, March 2013
Speaker Bio: Carl Steinbach • Currently: • Engineer @ Citus Data • PMC Chair, Committer -- Apache Hive Project • Formerly: • Cloudera, Informatica, NetApp, Oracle • Contact: • Twitter: @cwsteinbach • LinkedIn: carlsteinbach
What is Apache Hive? • SQL to MapReduce • (OLAP, not OLTP) • MetaStore • Format Handlers
What’s New? HiveServer2 - Committed earlier today…
What’s New? HCatalog - Is Merging into Hive…
What’s New? Columnar Formats - Optimized Row Columnar Format (ORC) - Parquet
What’s New? • Analytic SQL • Work in progress on feature branch • HIVE-896
What’s New? Better Query Plans HIVE-3784, HIVE-2340, HIVE-3952, HIVE-3562, HIVE-3972, HIVE-3841, HIVE-948, HIVE-2340, HIVE-3891, …
What’s New? Smarter Query Compiler MapJoin hint inferred automatically in most cases (HIVE-3784, HIVE-3403)
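To make the MapJoin idea concrete, here is a minimal sketch in Python rather than Hive's actual implementation. A map join loads the small table into an in-memory hash table and streams the large table past it, so no shuffle is needed. The function names and the size threshold below are hypothetical, chosen only to illustrate how a compiler could pick the strategy without a hint.

```python
from collections import defaultdict

# Hypothetical cutoff for illustration; not taken from the slides.
SMALL_TABLE_BYTES = 25_000_000


def pick_join_strategy(left_bytes, right_bytes, threshold=SMALL_TABLE_BYTES):
    """Choose a map join when the smaller input fits under the threshold."""
    if min(left_bytes, right_bytes) <= threshold:
        return "map_join"
    return "shuffle_join"


def map_join(small_rows, large_rows, small_key, large_key):
    """Inner join: hash the small side in memory, then stream the large side."""
    table = defaultdict(list)
    for row in small_rows:
        table[row[small_key]].append(row)
    for row in large_rows:
        # .get avoids creating empty entries for keys with no match.
        for match in table.get(row[large_key], ()):
            yield {**match, **row}


dims = [{"id": 1, "name": "a"}, {"id": 2, "name": "b"}]
facts = [{"dim_id": 1, "v": 10}, {"dim_id": 3, "v": 30}, {"dim_id": 1, "v": 11}]
joined = list(map_join(dims, facts, "id", "dim_id"))
```

In this sketch only the small side is ever held in memory, which is why the size estimate drives the choice of strategy.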
What’s on the Horizon? New Runtime Framework Apache Tez…
What’s on the Horizon? Vectorized Query Execution
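As a rough illustration of what vectorized execution means, the Python sketch below contrasts row-at-a-time evaluation with batch-at-a-time evaluation over column arrays. It is an assumption-laden toy, not Hive code: the function names, the column names `price` and `qty`, and the batch size are all hypothetical.

```python
def filter_sum_row_at_a_time(rows, threshold):
    """Evaluate the filter and the aggregate once per row."""
    total = 0
    for row in rows:
        if row["price"] > threshold:
            total += row["qty"]
    return total


def filter_sum_vectorized(price_col, qty_col, threshold, batch_size=1024):
    """Evaluate the filter over a whole batch of column values, then aggregate."""
    total = 0
    for start in range(0, len(price_col), batch_size):
        prices = price_col[start:start + batch_size]
        qtys = qty_col[start:start + batch_size]
        # Selection vector: positions within the batch that pass the filter.
        selected = [i for i, p in enumerate(prices) if p > threshold]
        total += sum(qtys[i] for i in selected)
    return total


price_col = [5, 20, 7, 30, 15]
qty_col = [1, 2, 3, 4, 5]
rows = [{"price": p, "qty": q} for p, q in zip(price_col, qty_col)]
```

Both functions compute the same result; the batch version amortizes per-row overhead across many values and works on tightly packed columns, which is the property a vectorized engine exploits.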
Real-time SQL on Hadoop CitusDB, Impala, Apache Drill, … What matters: Data Locality Block aware query planner
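One way to picture a block-aware planner is as a scheduler that places each task on a node already holding a replica of the block it reads. The Python sketch below is a simplified assumption, not the planner of any system named above; `assign_tasks` and the block and node names are hypothetical.

```python
def assign_tasks(block_locations):
    """Assign each block to the least-loaded node that holds a replica of it.

    block_locations maps a block id to the list of nodes storing that block.
    """
    load = {}
    assignment = {}
    for block, hosts in block_locations.items():
        # Prefer the replica holder with the fewest tasks; break ties by name.
        host = min(hosts, key=lambda h: (load.get(h, 0), h))
        assignment[block] = host
        load[host] = load.get(host, 0) + 1
    return assignment


blocks = {"b1": ["n1", "n2"], "b2": ["n1", "n3"], "b3": ["n1"]}
plan = assign_tasks(blocks)
```

Every task in the resulting plan reads a local block, so no data has to cross the network before the query starts scanning.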
Monthly Hive Meetups in the Bay Area Hive User Group Meetup Hive Contributors Group Meetup
We’re Hiring • citusdata.com/job