1 / 14

Real-Time Big Data Meetup , March 2013

Apache Hive What to Expect in the Next Release Carl Steinbach. Real-Time Big Data Meetup , March 2013. Speaker Bio: Carl Steinbach. Currently: Engineer @ Citus Data PMC Chair, Committer -- Apache Hive Project Formerly: Cloudera, Informatica, NetApp, Oracle

Télécharger la présentation

Real-Time Big Data Meetup , March 2013

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Apache Hive What to Expect in the Next Release Carl Steinbach • Real-Time Big Data Meetup, March 2013

  2. Speaker Bio: Carl Steinbach • Currently: • Engineer @ Citus Data • PMC Chair, Committer -- Apache Hive Project • Formerly: • Cloudera, Informatica, NetApp, Oracle • Contact: • Twitter: @cwsteinbach • LinkedIn: carlsteinbach

  3. What is Apache Hive? • SQL to MapReduce • (OLAP, not OLTP) • MetaStore • Format Handlers

  4. What’s New? HiveServer2 - Committed earlier today…

  5. What’s New? HCatalog - Is Merging into Hive…

  6. What’s New? Columnar Formats - Optimized Row Columnar Format (ORC) - Parquet

  7. What’s New? • Analytic SQL • Work in progress on feature branch • HIVE-896

  8. What’s New? Better Query Plans HIVE-3784, HIVE-2340, HIVE-3952, HIVE-HIVE-3562, HIVE-3972, HIVE-3841, HIVE-948, HIVE-2340, HIVE-3891, …

  9. What’s New? Smarter Query Compiler MapJoin hint inferred automatically in most cases (HIVE-3784, HIVE-3403)

  10. What’s on the Horizon? New Runtime Framework Apache Tez…

  11. What’s on the Horizon? Vectorized Query Execution

  12. Real-time SQL on Hadoop CitusDB, Impala, Apache Drill, … What matters: Data Locality Block aware query planner

  13. Monthly Hive Meetups in the Bay Area Hive User Group Meetup Hive Contributors Group Meetup

  14. We’re Hiring • citusdata.com/job

More Related