
BREAKOUT SESSION - BIGBENCH

3rd Workshop on Big Data Benchmarking, July 16-17, Xi'an, China.

kasia




Presentation Transcript


  1. BREAKOUT SESSION - BIGBENCH. 3rd Workshop on Big Data Benchmarking, July 16-17, Xi'an, China

  2. BIGBENCH – FURTHER DEVELOPMENT (1)
     • Late Binding needs to be addressed in BigBench
     • Pre- or Post-queries
     • Workload has to deal with missing values
     • Possibly start with Weblogs
     • Add columns to tables dynamically
     • Scaling factor needs to be proven for data generation rate and query result size
     • Data model specific:
       • Integration of media resources considered, but excluded
       • Localization (WGS84) aspect for Customer (potentially for reviews; considered of minor importance since the postal code is available)
     Late Binding ::= the schema information is evaluated at runtime.
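The late-binding idea on this slide can be illustrated with a minimal Python sketch. It assumes a hypothetical `key=value` weblog format and hypothetical helper names (`parse_weblog_line`, `bind_schema`, `to_rows`); none of these come from BigBench itself. The column list is derived only when the data is read, so new columns appear dynamically and missing values surface as `None`.

```python
# Hypothetical weblog format: whitespace-separated key=value tokens.
# Nothing here is part of BigBench; it only illustrates late binding.

def parse_weblog_line(line: str) -> dict[str, str]:
    """Turn one log line into a record without consulting a fixed schema."""
    record: dict[str, str] = {}
    for token in line.split():
        key, sep, value = token.partition("=")
        if sep:  # ignore tokens that are not key=value pairs
            record[key] = value
    return record


def bind_schema(records: list[dict[str, str]]) -> list[str]:
    """Derive the column list at runtime: union of keys in first-seen order."""
    columns: list[str] = []
    for record in records:
        for key in record:
            if key not in columns:
                columns.append(key)
    return columns


def to_rows(records: list[dict[str, str]], columns: list[str]) -> list[list]:
    """Project records onto the bound columns; missing values become None."""
    return [[record.get(column) for column in columns] for record in records]


lines = [
    "user=17 page=/home",
    "user=42 page=/product/7 referrer=search",  # introduces a new column
    "page=/cart",                               # user is missing
]
records = [parse_weblog_line(line) for line in lines]
columns = bind_schema(records)
rows = to_rows(records, columns)
```

A pre- or post-query in a benchmark run could play the role of `bind_schema` here, although the slides leave open how that step would be specified.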

  3. BIGBENCH – FURTHER DEVELOPMENT (2)
     Support for Graph structures:
     • Integration of hash-tag functionality
     • (Re-)Tweet-like methods on Customer recommendations
     • On-the-fly analysis will result in graph structures (e.g., "give me all Customers retweeting a positive review of product XY")
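The example query on this slide can be sketched as a two-hop traversal over a small in-memory graph. The `Review` type, the `(customer, review_id)` retweet edges, and the sentiment labels below are assumptions made for illustration; the slides do not define how BigBench would model them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Review:
    """Hypothetical review node; the fields are illustrative assumptions."""
    review_id: int
    product: str
    sentiment: str  # assumed labels: "positive" or "negative"


def customers_retweeting_positive(
    reviews: list[Review],
    retweets: list[tuple[str, int]],  # (customer, review_id) edges
    product: str,
) -> list[str]:
    """Return customers who retweeted a positive review of the given product."""
    positive_ids = {
        review.review_id
        for review in reviews
        if review.product == product and review.sentiment == "positive"
    }
    # Follow the retweet edges that end in one of the selected reviews.
    return sorted({customer for customer, rid in retweets if rid in positive_ids})


reviews = [
    Review(1, "XY", "positive"),
    Review(2, "XY", "negative"),
    Review(3, "AB", "positive"),
]
retweets = [("alice", 1), ("bob", 2), ("carol", 3), ("dave", 1), ("alice", 3)]
result = customers_retweeting_positive(reviews, retweets, "XY")
```

In this sample data only `alice` and `dave` retweeted review 1, the single positive review of product XY, so they are the only customers returned.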

  4. BIGBENCH – OPEN ISSUES
     • Is localization an issue for a benchmark?
     • Do images/other media add value to a data benchmark?

  5. BIGBENCH – FURTHER STEPS
     • Big Data Challenge
       • Have people implement BigBench
       • Hive version will be out soon
       • Discussion later
     • Big Data Pipeline
       • BigBench somewhere in the middle/end?
       • Discussion later
