1 / 15

Crunching Big Data

This presentation delves into the transformative journey of big data from historical astronomical measurements by Tycho Brahe and Johannes Kepler to modern computational advancements with the Large Hadron Collider (LHC) and its massive data processing capabilities. It highlights the significance of large-scale analytics, including Dryad and DryadLINQ frameworks for distributed execution and machine learning applications. Discover how big data influences various fields, from genetics to environmental studies, and see real-world examples showcasing its power and potential.

galvin
Télécharger la présentation

Crunching Big Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Crunching Big Data MihaiBudiu Microsoft Research, Silicon Valley September 27, 2011 big DATA

  2. 500 YEARS AGO Tycho Brahe1546 – 1601 Johannes Kepler 1571 – 1630

  3. The laws of PLANETARY MOTION Tycho’s measurements Kepler’slaws

  4. THE LARGE HADRON COLLIDER LHC Grid: 200K computing cores 15 PB/year

  5. genetic CODE 106 107 108 109 1010 1011 MYCOPLASMA GRAM POSITIVE BACTERIA GRAM NEGATIVE BACTERIA FUNGI/MOULDS ALGAE WORMS CRUSTACEANS ECHINODERMS INSECTS MOLLUSKS BIRDS BONY FISH CARTILAGINOUS FISH REPTILES MAMMALS AMPHIBIANS FLOWERING PLANTS

  6. bigDATA

  7. bigCOMPUTERS

  8. some TRUE STORIES

  9. >4B queries/month [comscore]

  10. Large-Scale Analytics Engine Distributed Execution: Dryad

  11. Cluster Programming: DryadLINQ Distributed Execution: Dryad Distributed Storage: TidyFS

  12. LEARN FROM DATA Application: Decision forests Library: Machine learning Programming: DryadLINQ Training examples Distributed Execution: Dryad Machine learning Classifier

  13. CONCLUSION

  14. CONCLUSION

More Related