Big Data
Internet2 Day Presentation
Dr. Greg Newby, ARSC
http://people.arsc.edu/~newby
Figure: Mandelbrot Set
Big Data at Internet2 Day 2006
Big Data in a Nutshell
• Big data are getting bigger!
• Networks aren’t keeping up
• Even small users need big data
• The biggest data are really, really big!
• Big data projects, big systems, and big networking capability are requirements for Alaskans to benefit
Figure: Fiber Optic Cable Bundle
Basic Lingo
• Byte (8 bits each): one per character (this sentence has 68 bytes)
• Kilobyte = 1000 bytes. A text email.
• Megabyte = 1 million bytes. A digital photo.
• Gigabyte = 1 billion bytes. A 30-minute DVD movie.
• Terabyte = 1 trillion bytes. 1500 music CDs.
• Petabyte = 1 quadrillion bytes. 8 days of UAF’s Internet2 traffic at 100% utilization. 41 minutes of the largest 40Gb Internet2 connection (256 times faster!)
• Exabyte = 1 quintillion bytes. PetaFLOP computers will produce this much output!
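The unit definitions above lend themselves to a quick back-of-the-envelope check. The following is a minimal sketch, assuming decimal (power-of-ten) units as on the slide and an idealized link running at 100% utilization with no protocol overhead. The function names and the 1 Gb/s and 10 Gb/s rates are illustrative assumptions; only the 40 Gb/s figure appears in the slide.

```python
# Decimal (SI) unit sizes in bytes, matching the definitions on the slide.
UNITS = {
    "kilobyte": 10**3,
    "megabyte": 10**6,
    "gigabyte": 10**9,
    "terabyte": 10**12,
    "petabyte": 10**15,
    "exabyte": 10**18,
}


def transfer_seconds(num_bytes: int, link_bits_per_second: float) -> float:
    """Ideal time to move num_bytes over a link at 100% utilization.

    Ignores protocol overhead, latency, and contention, so a real
    transfer would take longer.
    """
    return num_bytes * 8 / link_bits_per_second


def humanize(seconds: float) -> str:
    """Render a duration using the largest unit that fits."""
    for label, size in (("days", 86400), ("hours", 3600), ("minutes", 60)):
        if seconds >= size:
            return f"{seconds / size:.1f} {label}"
    return f"{seconds:.1f} seconds"


if __name__ == "__main__":
    # Assumed link rates for illustration; only 40 Gb/s is named in the slide.
    links = {"1 Gb/s": 1e9, "10 Gb/s": 1e10, "40 Gb/s": 4e10}
    for name, rate in links.items():
        t = transfer_seconds(UNITS["petabyte"], rate)
        print(f"1 petabyte over {name}: {humanize(t)}")
```

On this idealized arithmetic, one petabyte over a 40 Gb/s link works out to roughly 2.3 days, so the durations quoted on the slide presumably rest on assumptions that are not stated here.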
Sample “Big Data” Projects
• All are generating petabytes of data
• All use high-speed Internet to share data, distribute computation, provide end-user access, and as part of fundamental operations
• All have national & international collaborations
• National Virtual Observatory: Distributed astronomy
• Earth System Grid: Computational earth science
• Large Hadron Collider: High-energy physics
Figure: 1992 NSFNet Data Rates
National Virtual Observatory
• Ongoing sky survey data (visible; radio; x-ray) from many observatories
• Shared data & processing
• Transparent access via portals
Figure: NVO Architecture
Earth System Grid
• Many large computers running simulations
• Sharing output, which can be quite big!
• For computational grids, more bandwidth & lower latency among systems are critical
• (Where is AK on this map?)
Figure: ESG Structure
High Energy Physics
• The CERN Large Hadron Collider (LHC) will become operational in 2007
• 100 times the collisions of Argonne’s accelerator (the world’s largest today), producing 100 times the data rate per experiment
• Several PB/year, which requires extensive post-processing to be useful
Figure: LHC Assembly
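To relate a yearly data volume to network capacity, it can help to convert it into an average sustained bit rate. This is a rough sketch only: the slide says "several PB/year" without giving a number, so the volumes below (1, 5, and 10 PB/year) are hypothetical, as are the function and constant names.

```python
SECONDS_PER_YEAR = 365 * 24 * 3600  # non-leap year
PETABYTE = 10**15  # bytes, decimal definition


def average_rate_gbps(petabytes_per_year: float) -> float:
    """Average bit rate, in Gb/s, needed to move a yearly data volume."""
    bits_per_year = petabytes_per_year * PETABYTE * 8
    return bits_per_year / SECONDS_PER_YEAR / 1e9


if __name__ == "__main__":
    # Hypothetical yearly volumes; the slide gives no exact figure.
    for pb in (1, 5, 10):
        rate = average_rate_gbps(pb)
        print(f"{pb} PB/year needs about {rate:.2f} Gb/s sustained")
```

Each petabyte per year corresponds to roughly 0.25 Gb/s of continuous traffic, before any allowance for bursts, retransmission, or copies sent to multiple sites.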
Big Data in Alaska: for Science
• Climate study
• Oceanography
• Weather
Figure: ARSC’s Augustine Forecast
Remote sensing with the new hyperspectral satellite
• 220 spectral bands, versus ~20 previously; higher spatial resolution, too
• December 19, 2000 - EO-1 First Light Images
• An image of Alaska (right) taken by EO-1's Advanced Land Imager (ALI) in the panchromatic (PAN) band compared with an image (left) taken by Landsat 7 under nearly identical lighting and surface conditions.
http://www.gsfc.nasa.gov/topstory/2002/20020624eo1.html
Coming Soon: Big Data in Alaska for …
• Medicine: Remote imagery & diagnosis; medical collaboration
• K-12 education: live remote scientific instruments (AlaskaScope); remote instruction; larger virtual classrooms
• Government: eGovernment; govdocs; communication to government leaders
• Entertainment: interaction & entertainment
• Industry: distributed organizations; consulting; internationalization
Figure: Carbon Nanotubes, via Wikipedia
What’s next for Big Data in Alaska?
• More bandwidth needed to participate in Big Data activities
• Continued large computer systems, informed personnel, scientific expertise, leadership commitment
• More bandwidth!
• Fill in the “bandwidth map” for Alaska communities
• More bandwidth!
Figure: WCI Fibre Route