1 / 23

GridPP, The Grid & Industry

GridPP, The Grid & Industry. Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton, Middleware Coordinator Neasan O’Neill, Events Officer. Who we are, what it is and what we can do. Who are GridPP?. 19 UK Universities, CERN and CCLRC (RAL & Daresbury)

Télécharger la présentation

GridPP, The Grid & Industry

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. GridPP, The Grid & Industry Tony Doyle, Project Leader Steve Lloyd, Collaboration Board Chairman Robin Middleton, Middleware Coordinator Neasan O’Neill, Events Officer Who we are, what it is and what we can do.

  2. Who are GridPP? 19 UK Universities, CERN and CCLRC (RAL & Daresbury) Funded by PPARC: GridPP1 2001-2004 (£17m) “From Web to Grid” GridPP2 2004-2007 (£16m) “From Prototype to Production” GridPP3 2007-2011 (proposed) “From Production to Exploitation” Developed a working, highly functional Grid

  3. No. of Internet hosts (millions) Year Web: information sharing • Invented at CERN by Tim Berners-Lee • Quickly crossed over into public use • Agreed protocols: HTTP, HTML, URLs • Anyone can access information and post their own

  4. Why do particle physicists need the Grid? The CERN LHC The world’s most powerful particle accelerator 4 Large Experiments

  5. One year’s data from LHC would fill a stack of CDs 20km high Concorde (15 Km) Mt. Blanc (4.8 Km) Why do particle physicists need the Grid? Example from LHC: starting from this event • ~100,000,000 electronic channels • 800,000,000 proton-proton interactions per second • 0.0002 Higgs per second • 10 PBytes of data a year • (10 Million GBytes = 14 Million CDs) We are looking for this “signature” Selectivity: 1 in 1013 Like looking for 1 person in a thousand world populations Or for a needle in 20 million haystacks!

  6. Solution – Build a Grid • Share more than information • Efficient use of resources at many institutes • Leverage over other sources of funding • Data, computing power, applications • Join local communities • Challenges: • share databetween thousands of scientists with multiple interests • link major and minor computer centres • ensure all data accessible anywhere, anytime • grow rapidly, yet remainreliablefor more than a decade • cope withdifferent management policiesof different centres • ensuredata security • be up and running routinely by2007

  7. Middleware is Everything Your Program Single PC Grid Your Program PROGRAMS MIDDLEWARE User Interface Machine Word/Excel Games Email/Web Resource Broker Information Service OPERATING SYSTEM CPU Replica Catalogue Disks, CPU etc Bookkeeping Service Middleware is the Operating System of a distributed computing system CPU Cluster CPU Cluster CPU Cluster Disk Server

  8. GridPP Middleware Development Workload Management Grid Data Management Network Monitoring Information Services Security Storage Interfaces

  9. What you need to use the Grid 1. Get a digital certificate (UK Certificate Authority) Authentication – who you are 2. Join a Virtual Organisation (VO) Authorisation – what you are allowed to do 3. Get access to a local User Interface Machine (UI) and copy your files and certificate there 4. Write some Job Description Language (JDL) and scripts to wrap your programs ############# HelloWorld.jdl ################# Executable = "/bin/echo"; Arguments = "Hello welcome to the Grid "; StdOutput = "hello.out"; StdError = "hello.err"; OutputSandbox = {"hello.out","hello.err"}; #########################################

  10. EGEE LCG GridPP International Context GridPP is part of EGEE and LCG (currently the largest Grid in the world) EU Enabling Grids for e-Science (EGEE) 2004-2008 Grid Deployment Project for all disciplines UK National Grid Service UK’s core production computational and data Grid LHC Computing Grid (LCG) Grid Deployment Project for LHC NorduGrid (Scandinavia) Grid Research and Development collaboration Open Science Grid (USA) Science applications from HEP to biochemistry

  11. The LCG Grid Status Worldwide 182 Sites 23,438 CPUs 9.2 PB Disk 2,200 Years of CPU time UK 21 Sites 4,482 CPUs 180 TB Disk 593 Years of CPU time

  12. What GridPP Has Done So Far • Analysed 300,000 possible drug components in the fight against the Avian Fluvirus • Simulated 46 million molecules for medical research in 5 weeks, which would have taken over 80 years on a single PC • Reached transfer speeds of 1 Gigabyte per second in high speed networking tests from CERN – a DVD every 5 seconds • Simulated 500 million particle physics collisions with the BaBar experiment • Transformed the way particle physics computing problems are approached

  13. Who else can use a Grid? • Bioinformatics • Engineering • Astronomy • Healthcare • Commerce • Gaming

  14. “UK contributes to EGEE's battle with malaria” Number of Biomedical jobs processed by country BioMed Successes/Day 1107 Success % 77% WISDOM (Wide In Silico Docking On Malaria) The first biomedical data challenge for drug discovery, which ran on the EGEE grid production service from 11 July 2005 until 19 August 2005. GridPP resources in the UK contributed ~100,000 kSI2k-hours from 9 sites Normalised CPU hours contributed to the biomedical VO for UK sites, July-August 2005

  15. "GridPP has been developed to help answer questions about the conditions in the Universe just after the Big Bang," said Professor Keith Mason, head of the Particle Physics and Astronomy Research Council (PPARC). "But the same resources and techniques can be exploited by other sciences with a more direct benefit to society."

  16. GridPP & IndustryWhat We Have To Offer • Our Grid • Security tools • GridSite • R-GMA • APEL accounting system

  17. Our Grid • The UK Grid (via one of the individual university sites) can be used to run applications for areas such as finance and image processing.

  18. Security Tools & GridsiteGrid Security for the WebWeb platforms for Grids • Digital Certificates • Certification Authority • Gridsite identifies users to websites with the digital certificates • GridSiteWiki is an extension to the tool • GridSite is open source (http://www.gridsite.org/)

  19. RGMA & APEL accounting system • RelationalGridMonitoringArchitecture • An information and monitoring system for static and dynamic information about grid resources, applications and networks • Accounting Processor for Event Logs • Provides a summary of the resources consumed based on attributes such as CPU time, Wall Clock Time, Memory and grid user identity

  20. GridPP & IndustryCurrent Involvement • HP are sponsoring a joint project with GridPP at Bristol. • GridPP has an association with IBM through collaboration on ScotGrid and R-GMA. • Specific sites also have close relationships with various industrial suppliers.

  21. GridPP & IndustryCurrent Involvement • Posters at “Technology Opportunities from CERN: the impact of Big Physics on Industry”. • Attended KITE club meetings on: • Healthcare, • Medical image processing • Film and computer games • Speakers at a forum on Network and Grid Security organised for the IT industry.

  22. Future Plan to establish a small steering group to lead technology transfer activity. The group, working with various companies, would examine different methods of technology transfer and identify the GridPP activities that can be used in industry and business.

More Related