1 / 83

Grid Computing

Dr. Nguyen Tuan Anh. Grid Computing. Faculty of Computer Science and Engineering Ho Chi Minh city University of Technology. Contacts: Email: tuananh.nguyen@hcmut.edu.vn Tel: (84-8) 8647256, ext. 5840. Course objectives. Understand “Grid” concepts

kineks
Télécharger la présentation

Grid Computing

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Dr. Nguyen Tuan Anh Grid Computing Faculty of Computer Science and Engineering Ho Chi Minh city University of Technology Contacts: Email: tuananh.nguyen@hcmut.edu.vn Tel: (84-8) 8647256, ext. 5840

  2. Course objectives • Understand “Grid” concepts • Roles of “Grid” in our modern society • Grid architectures • Understand Grid related technologies • Security • Web services • Grid services and standards • Ways to efficiently exploit the Grid power • Programming models • “Know-how” • Programming the Grid

  3. Evaluation • Self study and self research • Active • Efficient • Practical exercises and seminars: 50% • Exercises • Seminar: each group will study a Grid-related topics and present this in the class • Final exam: 50%

  4. References • Lecture notes • http://www.cse.hcmut.edu.vn/~tuananh/courses/Grid • Books • I. Foster and C. Kesselman, The Grid: Blueprint for a New Computing Infrastructure. Morgab Kaufmann Publishers, 1999. • Maozhen Li, Mark Baker, The Grid Core Technologies, Wiley, 2005. • Fran Berman, Anthony J. G. Hey and Geoffrey C. Fox, Grid computing: Making the Global Infrastructure a Reality. John Wiley & Sons Ltd, 2003. • Luis Ferrelra et al. Grid Services Programming and Application Enablement. IBM Redbooks, 2004. • Mark Endrei et al. Patterns: Service Oriented Architecture and Web Services. IBM Redbooks, 2004. • Web links: • Globus project: http://www.globus.org/alliance/ • Global Grid Forum: http://www.ggf.org

  5. Grid introduction

  6. Balloon (30 Km) CD stack with 1 year LHC data! (~ 20 Km) 6 cm 50 CD-ROM = 35 GB Concorde (15 Km) CERN (founded 1954) = “Conseil Européen pour la Recherche Nucléaire” “European Organisation for Nuclear Reseach” Mt. Blanc (4.8 Km) 27 km circumference tunnel CERN Large Hadron Collider

  7. Un little bit of history … • I.Foster and C.Kesselman come from the parallel computing field Parallelism • 1970-80 : The pioneers • Vectorial computers (Cray 1, Cray 2,..) • 1980-90 : The exuberance • Massively Parallel Computers (CrayT3D/T3E, Connection Machine,…) • 1990-2000 : The profit • Cluster (Beowulf, ASCII Red,..) • 2000 ->? The Globalization • GRID • Global Computing

  8. Overview • Grid concepts • Challenges and issues • Grid middleware • Grid applications • Grid projects • Future directions

  9. Part I: concepts

  10. What is “Grid”? • 1001 definitions • HPC researchers • Connect supercomputing sites • Distributed supercomputing • Web engineers • Stateful web services for accessing resources • Next generation of the Internet and Web • Companies • Services accessible by other users • Portal to access local resources • Users • Like electrical grid • …

  11. Grid environment 11

  12. « Grid » domains • Originally • To link together supercomputing sites • Today • Collaborative engineering • Data exploration • High-throughput computing • Distributed high performance computing

  13. A little bit of vocabulary • Usually one distinguishes three types of GRID • Knowledge GRID • To share knowledge • Idea: Virtual distributed laboratory • Data GRID • To share the data • Idea: Large scale data storage • Computing GRID • To share computing power • Idea: Alternative to supercomputers

  14. The GRID today -1 • Knowledge GRID • Virtual distributed Institutes/Laboratories allowing remote people to collaborate as they would be at the same location • Purpose: to share knowledge, to collaborate for researches or specific activities • Few specific tries • Remote surgery • ….

  15. The GRID today - 2 • Data GRID • Also referenced as «  peer-to-peer » technology • Versus to the « Client/Server » technology who dominated the market of distributed systems during last decades, with this approach every participant is both client and server • Nobody knows exactly where the data are • The system figures to find the nearest requested data out • The requested data could be distributed on several location and same data could be located at several places (data coherence problem) • The pursuit of the data is transparent to the user • Some examples: • Napster, Gnutella, pirate software, Project DATAGRID (CERN),…

  16. The GRID today - 3 • Computing GRID • A supercomputer on you the table of your kitchen … • Some facts • Supercomputers are very expensive and age very rapidly • Except for some very specific persons, we do not need a supercomputer every day… … but when we need one it is very frustrating not to have one ! • The total time personal and other computer waste time not doing anything is huge • I am (almost) permanently connected to a lot of computers. • Potentially I am permanently connected to a supercomputer • First try to exploit this fact: Projet seti@home

  17. GRID computing : different views • Virtual Supercomputing • An association of several supercomputers geographically distributed (10-1000) • Each node is a parallel machine managed through a scheduling system (batch) • To provide a virtual supercomputer • Internet computing • A combination of a large number of PCs(10000 - 1000000) • To exploit unused computing • Metacomputing • An association of several servers providing a set of applications

  18. “Grid” vs. cluster computing • “Partial view” vs. “Global view” of the environment • Large scale, geographically distributed • Non-structured • Heterogeneous • Dynamic, unstable • Multiple administrative domains

  19. Cluster SMP Supercomputer Supercomputer Workstation PC Computing Grid: our goal My laptop ?

  20. The dream • To make computers user friendly: To switch • from a program centric view… • « Execute this program on this (these) machine(s)» • to a service centric view… • « Provide me this service with this QoS level » • Today the WEB provides access to a large amount of documents and data BUT…. • Access is not transparent…. URL • To find the correct information is a tedious work… search engines The GRID will change our way to use computer as many drasticallythat WEB did more than 10 years ago

  21. New characteristics • Large-scale • Need for dynamic selection • Partial view of the environment • Heterogeneity • Hardware, OS, network, software environments (languages, libraries, tools...) • Complex • unpredictable structure • Dynamic • unpredictable behavior • Multiple administrative domain • no centralized control

  22. Challenges • Efficient exploitation of Grid performance • New programming models • Service-centric vision • Resource connectivity • Middleware services • Applications • Interoperability vs. performance • Good interoperability -> (very) slow • Fast -> not so interoperable • Grid scalability • Resource management mechanisms/models • Large number of heterogeneous resources • Grid infrastructure and Grid application evaluations • How to benchmark Grid systems and applications • Trust and security • How to trust the environment • How to protect sensitive data

  23. Part II: Grid Middleware & components

  24. What is Grid middleware • System software between applications and OS • Provide services to applications • Discovery • Execution • Storage • Data movements • Information • Service integration • ... • Failure detection and recovery • Resource monitoring • ... • Hide all complexities of the Grid environment

  25. Grid layered architecture: principle

  26. Grid architecture • Fabric layer • Physical resources • Computers/ computing platforms • Data storages • Sensors • Network • Layered on top of local OS • Resource & connectivity protocols layer • Provide access to resources and services in the fabric layer • Security management • Authentication • Authorization • Secure communication

  27. Grid architecture • Collective service layers • Facilitate the use of resources/services • Provide meta-services • Resource discovery • Resource brokers • Failure detection • ... • User application layers • Grid applications • Parallel applications • Service-oriented applications • ...

  28. Community Grid Architecture Model

  29. Service-centric view: Grid environment

  30. Virtual organization (VO) • Grid environment can be organized as VOs • VO: groups of services/resources • Similar characteristics • Location/site • Functions • A VO can a part of a bigger VO

  31. Grid evolution GUSTO testbed GT3.0 OGSA GT4.x WSRF GT1.0 1996 1999 2002 2005 GGF Proof of concept Standardization • Basic services? • Security issues • Grid architecture • Grid interoperability Globus Toolkit !!!! http://www.globus.org

  32. Applications High-level Services and Tools GlobusView Testbed Status POP-C++ MPI ProActive CC++ Nimrod/G globusrun Core Services Nexus GRAM Metacomputing Directory Service Globus Security Interface Heartbeat Monitor Gloperf GASS Local Services Condor MPI TCP UDP LSF Easy NQE AIX Irix Solaris Globus layered architecture

  33. Globus Toolkit 4x • Sustainable changes on the services interoperability and infrastructure • Open Grid Services Architecture (OGSA) • Stateful Web Services • Enable the integration of user specific Grid services • Define standard interfaces • How to access Grid services • Disadvantage • Slow

  34. GT: Core service architecture

  35. Globus Toolkit • Grid Service Specification • How to write, publish and use a Grid service • GT components: • GT core • Meta-services used to implement other services and service behaviors (e.g. service creation, destruction) • GT base services • Use the GT core to implement Grid capacities: resource management, information services, data transfer, etc. • Other Grid services • Implemented by user to enable some enhancement capacities

  36. GT: from other perspectives Local-level services • WS GRAM • Fork • PBS • LSF • GridFTP • RFTP … create GT container GT security create VO-level services • Information service • …???

  37. What GT DOES NOT address • GT focus on accessing local resources • Things still missing • Coordination services • Resource/service discovery • Information collection • Resource connectivity • Programming models/tools • Things to be improved • Performance!

  38. Part III: Grid security

  39. Grid security: a scenario

  40. Symmetric key

  41. Asymmetric cryptography Unique Mathematical relation

  42. How to guarantee the information integrity?

  43. Digital Signatures • A message signed by the private-key

  44. How to identify users?(User authentication)

  45. Authentication models Direct-trusted model Third-party trusted model 4: sign(M) with sender’s private key Sender Receiver Sender Receiver 3: message M 1: Register sender’s public key 2: store trusted public key 5: public key existed? Trusted authority Digital certificate Key repository

  46. Digital certificate • Based on asymmetric cryptography • Consist of • Public key • User identity information (user name, organization, address, …) • One or more digital signature signed by some well known Certificate Authority (CA)

  47. Certificate authority • Responsible for • Positively identify entities requesting certificates • Issuing, removing, and archiving certificates • Protecting the Certificate Authority server • Maintaining a namespace of unique names for certificate owners • Serve signed certificates to those needing to authenticate entities • Logging activity

  48. Access the Grid

  49. Grid authentication

  50. Grid Delegation

More Related