170 likes | 315 Vues
Resource Monitoring & Service Discovery in GeneGrid. Sachin Wasnik Belfast e-Science Centre. http://www.qub.ac.uk/escience. Fusion Antibodies. Amtec Medical. Support from BT plc. GeneGrid Project. Collaborative Industrial R&D project Stakeholders. £820,000 (DTI funding £406,000).
 
                
                E N D
Resource Monitoring & Service Discovery in GeneGrid Sachin Wasnik Belfast e-Science Centre http://www.qub.ac.uk/escience
Fusion Antibodies • Amtec Medical • Support from BT plc GeneGrid Project • Collaborative Industrial R&D project • Stakeholders • £820,000 (DTI funding £406,000)
GeneGrid: Objectives • Grid Based Framework for Bioinformatics Analysis • Integration of Existing Technologies & Data Sets • Production of a ‘Virtual Bioinformatics Laboratory’ • Platform for scientists to access collective skills and experiences in a secure, reliable and scalable manner • in silico knowledge discovery
GeneGrid: Components • Application Integration & Management (GAM) • Data Access, Integration & Storage (GDM) • Resource Monitoring & Service Discovery (GRM) • Workflow & Process Management (GWM) • Portal
Resource Monitoring & Service Discovery • Built upon the Belfast e-Science Grid Manager project which consists of 1) GeneGrid Application & Resources Registry (GARR) • Registry service - GT3 based 2) GeneGrid Node Monitors (GNM) • Light weight adapter present on all Node
GeneGrid Environment GeneGrid Portal GDM Service GeneGrid Environment # 2 GeneGrid Environment # n GeneGrid Workflow Manager GDM Service GAM Service GAM GAM Swissprot EMBL GeneGridApp & ResourceRegistry GARR GeneGrid Workflow Definition GeneGrid STRIP Swissprot EMBL TMHMM bl2seq GAM Service TMHMM SignalP GAM Service GAM Service SignalP RP TMHMM bl2seq EMBOSS RP GeneWise Primer3 ClustalW HMMER EMBOSS BLAST DB query 6p SMP sparc (solaris 7) DB query RP Eliminator I686 Linux Sparc (Solaris 8) QUB QUB 4p SMP linux BT Data Centre 4p SMP linux 32 x Sun Blade linux University Melbourne SDSC Belfast e-Science Centre GeneGrid Overview
Portal GWMSF GNM GNM GARR GNM GNM GDMSF GDMSF GWDD GSTRIP GeneGrid Environment • GWMSF and both GDMSFs in the GE register their existence with the GARR • GWMSF and the GeneGrid Portal are both configured with the location of the GARR service • Upon start up, the Portal connects to the GARR to discover the location of the GDMSF for both the GWDD and the GSTRIP databases GNM on all GeneGrid Environment nodes registering with the GARR.
GARR Service • GARR is the central service that mediates service discovery by publishing information about various services available • provides an interface to query • captures the information which is sent by the GNM • Stores the information in GARR Database
GNM GARR GAMSF BLAST SDSC GNM GNM BeSC GAMSF GDMSF BLAST EMBL GeneGrid Node Monitor System Information • Hardware address • System Time, • IP address, • CPU speed, • CPU load, • Total Memory • Free Memory • Operating System’s Name • Operating Systems version • Uptime • Hostname • System Architecture • Number of Processor • Load average for last 1 minute, 5 minutes 15 minutes • Custom Data. Application Data Name of the Resource Type of the Resource Grid Service Handle (GSH) GNM on multiple resources across administrative domains registering resource information securely to a GARR
GARR VO 1 GNM GNM GAMSF GAMSF Resource A Resource B GARR VO 2 Shared Resources • GAM and GDM services make up the GeneGrid Shared Resources • GNM can register with many GARR services across multiple GE allowing the resources to be shared between multiple organisations • Organisations have complete control over what resources, if any, they wish to share with other organisations, forming dynamic virtual organisations Resource A registers with both VO1 & VO2. Resource B registers with VO1 only.
Future Work • Capture the network information in order to effectively utilize the resources when huge file transfer are to be performed • Predicting the performance of resources based on the data stored in GARR database • Add metadata about the services registered
Contact • Project Manager: Dr Paul Donachy • p.donachy@qub.ac.uk • Bioinformatician: P.V. Jithesh • p.jithesh@qub.ac.uk • Grid Programmer: Sachin Wasnik • s.wasnik@qub.ac.uk • More information: http://www.qub.ac.uk/escience/projects/genegrid
Thank You! http://www.qub.ac.uk/escience