The Design of a Grid Computing System for Drug Discovery and Design

The Design of a Grid Computing System for Drug Discovery and Design David Gobaud Computational Drug Discovery Stanford University 14 March 2006

Outline • Overview • Network Topology • Structure Summary • Resource Management Agent • Job Submission Client • Scheduler Node • Execution Node • Experimental Results • Questions

Overview • Novel Grid computing system for drug discovery and design (DDG) • Use idle resources donated by nodes over the internet • P2P technology • Hybrid resource management architecture • Avoid master/slave problems

Network Topology • Centralized + Decentralized P2P • Clusters that wish to compute join DDG and become execution nodes • Grouped into Virtual Organizations (VO) by structures and interests • Communicate via Resource Management Agents (RMA) in P2P mode • One peer fails – other peers can compensate • Each local RMA captures and maintains resource info of a VO

Structure Summary • End-users submit protein molecule files to DDG via web portal • End-users create job request template for a new application • Specify QoS requirements • Job submitted to Scheduler Node by Submission Node • DDG provides a distributed repository where execution nodes publish their real-time workload info

Structure Summary • Scheduler Node • Divide job into many parallel sub-jobs • Query repository to find and allocate execution nodes for sub-jobs to satisfy QoS requirements • Combine results from sub-jobs into whole docking result • Return result to submission node which informs the user the job is complete

High-Level Workflow of DDG

Experimental Results • Feasibility and performance evaluated by protein molecules docking experiments • Many DBs used – Specs, ACD, CNPP, and NCBI • 20 PC LAN • 1Ghz Pentium IVs • 256 Mb RAM • 40 Gb hard drives • 100 Mb/sec Ethernet • SGI Origin 3800 cluster of 64 processors • SUNWAY cluster of 32 processors • SUNWAY cluster of 64 processors • SUNWAY cluster of 256 processors

Experimental Results – Running Time

Experimental Results – Robustness • 20 PCs each receiving 100 protein molecules to dock • Some PCs randomly shut down • After P2P network stabilized fraction of molecules that could not be processed measures • Molecule docking failure rate almost equal to nodes failure rate as expected • No significant resource scheduling failure -> Robust

Questions • Why are failed docking computations not rescheduled? • What type of network topology does Folding@Home use?

The Design of a Grid Computing System for Drug Discovery and Design

The Design of a Grid Computing System for Drug Discovery and Design

Presentation Transcript

Drug design and discovery

Drug Discovery Grid -- A real grid application

Grid Computing For Scientific Discovery

Drug Design / drug discovery

Chapter 13. Drug Design and Discovery

Using Parallel Computing in Drug Design

The Organic Chemistry of Drug Design and Drug Action

Drug Discovery Grid -- A real grid application

The Organic Chemistry of Drug Design and Drug Action

The Organic Chemistry of Drug Design and Drug Action

Drug Action and Drug Design

The Organic Chemistry of Drug Design and Drug Action

Computer simulations in drug design, and the GRID

Granular Computing for the Design of Information Retrieval Support System

Drug Targets their role in drug design / discovery

Drug Design / drug discovery

“Grid Platform for Drug Discovery” Project

The Grid System Design

The Design Concept and Initial Implementation of AgentTeamwork Grid Computing Middleware

Discovery and Design of Algorithms

Key Aspect for Approach of Smart Grid Design System