230 likes | 322 Vues
The Big Questions. Life & death. Future of the planet. Nature of the universe. Consciousness. Empirical. Theory. Simulation. Data. How Do We Answer Them?. <0. 1700. 1950. 1990. The Same is True of Smaller Questions. Designing new chemical catalysts Selling advertising
E N D
The Big Questions Life & death Future of the planet Nature of the universe Consciousness
Empirical Theory Simulation Data How Do We Answer Them? <0 1700 1950 1990 Service-Oriented Science: Globus Software in Action
The Same is True of Smaller Questions • Designing new chemical catalysts • Selling advertising • Creating entertainment • Finding parking Service-Oriented Science: Globus Software in Action
Information Technology and Science “Paul Erdös claimed that a mathematician is a machine forturning coffee into theorems. The scientist is arguably a machinefor turning data into insight.” • Service-Oriented Science, I. Foster, Science, 308, p. 814. Service-Oriented Science: Globus Software in Action
What are the Products of Science? • Papers • “We learned this, and here’s how.” • Data and Datasets • “We collected this data. Download it, write an analysis program, and see what you can learn from it.” • Web Portals • “We constructed this scientific model. Use our data or bring your own, supply some parameters, and see how it behaves.” • Requires manual operation. • Web Services • “Here’s our climate model. Integrate it with your models for [ocean currents/weather/crop forecasts] and see what happens.” • “Here’s our indexed data from the latest experiment run. Run your filters against it and see if you can find anything interesting.” • “Here’s our genome analysis engine. Upload your proteins and see what they will do in a cell.” Increasing degrees of collaboration Service-Oriented Science: Globus Software in Action
Grid:An Enabler of eScience The dubious electrical power grid analogy Must we buy (or travel to) a power source? Or can we ship power to where we want to work? Enable on-demand access to, and integration of, diverse resources & services, regardless of location Service-Oriented Science: Globus Software in Action
1st Generation Grids Focus on aggregation of many resources for massively (data-)parallel applications EGEE Service-Oriented Science: Globus Software in Action
Second-Generation Grids • Empower many more users by enabling on-demand access to services • Science gateways (TeraGrid) • Service oriented science • Or, “Science 2.0” “Service-Oriented Science”, Science, 2005 Service-Oriented Science: Globus Software in Action
“Web 2.0” • Software as services • Data- & computation-richnetwork services • Services as platforms • Easy composition of services to create new capabilities (“mashups”)—that themselves may be made accessible as new services • Enabled by massive infrastructure buildout • Google projected to spend $1.5B on computers, networks, and real estate in 2006 • Many others are spending substantially • Paid for by advertising Declan Butler, Nature Service-Oriented Science: Globus Software in Action
Automating Science • Human access to data is nice • Automated access by software tools is revolutionary • “In the time that a human user takes to locate one useful pieceof information within a Web site, a program may access and integratedata from many sources and identify relationships that a humanmight never discover unaided.” - Foster Service-Oriented Science: Globus Software in Action
User Discovery tools Analysis tools Science 2.0:E.g., Virtual Observatories Gateway Data Archives Figure: S. G. Djorgovski Service-Oriented Science: Globus Software in Action
Service-Oriented Science People create services (data or functions) … which I discover (& decide whether to use) … & compose to create a new function ... & then publish as a new service. I find “someone else” to host services, so I don’t have to become an expert in operatingservices & computers! I hope that this “someone else” can manage security, reliability, scalability, … ! ! “Service-Oriented Science”, Science, 2005 Service-Oriented Science: Globus Software in Action
Are Scientists ReallyDeveloping Web Services? Service-Oriented Science: Globus Software in Action
Cancer Bioinformatics Grid • Common system architecture (caGrid) provides the service interface “plumbing” and the service hosting capability • Web services • Community participants supply useful services • Data access, analysis, modeling, filtering, authoring, etc. • https://cabig.nci.nih.gov/ Service-Oriented Science: Globus Software in Action
The Introduce Authoring Tool • Define service • Create skeleton • Discover types • Add operations • Configure security • Modify service Generates GT4-compatible WebServices Introduce: Hastings, Saltz, et al., Ohio State University Service-Oriented Science: Globus Software in Action
The Importance of “Hosting”and “Management” Tell me about this star Tell me about these 20K stars Support 1000sof users E.g., Sloan DigitalSky Survey, ~10 TB; others much bigger Service-Oriented Science: Globus Software in Action
Skyserver Sessions(Thanks to Alex Szalay) Service-Oriented Science: Globus Software in Action
Hosting & Management:Application Hosting Services Application providers Appln Code Application deployment Application Prep Tool(s) Provisioning Application client Resource Provider Appln Code Users Resource Provider Appln Code AHSmanagement Hosting Service Author ization Admins Persistence Policymanagement PDP Service-Oriented Science: Globus Software in Action
Who Will Host Your Services? • Your institution (campus resources) • (Inter)national systems • TeraGrid, Open Science Grid, UK Nat’l Grid Service, ChinaGrid, NaukaGrid, etc. • Science domain systems • caBIG, NEES, Earth System Grid, Orion*, LEAD, NEON*, LHC Computing Grid, etc. • Commercial systems • Amazon, Google, etc. Service-Oriented Science: Globus Software in Action
Examples ofProduction Scientific Grids • APAC (Australia) • China Grid • China National Grid • DGrid (Germany) • EGEE • NAREGI (Japan) • Open Science Grid • Taiwan Grid • TeraGrid • ThaiGrid • UK Nat’l Grid Service Service-Oriented Science: Globus Software in Action
A B 1 1 9 9 Shared Distributed Infrastructure Application-Infrastructure Gap Dynamicand/orDistributedApplications Service-Oriented Science: Globus Software in Action
Tool Tool Workflow Uniform interfaces, security mechanisms, Web service transport, monitoring Registry Credent. DAIS GRAM User Svc User Svc GridFTP Host Env Host Env Bridging the Application-Resource Gap User Application Database Specialized resource Computers Storage Service-Oriented Science: Globus Software in Action
Grid Infrastructure • Distributed management • Of physical resources • Of software services • Of communities and their policies • Unified treatment • Build on Web services framework • Use WS-RF, WS-Notification (or WS-Transfer/Man) to represent/access state • Common management abstractions & interfaces Service-Oriented Science: Globus Software in Action