290 likes | 423 Vues
The integration of instruments and sensors into grid computing environments is essential to enable real-time interactions from grid applications. This approach aims to abstract sensor functionalities, facilitating robust and flexible applications that reduce reliance on specialized knowledge. The primary goals include developing standardized methodologies for "grid enabling" instruments, collaborating with diverse scientific disciplines, and advancing automatic metadata production. The methodologies target large and small scientific instruments, focusing on improving data acquisition and enhancing applications across various fields, including X-ray crystallography and environmental monitoring.
E N D
Instruments and Sensors as Grid Services Donald F. (Rick) McMullen1 Kenneth Chiu2 John C. Huffman1 Kia Huffman1 Randall Bramley1 1Indiana University 2SUNY Binghamton
Motivations • Instruments and sensors are not well integrated into grids • Data is acquired, processed and stored before it “hits the grid”. • Need methodology for interacting with instruments and sensors in real time from grid applications • Some abstraction of sensor and instrument functionality is needed to make grid applications that use them more robust and flexible.
Goals • Integrate instruments and sensors (e.g. real-time data sources) into a Grid computing environment with Grid services interfaces. • Abstract instrument capabilities and functions to reduce data acquisition and analysis applications’ dependence on specialized knowledge about particular instruments. • Move production of metadata as close to instruments as possible and facilitate the automatic production of metadata. • Develop a standard, reusable methodology for “grid enabling” instruments. • Collaborate with scientists in academia and industry in a broad range of disciplines who either develop instruments or whose work depends on the details of using them.
Ground motion sensor array Electron Microscope Increasingbandwidth per sensor Traffic sensors X-Ray Crystallography Wireless ‘mote’sensor network Radio Telescope Increasing number of sensors Increasing real-time application • Simple 2-D analysis of instrument taxonomy • At lease five dimensions identified in more detailed analysis • Project must address enough points (classes) to assure breadth of applicability
Initial Applications • High brilliance X-ray crystallography • Large instrument application • Deeply integrated into bio and medical discovery research methodology • Mature analysis software and large user community • Robotic telescopes • Small numbers of sensors: CCD, environmental; some control aspects: filters, aiming, dome. • Global coordination needed for scheduling • Aggregation of disparate sensors into a “composite”instrument • Small sensors • Minimal memory and CPU • Wireless connectivity • Developing parallel project to use ad hoc/swarm networks for data collection for real-time simulation and prediction • Updating of data flows in response to sensor/network reconfiguration.
CIMA implementation targets Large scientific instruments Embedded sensorsand controllers verylargesystems, few elements verysmallsystems, manyelements PC104 industrialcontroller board Synchrotron beamline(APS/ALS) MICA Mote wireless sensor/controller board
MMSF Automated Telescope • Typical remote access automated observatory • System has 33 distinct “sensors”, 12 controllers • Open/close of roof based on a Polaris transparency monitor and rain detector – simple grid of wires detecting rain drops • Telescope direction and dome control • Filter selection and telescope focus • Liquid nitrogen fills of the CCD dewar jars ….
MMSF Observatory Features • Instruments producing data without units • Temperature, humidity cutoffs determined empirically as resistances, not degrees or % • Hierarchical and co-located instruments • Single platform holds three instruments, so orienting one changes orientation of all • Updates to equipment occurs frequently • Data transferred via 28k modem line –middleware needs to work locally, directly between instruments and sensors
Observatory Data Architecture • Control Data • object control • instrument control • Object Data (i.e., object of scientific interest) • Full spectrum from raw unitless data to derived data artifacts • Instrument package • system package data (multiple attributes output) • system sensor data (single attribute output) • nonsystem sensor data (weather data from NOAA) • calibration data • access protocols
X-Ray Crystallography Proxy Box InstrumentManager Portal InstrumentServices LAN WAN LAN DataArchive Non-grid service Grid service Persistent Non-persistent
LTER • Automatic updating of flows • Provenance • SOAP/ARTS (Antelope Real-Time System)
Further applications currently being explored • Electron Microscopy • U. Queensland EM group – regional scale, multiple instruments • - KISTI 2nd largest EM (after Osaka) Robotic telescopes - Bradford robotic telescope, Oxenhope observatory, Faulkes telescope project • Environmental monitoring • Water use • contaminants • Industrial monitoring and control, e.g. Train axles • - Ore trains – Km long, derailment is very expensive to fix • - Temperature sensors on axles monitor bearing status, anticipate wheel failure
Common Instrument Middleware Architecture (CIMA) Elements • Schema for instrument functionality (and ontology for schema attributes); • Data model for representing instrument metrics and calibration; • A small, high performance, embeddable Web Services stack, initially in Java, including Proteus support for multi-protocol, multimodal transport; • Service implementation for accessing the instrument’s functionality and metrics via the Proteus-mediated interface; • Ability to dynamically insert new protocols into running instances • OGSA and WS-RF compliant functions to register with a location service, authenticate users, provide access control to instrument controls and data, send and receive events, and co-schedule the instrument into a Grid computing and storage context.
Example CIMA minimal knowledge bootstrap procedure CIMA instrument Application Globus init, user Proteus/SOAP calls authentication, and instrument lookup Service “ send description” Implementation (SI) returns Application parses “ RDF description” description of itself description for ports to and instrument read for calibration and voltage “ read calibration port” SI returns “ calibration” stored calibration “ read thermocouple port” SI calls controller “ thermocouple voltage” function to read Application reads and return voltage thermocouple voltage then computes and displays a temperature
Sensor Sensor ChannelSource ChannelSource Sensor ChannelSource ChannelSource Services • Instrument • get • set • Sensor • get • set • ChannelSource • get • set • register • ChannelSink • receive • get • set Instrument ChannelSink
Parcels • Wish to unify our data models, etc. • Toolkit must be application-independent as much as possible. • Attributes • Type (string) • Globally unique ID (string) • Encoding • CDATA, Binary, ASCII, Base64 • Location • Inline, URL, Other • Parcel Sets • Special data used for connectivity information.
Technologies • Web services • XML, SOAP, WSDL, binary XML • Grid services • OGSA/OGSI, WS-RF, DAIS • Axis C++ gSOAP • Proteus (SC 2002) • XBS (HPC 2004) • Schema-specific parsing
Proteus Motivation • Web Services for scientific computing • SOAP performs well as a lingua franca • But suffers from performance problems for scientific data • Solution: establish initial communication with SOAP, and then switch to a faster protocol. • Grid intermediaries
Proteus Overview • Provides multiprotocol RMI system to applications • Can wrap existing protocol implementations with dynamic invocation • Facilitates use of SOAP as common language • Switch to faster protocols if supported by both sides. • Mediates between protocol providers and applications • Applications use Proteus client API • Providers use Proteus provider API • Allows a new provider to be added (at run-time) without changing application • Generic serialization/deserialization allows marshalling code to be reused for multiple protocols
Client Client Proteus Proteus ProviderA ProviderA ProviderB ProviderB Multiprotocol Process 1 Process 2 Protocol A Protocol B Network
Proteus APIs Application Client API Proteus Provider API Protocol A Provider Protocol B Provider
Schema-Specific Parsing • XML processing stages (conceptual) • Well-formedness • Lexical and syntactic, defined by core XML specification. • Validity • Conformance to a schema, mainly structural • Application
Application Application Application Validation Validation Validation Well-formedness Well-formedness Well-formedness Merging Stages Fully articulated Implicitly validated Schema-specific parsing User written Merged
Compiler-Based Approach C (fast) • Front-end parses schema into intermediate representation. • Back-end generates code from intermediate representation. • Intermediate representation is a generalized automata. XMLSchema IR Java RELAXNG C(low-pow)
Summary • Create standards for accessing broad spectrum instruments and sensors. • Incompatible components should still have some base level of interoperability.