70 likes | 195 Vues
The NARA Persistent Archives Prototype, developed by Bill Underwood and project members including Reagan Moore and Richard Marciano, focuses on enhancing archival services using a virtual data grid framework. Objectives include the implementation of ingestion workflow prototypes, XML schema for Submission Information Packages (SIP), file type identification, and information extraction services. The project aims to validate digital objects and metadata while ensuring compliance with NARA's records management requirements. Key functionalities include service registration, metadata verification, and demonstration using SLAC science data.
E N D
NARA Report: NARA Persistent Archives Prototype Bill Underwood GTRI, Atlanta CCSDS, MOIMS DAI / IPR WGs Toulouse, 2 Nov-5 Nov 2004
Project Members Reagan Moore, PI, SDSC SDSC - Richard Marciano, Wayne Schroeder Univ. of Maryland - Joseph Jaja SLAC - Jean Deken GTRI - Bill Underwood
Project Objectives • Virtual Data Grid Services • Ingestion Workflow Prototype • XML Schema for SIP • Data Description Languages
Virtual Data Grid Services • Archival services provided by GTRI • File System Packaging with NARA Metadata in XML DTD Manifest • File type identification • File conversion • File viewers and readers • Information Extraction • Services described in WSDL • Register Services • Discover and request archival services • Demonstrate on SLAC science data
Ingestion Workflow Prototype • Addressing issues similar to the Producer-Archive Interface • Provides data to NARA based on a prior agreement with Records Creator • Consists of metadata server and an ingestion client • Provides initial arrangement, context and metadata • NARA • validates digital objects and metadata, • stores objects in a digital repository and • stores metadata in a catalog • Demonstrate on SLAC science data.
XML Schema for SIP • Modifications of MET to meet NARA records management requirement. • Client generates and receive METs documents. • Client contacts Metadata server using X.509 certificates. • Metadata server stores METS items in a MySQL database. • Metadata server manages certificates. • NARA server verifies metadata integrity against schema and specification document.
Data Description Languages for Data Grids • BinX M. Westhead and M. Bull. Representing Scientific Data Sets on the Grid, EPPC, University of Edinburgh, Jan 2003. • DFDL M. Westhead. Data Format Description Language – Primer, Global Grid Forum Data Format Description Language Working Group