230 likes | 349 Vues
Persistent Archive Research Group GGF5 Reagan W. Moore San Diego Supercomputer Center 9500 Gilman Drive, La Jolla, CA 92093-0505 Phone: 858 534-5073 FAX: 858 534-5152 E-mail: moore@sdsc.edu http://www.npaci.edu/DICE/. Working Group Goals.
E N D
Persistent Archive Research Group GGF5 Reagan W. Moore San Diego Supercomputer Center 9500 Gilman Drive, La Jolla, CA 92093-0505 Phone: 858 534-5073 FAX: 858 534-5152 E-mail: moore@sdsc.edu http://www.npaci.edu/DICE/
Working Group Goals • Define a persistent archive architecture that builds upon virtual data grid technology • Identify minimal capabilities that need to be implemented in virtual data grids • Present final draft at GGF-6 • Demonstrate working prototypes • SDSC (4 persistent archives - NARA, LC, NSDL, CDL) • Another site?
Preservation Infrastructure Old Application Old Operating System Old Storage System Old Display System Digital Entity
Technology Management New Application New Operating System Wrap Storage System Wrap Display System Old Storage System Old Display System Migrate Encoding Format Digital Entity
Preservation Approaches • Storage repository abstraction • Set of operations needed to manipulate data in a storage repository • Information repository abstraction • Set of operations needed to manipulate a catalog in an information repository • Logical name space • Persistent global identifier that is independent of the storage repository • Authenticity • Audit trails for all changes to documents, persistent collection, ACLs, assurance copies
Review Persistent Archive Draft Paper • Identify missing capabilities • Reduce proposed set of capabilities (130) to the minimal required set • Identify sites planning to implement a persistent archive
Data Grid Survey • Participants providing information • SAM - Igor Terekhov (Fermi) • Magda - Torre Wenaus (Brookhaven National Laboratory) • SDM - Scott Studham (Pacific Northwest National Laboratory) • JASMine Chip Watson (Jefferson Laboratory) • GDMP - Heinz Stockinger (CERN) • Globus tool kit - Ann Chervenak (ISI/USC) • SRB - Arcot Rajasekar (SDSC)
Summary • 29 core capabilities have been defined for the implementation of persistent archives from data grids