NAL-Institutional Repository: A Case Study
CSIR Metadata Harvester
I.R.N. Goudar, Head, ICAST, NAL (goudar@nal.res.in)
National Symposium on Open Access and Building Institutional Repository
National Aerospace Laboratories, Bangalore 560 017, 21-23 Jan 2009
 
                
NAL-IR
• Started in 2003 using GSDL
• Adopted EPrints in 2005
• Plans to switch over to DSpace
• Presently about 3000 documents
IR Download Statistics
6000-10000 downloads per month from more than 120 countries
• USA 40%
• India 25%
• UK 10%
• Canada 6%
• Japan 5%
• China 3%
• Germany 3%
• France 3%
Metadata Harvesting
• Harvesting
  – In the OAI context, harvesting refers specifically to gathering metadata from a number of distributed repositories into a combined data store
• OAI-PMH (OAI Protocol for Metadata Harvesting)
  – OAI-PMH is a harvesting protocol for sharing metadata between services
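As a minimal sketch of what a harvester sends to a data provider, the snippet below builds an OAI-PMH ListRecords request URL. The base URL is a made-up placeholder (not the NAL-IR endpoint) and the helper name list_records_url is hypothetical. The arguments verb, metadataPrefix, from and set are standard OAI-PMH request arguments, and oai_dc is the unqualified Dublin Core format that OAI-PMH repositories are required to support.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix="oai_dc", from_date=None, set_spec=None):
    """Build a ListRecords request URL for an OAI-PMH data provider.

    base_url is the repository's OAI endpoint (assumed, supplied by the caller).
    from_date and set_spec are optional selective-harvesting arguments.
    """
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if from_date:
        params["from"] = from_date  # only records changed on or after this date
    if set_spec:
        params["set"] = set_spec    # only records belonging to this set
    return base_url + "?" + urlencode(params)


# Hypothetical endpoint, used only for illustration.
url = list_records_url("http://repository.example.org/oai", from_date="2009-01-01")
print(url)
```

Passing a from date lets a harvester fetch only what changed since its last run, which keeps repeated harvests of the same repository small.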
Interoperability through OAI-PMH Protocol*
• Data provider (e.g. institutional repository)
  – Maintains a repository
  – Exposes metadata according to a metadata standard (e.g. DC)
  – Registers with OAI
• Service provider
  – Registers with OAI
  – Extracts metadata from registered repositories ("harvest")
  – Provides services (e.g. central index)
[Diagram labels: IR-1, IR-2]
* http://www.openarchives.org/
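To illustrate the "central index" service a service provider might offer, here is a minimal sketch that merges already-harvested records from several data providers into one word index. It assumes the metadata has been reduced to simple dictionaries; the function name build_central_index, the repository names, identifiers and titles are all invented for illustration and do not describe any particular harvester's internals.

```python
def build_central_index(harvested):
    """Map each title word to the (repository, identifier) pairs that contain it.

    harvested: dict of repository name -> list of records, where each record
    is a dict with 'identifier' and 'title' keys (an assumed, simplified shape).
    """
    index = {}
    for repo_name, records in harvested.items():
        for record in records:
            entry = (repo_name, record["identifier"])
            for word in record["title"].lower().split():
                postings = index.setdefault(word, [])
                if entry not in postings:  # avoid duplicates for repeated words
                    postings.append(entry)
    return index


# Invented sample data standing in for two harvested repositories.
HARVESTED = {
    "IR-1": [{"identifier": "oai:ir1.example.org:1", "title": "Metadata harvesting basics"}],
    "IR-2": [{"identifier": "oai:ir2.example.org:7", "title": "Harvesting with OAI-PMH"}],
}

index = build_central_index(HARVESTED)
print(index["harvesting"])
```

A single lookup in the combined index returns matches from both repositories, which is the practical benefit of harvesting over searching each repository separately.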
Harvesting Software
• Harvesting software is needed to harvest metadata from OAI-compliant repositories (data providers)
• PKP Harvester from SFU
  – http://pkp.sfu.ca/harvester_download
• Arc from ODU
  – http://oaiarc.sourceforge.net/
CSIR Knowledge Harvester
• Set up at ICAST, NAL
• PKP Harvester
• Presently covers 4 CSIR labs
• About 5500 documents
Harvesting CSIR IRs
[Diagram]
• Deposit (metadata + full publication): tech reports, pre-prints, journal articles, presentations, theses, etc.
• Digital repositories: NAL, NCL, NIO, NPL, SERC, etc.
• Metadata passed via OAI-PMH to the service provider (ICAST, NAL)
• Access and dissemination: local intranet access, remote Internet access
EPrints and DSpace
• Widely used IR software
• Platform
  – EPrints: Unix/Linux, Perl, Apache, MySQL, XML, HTML
  – DSpace: Unix/Linux, Java, Tomcat or Apache, XML, HTML, Ant, PostgreSQL
• Installing, configuring, and maintaining archives built with these packages therefore requires software knowledge
OAI-PMH: Structure Model
[Diagram: a service provider (harvester) sends requests to several data providers and receives responses. Data providers shown: e-prints, images, OPAC, museum and archive repositories.]
• Requests: Identify, ListMetadataFormats, ListSets, ListIdentifiers, ListRecords, GetRecord
• Responses: general information, metadata formats, set structure, record identifier, metadata
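The request/response exchange in the structure model can be sketched from the harvester's side as follows. This is a minimal example, assuming a ListRecords response in oai_dc format; the sample XML, its identifier, title and resumption token are invented, and parse_list_records is a hypothetical helper name. The XML namespaces are the ones defined for OAI-PMH 2.0 and Dublin Core.

```python
import xml.etree.ElementTree as ET

# Namespaces used by OAI-PMH 2.0 responses and Dublin Core elements.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Invented response, shaped like a data provider's reply to ListRecords.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <responseDate>2009-01-21T10:00:00Z</responseDate>
  <request verb="ListRecords" metadataPrefix="oai_dc">http://repository.example.org/oai</request>
  <ListRecords>
    <record>
      <header>
        <identifier>oai:repository.example.org:42</identifier>
        <datestamp>2008-11-03</datestamp>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>A hypothetical technical report</dc:title>
          <dc:creator>Author, A.</dc:creator>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>token-123</resumptionToken>
  </ListRecords>
</OAI-PMH>"""


def parse_list_records(xml_text):
    """Return (records, resumption_token) extracted from a ListRecords response."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.findall("oai:ListRecords/oai:record", NS):
        records.append({
            "identifier": rec.findtext("oai:header/oai:identifier", namespaces=NS),
            "datestamp": rec.findtext("oai:header/oai:datestamp", namespaces=NS),
            "titles": [t.text for t in rec.findall(".//dc:title", NS)],
        })
    # A non-empty token means the provider has more records to send.
    token = root.findtext("oai:ListRecords/oai:resumptionToken", namespaces=NS)
    return records, (token or None)


records, token = parse_list_records(SAMPLE_RESPONSE)
print(records[0]["identifier"], records[0]["titles"], token)
```

When a token is returned, a harvester would typically repeat the ListRecords request with that resumptionToken as its only other argument until no token comes back.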
Some Useful References
• http://www.openarchives.org/
• To register as data provider
  – http://www.openarchives.org/pmh/
• For OAI-related tools
  – http://www.openarchives.org/pmh/tools/
• OAI Repository Explorer for interactive exploration and validation of OAI repositories
  – http://re.cs.uct.ac.za/