This document gives a technical overview of the Open Archives Initiative (OAI) protocol and of how a harvester collects metadata from repositories. It covers the supporting protocol requests (Identify, ListMetadataFormats, ListSets) and the harvesting protocol requests (ListRecords, ListIdentifiers, GetRecord). Examples show how requests are encoded over HTTP, how harvesting can be restricted by date range, set, and metadata format, and what a response looks like. The intended audience is service providers and data providers working within the OAI framework.
The OAI: technical overview
Herbert Van de Sompel & Carl Lagoze, Cornell University, Computer Science
OAI Open Meeting, Washington DC, January 23rd 2001
OAI metadata harvesting
[Diagram: harvester and repository; labels: support data, items, harvesting data]
• OAI protocol
• supporting requests
• harvesting requests
Selective harvesting - datestamps
• harvest within date range
[Diagram: repository containing records]
Selective harvesting - sets
• harvest within set
[Diagram: repository containing records, with sets S1 and S2]
Selective harvesting - metadata format
• harvest records with specific format
[Diagram: repository containing records]
HTTP encoding - requests
• BASE-URL: an.oa.org/OAI-script
• keyword arguments: verb=ListIdentifiers&set=S1
GET http://an.oa.org/OAI-script?verb=ListIdentifiers&set=S1
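As a minimal sketch of the request encoding above, the snippet below joins a base URL and keyword arguments into a GET URL using Python's standard `urllib.parse.urlencode`. The helper name `build_request_url` is hypothetical; the base URL and arguments are the ones shown on the slide.

```python
from urllib.parse import urlencode


def build_request_url(base_url, verb, arguments=None):
    """Return the GET URL for one OAI request (hypothetical helper)."""
    # The verb goes first, followed by any further keyword arguments.
    params = {"verb": verb}
    params.update(arguments or {})
    # urlencode percent-encodes reserved characters such as ":" in identifiers.
    return base_url + "?" + urlencode(params)


url = build_request_url("http://an.oa.org/OAI-script", "ListIdentifiers", {"set": "S1"})
print(url)
```

Passing the arguments as a dictionary keeps their order stable, so the generated URL is easy to compare with the one on the slide.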
HTTP encoding - responses
• xml namespaces
• response header
• response data
<?xml version="1.0" encoding="UTF-8"?>
<GetRecord xmlns="http://oai.namespace.uri"
    xmlns:xsi="http://w3.namespace.uri"
    xsi:schemaLocation="http://oai.namespace.uri http://oai.schemaURL">
  <responseDate>2000-19-01T19:30:30-04:00</responseDate>
  <requestURL>http://an.oa.org/OAI-script?verb=GetRecord&amp;identifier=oai%3AarXiv%3A0001&amp;metadataPrefix=oai_dc</requestURL>
  <record>record contents</record>
  additional records
</GetRecord>
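To illustrate the response layout (namespace declarations, a response header, then the response data), here is a small sketch that separates the header fields from the records with Python's standard `xml.etree.ElementTree`. The namespace URI is the placeholder used on the slide, not a real OAI namespace, the `responseDate` value is copied as shown there, and the function name `split_response` is hypothetical.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace URI copied from the slide (an assumption, not a real one).
OAI_NS = {"oai": "http://oai.namespace.uri"}

# Sample response modelled on the slide; values are copied as shown there.
SAMPLE_RESPONSE = b"""<?xml version="1.0" encoding="UTF-8"?>
<GetRecord xmlns="http://oai.namespace.uri">
  <responseDate>2000-19-01T19:30:30-04:00</responseDate>
  <requestURL>http://an.oa.org/OAI-script?verb=GetRecord&amp;identifier=oai%3AarXiv%3A0001&amp;metadataPrefix=oai_dc</requestURL>
  <record>record contents</record>
</GetRecord>"""


def split_response(xml_bytes):
    """Split a response into its header fields and its record elements."""
    root = ET.fromstring(xml_bytes)
    # The root tag looks like "{namespace}GetRecord"; keep only the verb name.
    verb = root.tag.split("}", 1)[-1]
    return {
        "verb": verb,
        "response_date": root.findtext("oai:responseDate", namespaces=OAI_NS),
        "request_url": root.findtext("oai:requestURL", namespaces=OAI_NS),
        "records": root.findall("oai:record", OAI_NS),
    }


parsed = split_response(SAMPLE_RESPONSE)
print(parsed["verb"], parsed["request_url"], len(parsed["records"]))
```

The parser unescapes `&amp;` in `requestURL`, so the value read back is the URL the harvester originally sent.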
OAI harvesting tools
harvester (service provider) / repository (data provider)
• Supporting protocol requests:
  • Identify
  • ListMetadataFormats
  • ListSets
• Harvesting protocol requests:
  • ListRecords
  • ListIdentifiers
  • GetRecord
Supporting protocol requests: Identify
• Repository name
• Base-URL
• Admin e-mail
• OAI protocol version
• Description Container
Supporting protocol requests: ListMetadataFormats
• REPEAT
  • Format prefix
  • Format XML schema
• /REPEAT
Supporting protocol requests: ListSets
• REPEAT
  • Set Specification
  • Set Name
• /REPEAT
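The fields returned by the three supporting requests can be modelled on the harvester side as plain data types. The sketch below is one possible shape, not a structure defined by the protocol: the class and attribute names are hypothetical, and the example values (schema URL, set name) are invented for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class IdentifyInfo:
    """Fields listed for the Identify response."""
    repository_name: str
    base_url: str
    admin_email: str
    protocol_version: str
    # Contents of the description container, kept here as raw strings.
    descriptions: List[str] = field(default_factory=list)


@dataclass
class MetadataFormat:
    """One repeated entry of the ListMetadataFormats response."""
    prefix: str
    schema: str


@dataclass
class SetInfo:
    """One repeated entry of the ListSets response."""
    spec: str
    name: str


# Hypothetical values, for illustration only.
formats = [MetadataFormat(prefix="oai_dc", schema="http://example.org/oai_dc.xsd")]
sets = [SetInfo(spec="S1", name="Example set one"), SetInfo(spec="S2", name="Example set two")]
print([f.prefix for f in formats], [s.spec for s in sets])
```

A harvester could use the format prefixes and set specifications collected this way as the `metadataPrefix` and `set` arguments of later harvesting requests.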
Harvesting requests: ListRecords
Request arguments:
* from=a
* until=b
* set=klm
* metadataPrefix=oai_dc
Response:
• REPEAT
  • Identifier
  • Datestamp
  • Metadata
  • About Container
• /REPEAT
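The ListRecords arguments combine the three kinds of selective harvesting (date range, set, metadata format). A minimal sketch of building such a request follows; it assumes that `metadataPrefix` is always sent while `from`, `until`, and `set` are optional, and the function name, parameter names, and example dates are hypothetical.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix, from_=None, until=None, set_spec=None):
    """Build a ListRecords URL, adding only the selective arguments that were given."""
    params = {"verb": "ListRecords"}
    if from_ is not None:
        params["from"] = from_        # lower bound of the datestamp range
    if until is not None:
        params["until"] = until       # upper bound of the datestamp range
    if set_spec is not None:
        params["set"] = set_spec      # restrict harvesting to one set
    params["metadataPrefix"] = metadata_prefix
    return base_url + "?" + urlencode(params)


full = list_records_url(
    "http://an.oa.org/OAI-script", "oai_dc",
    from_="2000-01-01", until="2000-12-31", set_spec="klm",
)
print(full)
```

Leaving out all three optional arguments yields a request for every record in the given metadata format.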
Harvesting requests: ListIdentifiers
Request arguments:
* from=a
* until=b
* set=klm
Response:
• REPEAT
  • Identifier
  • Datestamp
• /REPEAT
Harvesting requests: GetRecord
Request arguments:
* identifier=oai:mlib:123a
* metadataPrefix=oai_dc
Response:
• Identifier
• Datestamp
• Metadata
• About
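One way a harvester might use GetRecord, assumed here rather than stated on the slides, is to take a list of identifiers (for instance from an earlier ListIdentifiers response) and request each record individually. The sketch below only builds the request URLs and performs no network access; the function name `get_record_urls` is hypothetical and the base URL is the one used in the earlier HTTP encoding example.

```python
from urllib.parse import urlencode


def get_record_urls(base_url, identifiers, metadata_prefix):
    """Return one GetRecord URL per identifier, in the given order."""
    urls = []
    for identifier in identifiers:
        params = {
            "verb": "GetRecord",
            "identifier": identifier,          # ":" is percent-encoded as %3A
            "metadataPrefix": metadata_prefix,
        }
        urls.append(base_url + "?" + urlencode(params))
    return urls


urls = get_record_urls("http://an.oa.org/OAI-script", ["oai:mlib:123a"], "oai_dc")
print(urls[0])
```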