The Alexandria Digital Library Project is focused on creating a distributed digital library (DL) framework, particularly for georeferenced environmental information. Supported by NSF, initiatives like the Alexandria Digital Earth Prototype (ADEPT) aim to enhance interoperability, scalability, and personalized discovery across heterogeneous information systems. The project encompasses multiple activities, methodologies, and components to support diverse collections—from extremely large to small—while implementing advanced search functionality and metadata standards. Through effective client-server architecture and collaboration with various organizations, the project enhances access to valuable environmental data.
DIGITAL LIBRARIES AND ENVIRONMENTAL INFORMATION Terence R. Smith Alexandria Digital Library Project
OVERVIEW • DL development activities • NSF supported activities • Alexandria Digital Earth Prototype (ADEPT) • NSDL model of a distributed DL • Extension to a heterogeneous DL • Example of distributed DL for environmental information
NSF-SUPPORTED DL ACTIVITIES • DLI-1 • 94-98 • 6 projects • DLI-2 • 99-05 • About 30 projects • NSDL • 00-06 • About 70 projects • DLESE • 99-? • 1 project
COMPONENTS OF A DISTRIBUTED DL • SINGLE SYSTEM • Client(s) • multifunction • Search middleware • Collection(s) • Item metadata • Collection metadata • Items • Collections building services • Metadata entry • Other metainformation services • Gazetteers, thesauri, knowledge bases,… • MULTIPLE (HETEROGENEOUS) SYSTEMS
ADEPT GOALS • Goals • distributed digital library for georeferenced information • services supporting DL federation and interoperation • personalized “learning spaces” • Scalability • many collections • collections, very large to very small • extreme heterogeneity
INTEROPERABILITY LANDSCAPE [Figure: protocols placed along three axes: increasing functionality; increasing structure, standardization; increasing generality. Labels: Z39.50+MARC+AACR2, SDLIP, GDLIP, HTTP+HTML, OAI, SOAP, ADEPT]
ADEPT ARCHITECTURE (HIGH-LEVEL) • uniform client services • item-level metadata mapped to search buckets (high-level, typed fields with rich search semantics) • uniform collection-level metadata includes coverage histograms • plugins support common collection implementations [Figure labels: client, collection discovery service, middleware, RDBMS, Z39.50, proxy, collection, personal collection, item]
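The coverage histograms mentioned on this slide can be illustrated with a small sketch. It assumes each item carries a hypothetical `year` field and bins items by decade; the field name, the bin size, and the output shape are assumptions for illustration, not the ADEPT collection-level metadata format.

```python
from collections import Counter


def coverage_histogram(items, bin_size=10):
    """Count items per time bin from item-level year metadata (hypothetical field)."""
    counts = Counter()
    for item in items:
        year = item.get("year")
        if year is None:
            continue  # items without a date do not contribute to coverage
        bin_start = (year // bin_size) * bin_size
        counts[bin_start] += 1
    # Sort by bin so the histogram reads chronologically.
    return dict(sorted(counts.items()))


# Illustrative items only; not drawn from any real collection.
items = [
    {"id": "a", "year": 1987},
    {"id": "b", "year": 1991},
    {"id": "c", "year": 1994},
    {"id": "d"},
]
histogram = coverage_histogram(items)
print(histogram)
```

A client could use such a summary to decide whether a collection is worth querying for a given time range before sending it a search.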
CORE ARCHITECTURE (2/2) • Unifying threads: • common collection-level metadata • “bucket” framework for item-level metadata • Buckets • transparent metadata aggregation system • = Dublin Core plus: • search-oriented fields • strong typing • search semantics • explicit representation of metadata mappings • Items… • map native metadata to buckets • Collections… • index mapped metadata • aggregate mappings • compute statistics • Collection discovery service… • indexes collection-level metadata & statistics
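A minimal sketch of the idea that items map native metadata to typed buckets while keeping the mapping explicit. The bucket names (`title`, `date`, `geographic-location`), the native field names, and the `FieldMapping` class are hypothetical; the slides do not give the actual ADEPT bucket definitions.

```python
from dataclasses import dataclass
from datetime import date
from typing import Any, Callable


@dataclass(frozen=True)
class FieldMapping:
    """Explicit record of how one native field feeds one bucket."""
    native_field: str
    bucket: str
    convert: Callable[[Any], Any]  # enforces the bucket's type


def parse_bbox(text):
    """Parse 'west,south,east,north' into a tuple of four floats."""
    west, south, east, north = (float(part) for part in text.split(","))
    return (west, south, east, north)


# Hypothetical mappings for one collection's native schema.
MAPPINGS = [
    FieldMapping("ttl", "title", str.strip),
    FieldMapping("pubdate", "date", date.fromisoformat),
    FieldMapping("bbox", "geographic-location", parse_bbox),
]


def map_to_buckets(native_record, mappings=MAPPINGS):
    """Return bucket values, each tagged with the native field it came from."""
    buckets = {}
    for mapping in mappings:
        if mapping.native_field in native_record:
            value = mapping.convert(native_record[mapping.native_field])
            buckets.setdefault(mapping.bucket, []).append(
                {"value": value, "source_field": mapping.native_field}
            )
    return buckets


record = {
    "ttl": "  Coastal survey map ",
    "pubdate": "1998-05-01",
    "bbox": "-120.5,34.0,-119.5,35.0",
    "internal_id": "x17",  # unmapped native field, ignored by the buckets
}
buckets = map_to_buckets(record)
print(buckets)
```

Keeping `source_field` next to each value is one way to make the mapping visible to clients, so a search hit can be traced back to the native metadata that produced it.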
ADEPT IMPLEMENTATION • CLIENT • web browser • JIGI • SDLIP proxy • web intermediary/XML-to-HTML converter • HTTP transport • RMI transport • MIDDLEWARE • client-side services (Java classes) • core functionality • access control (service- and collection-level) • query fan-out & results merging • query result ranking • result set caching • configuration file • server-side interface (Java interfaces) • SERVER • JDBC • Bucket99 driver • query translator • proxy driver • RDBMS • group driver • configuration files, scripts [Figure labels: HTTP and XML on the connections between tiers; callouts: ranking methods, access control mechanisms]
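The middleware functions named on this slide (query fan-out, results merging, result ranking, result set caching) can be sketched as follows. This is not the ADEPT middleware, which the slide describes as Java classes and interfaces; the `Middleware` class, the toy term-count score, and the in-memory collections are all assumptions made for illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def make_collection(records):
    """Build a toy collection whose score is the query term count in the title."""
    def search(query):
        term = query.lower()
        hits = []
        for rec in records:
            score = rec["title"].lower().split().count(term)
            if score:
                hits.append({"id": rec["id"], "title": rec["title"], "score": score})
        return hits
    return search


class Middleware:
    def __init__(self, collections):
        self.collections = collections  # collection name -> search callable
        self._cache = {}                # query string -> merged result list

    def search(self, query):
        if query in self._cache:        # result set caching
            return self._cache[query]
        workers = max(1, len(self.collections))
        with ThreadPoolExecutor(max_workers=workers) as pool:
            # Fan the query out to every collection in parallel.
            futures = {name: pool.submit(fn, query)
                       for name, fn in self.collections.items()}
            merged = []
            for name, future in futures.items():
                for hit in future.result():
                    merged.append({"collection": name, **hit})
        # Rank merged hits: highest score first, then a stable tie-break.
        merged.sort(key=lambda h: (-h["score"], h["collection"], h["id"]))
        self._cache[query] = merged
        return merged


maps = make_collection([{"id": "m1", "title": "Kelp forest map"},
                        {"id": "m2", "title": "Road map"}])
photos = make_collection([{"id": "p1", "title": "Kelp kelp aerial photo"}])
middleware = Middleware({"maps": maps, "photos": photos})
results = middleware.search("kelp")
print([(r["collection"], r["id"], r["score"]) for r in results])
```

A real deployment would also apply the service-level and collection-level access control listed on the slide before fanning out; that step is omitted here.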
REFERENCE SERVICES • Gazetteer protocol developed • collaborated with ESRI, NGS • formal definition of gazetteer • characterizes gazetteer services • Lots of interest • within and outside InterLib: USGS, NASA, UMass, SRI,... • our gazetteer • protocol itself • Use of gazetteer in semantic mappings between geoinformation and text • Prelude to additional reference services • thesaurus/ontology services
A DISTRIBUTED DL FOR EI • NSDL core integration system (CIS) model • Central metadata repository • Common metadata standards • Dublin core and others • Integration of ADEPT nodes • Conversion to ADEPT search buckets • ADEPT middleware available
EXTENDING CORE CIS ARCHITECTURE • Extending the spectrum of search interoperability • collections with non-DC metadata schemas • distributed and heterogeneous collections • richer search functionality • geospatial search, thesaurus/concept space search, ... • Supporting the creation of new and personalized collections • Providing access to thesaurus and gazetteer services
EXTENDING SEARCH INTEROP [Figure: three numbered steps: 1. map (per collection provider); 2. harvest & interpret; 3. harvest & interpret. Labels: metadata repository, OAI harvest, portal, metadata, ADEPT, ADEPT collection discovery, ADEPT client]
THREE SETS OF SERVICES • Search over heterogeneous collections • mapping between OAI metadata and ADEPT search buckets • installations of ADEPT middleware • Collection building services • metadata entry tool for ADN metadata content standard • personalized collections of existing and new metadata • Information access services • gazetteer and thesaurus protocols • convert textual geospatial references
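Converting textual geospatial references can be pictured as a gazetteer lookup that turns place names found in text into spatial footprints. The sketch below uses a hypothetical in-memory gazetteer with made-up place names and bounding boxes; it does not implement the ADEPT gazetteer protocol, and the simple bounding-box union ignores cases such as footprints that cross the antimeridian.

```python
import re

# Hypothetical gazetteer: place name -> (west, south, east, north).
# Names and coordinates are invented for illustration.
GAZETTEER = {
    "example bay": (-120.0, 34.0, -119.0, 35.0),
    "sample ridge": (10.0, 45.0, 10.5, 45.5),
}


def find_footprints(text, gazetteer=GAZETTEER):
    """Return (name, bbox) for every gazetteer name that appears in the text."""
    lowered = text.lower()
    found = []
    for name, bbox in gazetteer.items():
        # Word boundaries avoid matching a name inside a longer word.
        if re.search(r"\b" + re.escape(name) + r"\b", lowered):
            found.append((name, bbox))
    return found


def union_bbox(boxes):
    """Smallest box covering all given boxes (naive, no antimeridian handling)."""
    wests, souths, easts, norths = zip(*boxes)
    return (min(wests), min(souths), max(easts), max(norths))


text = "Water samples collected in Example Bay and along Sample Ridge."
matches = find_footprints(text)
footprint = union_bbox([bbox for _, bbox in matches])
print(matches)
print(footprint)
```

The resulting footprint could then be stored in a geographic search bucket so that a text-only record becomes discoverable through geospatial search.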
SUMMARY • Infrastructure and systems exist to build distributed DL for environmental information • Model of NSDL + ADEPT system • Example task: LTER DL integration
DLESE interactions • Adoption of the ADEPT architecture • middleware • RDBMS-based and local collections • Co-development of ingest tools • resource cataloger • spatial footprint specification tool • metadata/collection administration support • Co-development of a metadata content standard for learning objects • Exposure to user community • GDLIP