This document outlines the global initiative to improve the publishing and indexing of taxonomic data within the GBIF network through the Darwin Core Archive. Key objectives include developing capacity for effective data publication, creating a simple exchange format, and promoting a standardized approach to documenting taxonomic data. The outcomes aim to integrate taxonomy into broader biodiversity data management, improving data interoperability and recognition of taxonomy's significance. This solution incorporates a suite of publication tools and facilitates collaboration among global biodiversity stakeholders.
A Darwin Core Archive solution to publishing and indexing taxonomic data within the GBIF network
Global Biodiversity Information Facility (WWW.GBIF.ORG)
David Remsen, ECAT Program Officer, September 2010
Thanks: Peter Desmet, Canadensys (graphics)
Enabling global discovery: Objectives
• Develop capacity to document and publish taxonomic data
• A simple exchange format
• Suite of publication tools
• Promote the publication of taxonomic data in a common format
• Build and maintain an index of published checklists
• Build services on this index that address user needs in the GBIF network
Enabling global discovery: Outcomes
• Embed taxonomy into large-scale biodiversity data and information management
• Improved interoperability among resources
• Improved precision and recall within resources
• Increased efficiency in taxon-related linking, mapping, data mining, and data management
• Increased recognition of the value and relevance of taxonomy within all biodiversity information interchange (large and small)
Darwin Core Archive Data Format
Darwin Core
• Ratified in 2009
• Significant additions/refinements
• Set of terms: http://rs.tdwg.org/dwc/terms/index.htm
• Simple Darwin Core (subset)
• Express as text: http://rs.tdwg.org/dwc/terms/guides/text/index.htm
Core components: single file
• Classification
• Synonymy
• Publication Details
• Simple to Export
• Simple to Manage
• Comma-Separated Values Text File
[Diagram: a single Taxon core file]
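As a rough illustration of the single-file core, the sketch below writes a small taxon table as comma-separated text. The column names are Darwin Core terms; the taxa, identifiers, and publication string are invented placeholders, and the choice of columns is an assumption rather than a required set.

```python
import csv
import io

# Column headers are Darwin Core terms; every row below is invented sample data.
FIELDS = [
    "taxonID",
    "scientificName",
    "taxonRank",
    "parentNameUsageID",    # classification: points at the parent taxon
    "acceptedNameUsageID",  # synonymy: points at the accepted name
    "taxonomicStatus",
    "namePublishedIn",      # publication details
]

ROWS = [
    ["1", "Exemplidae", "family", "", "", "accepted", ""],
    ["2", "Exemplum", "genus", "1", "", "accepted", ""],
    ["3", "Exemplum primum", "species", "2", "", "accepted",
     "Hypothetical Journal 1: 1"],
    ["4", "Exemplum antiquum", "species", "2", "3", "synonym", ""],
]


def write_core(rows, fields=FIELDS):
    """Return the core taxon table as CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(fields)
    writer.writerows(rows)
    return buf.getvalue()


core_csv = write_core(ROWS)
print(core_csv)
```

Classification and synonymy are both carried by identifier columns that point back into the same file, which is what lets one flat table hold a whole checklist.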
Extending Darwin Core
• Extensions defined via simple schema
• Darwin Core or other terms
• Linked to controlled vocabularies
• One taxon, many extension records
• Simple to Export
• Simple to Manage
• Comma-Separated Values Text File
[Diagram: the Taxon core file linked one-to-many to two extension files, Types and Specimens and Bibliography]
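The one-to-many link between the core and an extension can be sketched as a lookup on the shared taxon identifier. This is only an illustration: the file contents are invented, the extension columns (`typeStatus`, `institutionCode`, `catalogNumber`) are Darwin Core terms chosen for the example, and `attach` is a hypothetical helper, not part of any GBIF tool.

```python
import csv
import io
from collections import defaultdict

# Invented core file: one row per taxon.
CORE = """taxonID,scientificName
1,Exemplum primum
2,Exemplum secundum
"""

# Invented extension file: zero or more rows per taxon, keyed by taxonID.
TYPES = """taxonID,typeStatus,institutionCode,catalogNumber
1,holotype,XYZ,0001
1,paratype,XYZ,0002
"""


def attach(core_text, extension_text, key="taxonID"):
    """Map each core taxonID to the list of extension rows that reference it."""
    grouped = defaultdict(list)
    for row in csv.DictReader(io.StringIO(extension_text)):
        grouped[row[key]].append(row)
    return {
        row[key]: grouped.get(row[key], [])
        for row in csv.DictReader(io.StringIO(core_text))
    }


linked = attach(CORE, TYPES)
for taxon_id, records in linked.items():
    print(taxon_id, [r["typeStatus"] for r in records])
```

A taxon with no extension rows simply maps to an empty list, so the extension file only needs rows where there is something to say.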
Metafile describes the set
[Diagram: the metafile describes the core file and each extension file (Types and Specimens, Bibliography); each extension is linked one-to-many to the core]
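To make the role of the metafile concrete, here is a minimal sketch that generates a descriptor for one core file and one extension file, following the layout in the Darwin Core text guide. The file names, the selected columns, and the extension `rowType` URI are assumptions for illustration and should be checked against the published extension definitions.

```python
import xml.etree.ElementTree as ET

DWC = "http://rs.tdwg.org/dwc/terms/"
TEXT_NS = "http://rs.tdwg.org/dwc/text/"
# Assumed rowType for a types-and-specimens extension; verify before use.
TYPES_ROW_TYPE = "http://rs.gbif.org/terms/1.0/TypesAndSpecimen"


def add_table(archive, tag, row_type, location, id_tag, terms):
    """Describe one CSV file: its location, its ID column, and its other columns."""
    table = ET.SubElement(archive, tag, {
        "rowType": row_type,
        "encoding": "UTF-8",
        "fieldsTerminatedBy": ",",
        "linesTerminatedBy": "\\n",  # written literally as backslash-n
        "fieldsEnclosedBy": '"',
        "ignoreHeaderLines": "1",
    })
    files = ET.SubElement(table, "files")
    ET.SubElement(files, "location").text = location
    # Column 0 holds the taxon identifier in both core and extension files.
    ET.SubElement(table, id_tag, {"index": "0"})
    for index, term in enumerate(terms, start=1):
        ET.SubElement(table, "field", {"index": str(index), "term": DWC + term})
    return table


def build_metafile():
    archive = ET.Element("archive", {"xmlns": TEXT_NS})
    add_table(archive, "core", DWC + "Taxon", "taxa.csv", "id",
              ["scientificName", "taxonRank", "parentNameUsageID"])
    add_table(archive, "extension", TYPES_ROW_TYPE, "types.csv", "coreid",
              ["typeStatus", "catalogNumber"])
    return ET.tostring(archive, encoding="unicode")


print(build_metafile())
```

The core uses an `id` element and each extension uses `coreid`, which is how a consumer knows which column joins the files together.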
Core + Set of Extensions: “GNA Simple Exchange Format”
[Diagram: the Taxa core file linked one-to-many to four extension files (Vernacular Names, Bibliography, Types and Specimens, Distribution); the metafile describes the set]
Metadata documents the resource
• GBIF EML profile
[Diagram: a metadata document accompanies the set: the Taxa core file, its one-to-many extension files (Vernacular Names, Bibliography, Types and Specimens, Distribution), and the metafile that describes them]
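Putting the pieces together, an archive is commonly distributed as a single zip file holding the metadata document, the metafile, and the data files. The sketch below assumes that packaging and uses placeholder file names and contents; it does not produce a valid EML document or metafile.

```python
import io
import zipfile

# Placeholder contents only; real files would hold the EML metadata,
# the metafile, and the exported core and extension tables.
PLACEHOLDER_FILES = {
    "eml.xml": "placeholder for resource metadata (GBIF EML profile)",
    "meta.xml": "placeholder for the metafile describing the data files",
    "taxa.csv": "taxonID,scientificName\n1,Exemplum primum\n",
    "vernacular.csv": "taxonID,vernacularName\n1,first example\n",
}


def build_archive(files):
    """Zip the given name -> text mapping in memory and return the bytes."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        for name, text in files.items():
            zf.writestr(name, text)
    return buf.getvalue()


archive_bytes = build_archive(PLACEHOLDER_FILES)
with zipfile.ZipFile(io.BytesIO(archive_bytes)) as zf:
    names = zf.namelist()
print(names)
```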
Validator. Status: Under Evaluation. http://tools.gbif.org/dwca-validator/
Darwin Core Archive Publishing Options
Integrated Publishing Toolkit
• Compose EML Metadata
• Connect to database
• Upload Data
• Transform to DWCA
• Publish via GBIF
http://ipt.gbif.org
Status: Stable release, end 2010
Guidelines and Best Practices
• DB Admin skills
• Database export
• No tools required
• Successful pilots: Ireland, NBN UK, Norway, Avian Knowledge Network, IPNI, IRMNG
Status: Drafts for November campaign (see roadmap)
Authoring Descriptor XML Metafile. Status: Ready for Review. http://tools.gbif.org/dwca-assistant/
Excel Spreadsheet Templates. Status: Ready for Review/Testing.
Spreadsheet Processor. Status: Ready for Review. http://tools.gbif.org/spreadsheet-processor/
Checklist Bank. http://ecat-dev.gbif.org/ Status: Dev version in place. Integration with GBIF data portal: 2011.
Roadmap
• Evaluation, testing, and refinement: Q4 2010
• Consolidate docs and publishing for ver. 1 Simple Exchange Format using DWC-A
• Target current taxonomic data export publishers
• Small grants to pilot DWC-A exports
• Seed funds to GBIF Nodes
• Publish regional and thematic species checklists
• Evaluate 1.0 extensions and vocabularies