This presentation provides an overview of a national terminology service, exploring contexts, examples, and issues. It discusses the importance of structured representations in the digital environment and the evolution from managing information assets to multiple entities and relationships. Examples of services and terms are highlighted, emphasizing web-based access to vocabularies for metadata creation and searching. The presentation also addresses challenges and opportunities in developing shared resources and capacity building for terminologies.
A national terminology service? Lorcan Dempsey, VP Research and Chief Strategist, OCLC. JISC Terminologies Workshop, London, February 13, 2004
Overview • JISC • Contexts • Examples • Issues A partial presentation! Assumption: M2M services with a human face. Assumption: discussion is about potential contours of a national terminologies service, rather than solving complex community discussions about vocabularies.
Contexts A changed digital environment
I know it when I see it … Structured representations of Personal and organizational names Concepts/categories Place names Audience levels Resource types Species names …. Labels: KOS (knowledge organization systems) Authorities Taxonomies Ontologies … Different worldviews, experience, expectations, legacy! Different motivations: research, service, …
The big change … Then: ‘information assets’ were primary objects of interest. Subjects, etc, were seen as attributes of assets. Systems built to reflect this. Now: We manage multiple entities, their representations and relationships: Assets Works; manifestations; copies Rights Collections Services Concepts Names Places … … …
Simple contrived example! Web services Validate Automatic class. Navigation Exchange Mapping Object metadata repository KOS service Name authority service • Examples: • Discovery environment • Editing environment, e.g. DSpace • Routing • Objects • Collections • Services • Terms • Users • Institutions • Rights • Schemes • Rules • Version Control Data assets • Ingest • Export • Search • Update • ID Creation • ID Check • Version Control • Analysis • Validation • Stats Data services • Validation • Relation • Synchronization • Resolution • Authorization • Format Conversion Application services • Search • Request • Question • Navigate • Alert • Use Tracker (IFM) • Workflow
Terms and term sets are resources Release value in a web environment Webulated URI for names, concepts, … Concepts/names/etc are ‘ex-citable’ Traversable relationships Build services on top of this May be manifest through several services E.g. URI for a Dewey number: info:ddc/22/eng//004.678 Example services Mappings Caption, etc Navigate Bind classes of resources based on ‘link’ Authors Libraries Popular?
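To make the idea of a citable concept identifier concrete, here is a minimal sketch that splits the Dewey URI from this slide into its parts. The field names (edition, language, class number) are an assumption inferred from the single example above, not a statement of the official syntax, and `parse_ddc_uri` is a hypothetical helper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DeweyUri:
    """Parts of a Dewey identifier, as guessed from the slide's one example."""
    edition: str    # assumed: DDC edition, e.g. "22"
    language: str   # assumed: language code, e.g. "eng"
    number: str     # assumed: class number, e.g. "004.678"


def parse_ddc_uri(uri: str) -> DeweyUri:
    """Split a URI shaped like info:ddc/<edition>/<language>//<number>."""
    prefix = "info:ddc/"
    if not uri.lower().startswith(prefix):
        raise ValueError(f"not an info:ddc URI: {uri!r}")
    rest = uri[len(prefix):]
    # The double slash separates the edition/language part from the class number.
    head, separator, number = rest.partition("//")
    if not separator or not number:
        raise ValueError(f"missing '//<number>' part: {uri!r}")
    parts = head.split("/")
    if len(parts) != 2 or not all(parts):
        raise ValueError(f"expected '<edition>/<language>' before '//': {uri!r}")
    edition, language = parts
    return DeweyUri(edition=edition, language=language, number=number)


if __name__ == "__main__":
    print(parse_ddc_uri("info:ddc/22/eng//004.678"))
```

Once a concept has a stable identifier like this, services such as mappings, captions and navigation can take it as input rather than a free-text label.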
Example Some preliminary OCLC developments (by request)
Knowledge org systems Plethora of vocabularies Incompatible approaches to encoding Few connections Education GEM Subjects, ERIC Thesaurus, LCSH, JACS, CIP (Classification of instructional programs) Cultural Heritage AAT, Thesaurus for Graphic Materials (TGM) Subjects & Genre Terms Not built for the web Link to concepts
Terminology services at OCLC: ‘Webulating’ knowledge organization Goal: to offer accessible, modular, web-based terminology services. Make vocabularies more available for Metadata creation Searching … Refine and extend mappings Represent vocabularies and mappings in major encoding and distribution standards, e.g., MARC, Zthes, TIF, OAI Prototype custom web services as appropriate to insert functionality in different workflows
Issues Some banal observations
JISC et al role • Act where it makes sense to do something once rather than many times • Remove redundancy from local operations • Create shared resources • Capacity build • Economies of scale and scope • Concentrate expertise and development effort • Terminology services lend themselves to this approach
Communities are the same and different E-science … Learning … Library … Cultural heritage … Biodiversity … Usage scenarios Capacity Expectations View of the world
Identify potential wins • Motivating use cases • Cross searching • Navigation/browsing • Metadata creation • Routing • … • Address compelling interest within capable communities • Emphasise diversity rather than universality (remember different values, legacy, …) • Scope will influence choices (research, prototype services, meet real needs) • Research into patterns of use and demand.
The past is another country .. • Need to think differently – using terminologies as resources in a distributed network environment calls for a different way of thinking • Unplug and play • Making functionality available within multiple workflows. • Bilateral development responsibility • Provider and consumer have development burden.
Avoid techeology • Techeology • substitution of ideology for engineering • manifest in dominance of acronym advocacy over service advocacy
Recombinant growth • Do not overspecify • Make several simple services available which encourage experimentation • For: • Online m2m and h2m interaction • Exchange • Selective harvest • Compare Google and Amazon APIs • Registry – webulation.
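As an illustration of "several simple services", the sketch below exposes three small operations over a vocabulary: validate a term, list its narrower terms, and walk up to the top term. The vocabulary is an invented toy hierarchy and every function name is hypothetical; a real service would sit behind a web interface and draw on an actual scheme.

```python
from typing import Dict, List, Optional

# Toy vocabulary, invented purely for illustration: term -> broader term.
BROADER: Dict[str, Optional[str]] = {
    "animals": None,
    "mammals": "animals",
    "birds": "animals",
    "cats": "mammals",
}


def _key(term: str) -> str:
    """Normalise a label so lookups ignore case and surrounding spaces."""
    return term.strip().lower()


def validate(term: str) -> bool:
    """Validation service: is the term part of the vocabulary?"""
    return _key(term) in BROADER


def narrower(term: str) -> List[str]:
    """Navigation service: terms one step below the given term."""
    key = _key(term)
    return sorted(t for t, parent in BROADER.items() if parent == key)


def path_to_top(term: str) -> List[str]:
    """Navigation service: the term followed by each broader term in turn."""
    key = _key(term)
    if key not in BROADER:
        raise KeyError(f"unknown term: {term!r}")
    path = [key]
    while BROADER[path[-1]] is not None:
        path.append(BROADER[path[-1]])
    return path


if __name__ == "__main__":
    print(validate("Cats"), narrower("animals"), path_to_top("cats"))
```

Each function is small enough to be called from a metadata editor, a search interface or a harvester, which is the kind of recombination the slide points to.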
Opportunity … Thank you, http://www.oclc.org/research/