110 likes | 223 Vues
This document provides a comprehensive update on the current status of ISOcat, focusing on standardization processes, thematic domain groups (TDGs), and data categories. It discusses the completion of the ISO ballot for TDG members, the implementation of a standardization workflow, and the development of bulk load data categories. Additional topics include public data category specifications, new features in the relation registry, and ongoing support from the CLARIN-NL helpdesk. Stay informed on the latest ISOcat initiatives and opportunities for contribution.
E N D
ISOcat status Menzo.Windhouwer@mpi.nl
Outline • ISOcat forum • Thematic Domain Groups • Standardization • Bulk load data categories • Public DCSs • Other changes • Container data categories • Relation Registry • CLARIN-NL Helpdesk CLARIN-NL - Call 1 - ISOcat status
ISOcat forum • http://www.isocat.org/forum/ • General support forum • A forum for Thematic Domain Group • But a TDG (chair) has to request it, so we’ll know the forum will be monitored by the TDG members • A (private) forum for a group • Send a request to isocat@mpi.nl • Send a forum mediated email to the DC owner, a TDG/group member CLARIN-NL - Call 1 - ISOcat status
Thematic Domain Groups • The ISO ballot for TDG members has ended • Metadata, Morposyntax and Terminology are still the more active TDGs • TDGs in the process of being established: • Translation • Sign languages • Audio CLARIN-NL - Call 1 - ISOcat status
Standardization • The standardization workflow has been implemented • But needs some tweaks • Only TDG members can currently submit DCs for standardization • The metadata TDG will start standardizing in October CLARIN-NL - Call 1 - ISOcat status
Bulk load data categories • http://www.isocat.org/12620/ • DCIF Relax NG schema now also contains Schematron rules to fully check data category specifications • Supported by XML editors like oXygen • ISOcat can export DCIF • But for large DCSs/profiles this still takes quite long • ISOcat can import DCIF • Contact isocat@mpi.nl to upload your DCS via DCIF CLARIN-NL - Call 1 - ISOcat status
Public DCSs • GOLD ontology: • My Workspace > Public > RELISH > GOLD 2010 • But not all concepts make good data categories • Upcoming: • ISO 639: language (family) codes • Consensus on the PIDs and mapping • Needs better handling of large DCSs • TEI Header • Needs container data category type CLARIN-NL - Call 1 - ISOcat status
Other changes • DC Reference vocabulary • New dcr:valueDatcat attribute and element <feat name=“gender” dcr:datcat=“…/DC-1297” value=“masculine” dcr:valueDatcat=“…/DC-1883” /> • Many UI fixes • Contact isocat@mpi.nl or use the forum to report your own issues CLARIN-NL - Call 1 - ISOcat status
Container data categories • At the TC 37 plenary we got the green light to add the container data category type lexicon language entry alphabet japanese ipa lemma writtenForm CLARIN-NL - Call 1 - ISOcat status
Relation Registry • Implementation has started September 2010 • First focus on RESTful web services for CMDI • http://localhost:8080/rr/rest/set/cmdi • http://localhost:8080/rr/rest/set/cmdi/relations?relation=sameAs&resource=dc:language • RDF-based quad store • Contact isocat@mpi.nl to upload your own set of relations • Which relations? • OWL inspired (ontology): • sameAs, distinct, subClassOf/superClassOf • SKOS inspired (taxonomy): • broader/narrower, related • Partitive relations (OWL patterns): • partOf, directPartOf, indirectPartOf • … CLARIN-NL - Call 1 - ISOcat status
CLARIN-NL Helpdesk • Paul van Caspel • http://lux102/trac • helpdesk.clarin@uu.nl CLARIN-NL - Call 1 - ISOcat status