Sevim McCutcheon Monographs Cataloger, Assistant Professor Kent State University University Libraries ALA Midwinter Conference Cataloging Norms Interest Group January 16, 2010. Morphing Metadata: a Highly Automated Method of Cataloging Electronic Theses and Dissertations.
ETD Project Timeline and Today’s Topics
• OhioLINK Consortium’s ETD Center established, 2000
• First Kent State University ETD, November 2004
• OhioLINK statewide cataloging standards developed, 2005-2006
• Automated process created at KSU, 2005-2006 (“Cataloging Bot,” short for Cataloging Robot)
Cataloging ETDs
Why catalog ETDs?
• Keyword searching isn’t perfect! Precision & recall
Differences and similarities compared to cataloging print:
• Differences:
  • Author-supplied metadata is the basis of a MARC record
  • No physical object to track or examine
• Similarities:
  • It’s still a monograph, with a title page as the chief source of information
  • It’s still a dissertation, requiring complex subject analysis
OhioLINK Committee Discussions
• Born digital vs. reproduction
  • Available first, before book or microfiche = born digital
• Published vs. unpublished
  • Born digital, thus published
OCLC Bib Formats + Standards 3.1 Theses and Dissertations
Two types:
A. Those that exist as digital originals
B. Those that are scanned versions of paper originals
“Digital originals should be treated as published items and cataloged as original electronic publications.”
See also: http://www.oclc.org/support/documentation/worldcat/cataloging/electronicresources/
When ETDs are considered published, what changes?
• Fixed Field (Leader/06): Record Type code is now “a,” language material; not “t” for manuscript
• Fixed Field Country of Publication code and 260 $a, $b: include place of publication and publisher, which is the university
• Fixed Field Government Publication Code might be affected: American state universities use GPub (008/28) = s
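The fixed-field changes above can be illustrated with a short Python sketch. This is not the presenters' code (their tooling is Perl-based); the function name `mark_as_published` is hypothetical, and the country code `ohu` (the MARC country code for Ohio) is an assumption added for illustration, since the slide only says the university is the place of publication.

```python
def mark_as_published(leader: str, field_008: str, country: str = "ohu") -> tuple[str, str]:
    """Return (leader, 008) with the codes the slide lists for a published ETD.

    Assumptions: `leader` is a 24-character MARC leader, `field_008` is a
    40-character 008 field, and `country` is a 3-character MARC country code
    ("ohu" for Ohio is an illustrative default, not taken from the slides).
    """
    if len(leader) != 24 or len(field_008) != 40 or len(country) != 3:
        raise ValueError("expected a 24-char leader, 40-char 008, 3-char country code")

    # Leader/06: record type "a" (language material) instead of "t" (manuscript).
    leader = leader[:6] + "a" + leader[7:]

    # 008/15-17: country of publication code.
    field_008 = field_008[:15] + country + field_008[18:]

    # 008/28: government publication code "s" for a state university.
    field_008 = field_008[:28] + "s" + field_008[29:]

    return leader, field_008
```

The 260 $a and $b subfields (place and publisher) are variable-length data and would be added to the record separately; only the fixed-position codes are shown here.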
[Diagram: the OhioLINK ETD Center web site (submit, search) and the NDLTD network web site]
Cataloging Bot interactions with ETD Center
[Workflow diagram. Participants: Student, College Gatekeeper, Library ETD Coordinator, ETD Center, Cataloging Bot, Catalog, Cataloger]
1. Submission
2. Submission notification
3. Notification forwarded
4. Approval
5. Publication notification
6. Retrieve metadata
7. Send MARC to catalog
8. Cataloging notification
Step 5: Publication Notification
• The standard Marcview.cgi output is not true MARC, so it is unusable!
ETD Center OAI-PMH URLs
• ETD-MS: ETD Metadata Standard (more complete than DC)
  http://www.ohiolink.edu/etd/oai.php?verb=GetRecord&metadataPrefix=oai_etdms&identifier=kent1122136806
• DC: Dublin Core
  http://www.ohiolink.edu/etd/oai.php?verb=GetRecord&metadataPrefix=oai_dc&identifier=kent1122136806
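Both URLs follow the same OAI-PMH GetRecord pattern and differ only in the metadataPrefix value. A minimal Python sketch of building them is shown here; the helper name `get_record_url` is hypothetical, and the base URL is the one shown on the slide, which may no longer be live.

```python
from urllib.parse import urlencode

# Base URL as it appears on the slide (may no longer resolve).
BASE_URL = "http://www.ohiolink.edu/etd/oai.php"


def get_record_url(identifier: str, metadata_prefix: str = "oai_etdms") -> str:
    """Build an OAI-PMH GetRecord URL for one ETD.

    `metadata_prefix` is "oai_etdms" for ETD-MS or "oai_dc" for Dublin Core.
    """
    query = urlencode(
        {
            "verb": "GetRecord",
            "metadataPrefix": metadata_prefix,
            "identifier": identifier,
        }
    )
    return f"{BASE_URL}?{query}"


etdms_url = get_record_url("kent1122136806")
dc_url = get_record_url("kent1122136806", metadata_prefix="oai_dc")
```

Fetching the record would then be an ordinary HTTP GET on the resulting URL.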
How The Bot Works
1. Receives email
2. Parses email for document ID
3. Retrieves metadata
4. Parses record for useful data
5. Builds MARC record
6. Sends record to local catalog
7. Notifies staff via email notification
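The second step, finding the document ID in the notification email, could look roughly like the Python sketch below. The slides do not show the email format, so the sample message text is invented for illustration, and the pattern simply assumes IDs look like the ones in the slide URLs ("kent" followed by digits). The function name `find_document_id` is hypothetical.

```python
import re
from typing import Optional

# Assumption: document IDs look like "kent1122136806", as in the slide URLs.
DOC_ID_PATTERN = re.compile(r"\bkent\d+\b")


def find_document_id(email_body: str) -> Optional[str]:
    """Return the first ETD document ID found in the email text, or None."""
    match = DOC_ID_PATTERN.search(email_body)
    return match.group(0) if match else None


# Invented sample text; the real notification format is not shown in the slides.
sample = "A new ETD has been published. Accession number: kent1155924832"
doc_id = find_document_id(sample)
```

The ID found this way is what the later steps pass to the ETD Center to retrieve the metadata record.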
3. Retrieves metadata
http://www.ohiolink.edu/etd/oai.php?verb=GetRecord&metadataPrefix=oai_etdms&identifier=kent1155924832
4. Parses record for useful data
• Extracts data from XML file
  • Used regular expressions
  • There are other ways to do it!
• Cataloging bot polishes raw material into a provisional bibliographic record:
  • Translates some character entities
  • Takes out smart quotes
  • Replaces odd characters, like “&ndash” for numbers
  • Normalizes most capitalization, as when a title is in all CAPITAL letters
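The extraction and cleanup described above might be sketched in Python as follows. This is an illustration only, not the Bot's actual (Perl) code: the function names are hypothetical, the element name passed to `extract_field` is an assumption about the XML, and the capitalization rule is a deliberately crude sentence-case fallback.

```python
import html
import re
from typing import Optional

# Map curly quotes to their plain ASCII equivalents.
SMART_QUOTES = str.maketrans(
    {"\u2018": "'", "\u2019": "'", "\u201c": '"', "\u201d": '"'}
)


def extract_field(xml: str, tag: str) -> Optional[str]:
    """Pull the text of the first <tag>...</tag> element using a regular expression."""
    pattern = rf"<{re.escape(tag)}(?:\s[^>]*)?>(.*?)</{re.escape(tag)}>"
    match = re.search(pattern, xml, re.DOTALL)
    return match.group(1).strip() if match else None


def clean_text(raw: str) -> str:
    """Translate character entities, remove smart quotes, and replace en dashes."""
    text = html.unescape(raw)            # e.g. "&amp;" -> "&", "&ndash;" -> en dash
    text = text.translate(SMART_QUOTES)  # curly quotes -> straight quotes
    return text.replace("\u2013", "-")   # en dash -> hyphen, as in number ranges


def normalize_caps(title: str) -> str:
    """Convert an ALL-CAPS title to sentence case; leave mixed-case titles alone."""
    return title.capitalize() if title.isupper() else title


record = "<record><title>MORPHING METADATA</title></record>"
title = normalize_caps(clean_text(extract_field(record, "title") or ""))
```

Sentence-casing lowercases proper nouns too, which is one reason the result is only a provisional record that the cataloger later upgrades.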
5. Builds MARC Record
• Miscellaneous reformatting
  • Splits subtitle from title
  • Adds GMD
  • Adds placeholder fields for pagination, etc.
  • Generates non-filing indicators for English
• Local standards document
  • “Constant” data and formatting
• MARC/perl module
  • Library for easy processing of MARC records
  • http://marcpm.sourceforge.net/
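As a rough illustration of the title handling in this step, the Python sketch below splits a subtitle, adds a GMD, and computes the non-filing indicator for English articles. It uses a plain dictionary rather than the MARC/perl module the Bot relies on. The function names, the first indicator value "1", and the GMD text "[electronic resource]" are assumptions for illustration; the slides do not give these details.

```python
# English initial articles, each with its trailing space, for non-filing counts.
ARTICLES = ("a ", "an ", "the ")


def nonfiling_count(title: str) -> int:
    """Number of characters to skip when filing (leading English article plus space)."""
    lowered = title.lower()
    for article in ARTICLES:
        if lowered.startswith(article):
            return len(article)
    return 0


def split_title(full_title: str) -> tuple[str, str]:
    """Split "Title: subtitle" at the first colon; subtitle is "" if there is none."""
    main, _, sub = full_title.partition(":")
    return main.strip(), sub.strip()


def build_245(full_title: str) -> dict:
    """Build a provisional 245 field as a plain dict (not a real MARC object)."""
    main, sub = split_title(full_title)
    gmd = "[electronic resource]"  # assumed GMD wording
    subfields = [("a", main), ("h", gmd + (" :" if sub else ""))]
    if sub:
        subfields.append(("b", sub))
    return {
        "tag": "245",
        "ind1": "1",  # assumed: title added entry
        "ind2": str(nonfiling_count(main)),
        "subfields": subfields,
    }


field = build_245("The Morphing Metadata: a highly automated method")
```

Placeholder fields, such as pagination, would be appended in the same way and filled in by the cataloger.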
6. Sends MARC Record to Catalog
• Direct communication between Bot and local system
  • “OCLC” interactive interface on III system
  • Perl Cookbook, recipe 17.10 (bidirectional forking client)
• Record download a possible alternative
  • Import to OCLC or local system
  • Manual process
7. Sends email notification: ETD has MARC record in the local catalog
Once the Bot’s work ends, the Cataloger…
• Exports provisional record from KentLINK to OCLC save file
• Upgrades to full record
• Contributes record to WorldCat
• Overlays local catalog’s provisional record with OCLC full record
Sources
• OhioLINK ETD Center: http://www.ohiolink.edu/etd/
• OhioLINK ETD Cataloging Standards: http://platinum.ohiolink.edu/dms/catstandards/etd.pdf
• NDLTD: http://www.ndltd.org/ (promotes dissemination and use of ETDs; documentation section includes OhioLINK and KSU materials: 1. OhioLINK cataloging standards; and 2. ETD Cataloging Checklist / Sevim McCutcheon)
• OCLC Bibliographic Formats and Standards: http://www.oclc.org/bibformats/default.htm
Any questions? Want to implement? Sevim McCutcheon, Cataloging issues Lmccutch@kent.edu 330-672-1703 Mike Kreyche, Technology issues mkreyche@kent.edu 330-672-1918